20,903 research outputs found
Learning from accidents : machine learning for safety at railway stations
In railway systems, station safety is a critical aspect of the overall structure, and yet, accidents at stations still occur. It is time to learn from these errors and improve conventional methods by utilizing the latest technology, such as machine learning (ML), to analyse accidents and enhance safety systems. ML has been employed in many fields, including engineering systems, and it interacts with us throughout our daily lives. Thus, we must consider the available technology in general and ML in particular in the context of safety
in the railway industry. This paper explores the employment of the decision tree (DT) method in safety classification and the analysis of accidents at railway stations to predict the traits of passengers affected by accidents. The critical contribution of this study is the presentation of ML and an explanation of how this technique is applied for ensuring safety, utilizing automated processes, and gaining benefits from this powerful technology. To apply and explore this method, a case study has been selected that focuses on the fatalities caused by accidents at railway stations. An analysis of some of these fatal accidents as reported by the Rail Safety and Standards Board (RSSB) is performed and presented in this paper to provide a broader summary of the application of supervised ML for improving safety at railway stations. Finally, this research shows the vast potential of the innovative application of ML in safety analysis for the railway industry
Curriculum Guidelines for Undergraduate Programs in Data Science
The Park City Math Institute (PCMI) 2016 Summer Undergraduate Faculty Program
met for the purpose of composing guidelines for undergraduate programs in Data
Science. The group consisted of 25 undergraduate faculty from a variety of
institutions in the U.S., primarily from the disciplines of mathematics,
statistics and computer science. These guidelines are meant to provide some
structure for institutions planning for or revising a major in Data Science
Machine learning applied to enzyme turnover numbers reveals protein structural correlates and improves metabolic models.
Knowing the catalytic turnover numbers of enzymes is essential for understanding the growth rate, proteome composition, and physiology of organisms, but experimental data on enzyme turnover numbers is sparse and noisy. Here, we demonstrate that machine learning can successfully predict catalytic turnover numbers in Escherichia coli based on integrated data on enzyme biochemistry, protein structure, and network context. We identify a diverse set of features that are consistently predictive for both in vivo and in vitro enzyme turnover rates, revealing novel protein structural correlates of catalytic turnover. We use our predictions to parameterize two mechanistic genome-scale modelling frameworks for proteome-limited metabolism, leading to significantly higher accuracy in the prediction of quantitative proteome data than previous approaches. The presented machine learning models thus provide a valuable tool for understanding metabolism and the proteome at the genome scale, and elucidate structural, biochemical, and network properties that underlie enzyme kinetics
A review of key planning and scheduling in the rail industry in Europe and UK
Planning and scheduling activities within the rail industry have benefited from developments in computer-based simulation and modelling techniques over the last 25 years. Increasingly, the use of computational intelligence in such tasks is featuring more heavily in research publications. This paper examines a number of common rail-based planning and scheduling activities and how they benefit from five broad technology approaches. Summary tables of papers are provided relating to rail planning and scheduling activities and to the use of expert and decision systems in the rail industry.EPSR
A predictive maintenance approach based in big data analysis
With the evolution of information systems, the data flow escalated into new boundaries, allowing enterprises to further develop their approach to important sectors, such as production, logistic, IT and especially maintenance. This last field accompanied industry developments hand in hand in each of the four iterations. More specifically, the fourth iteration (Industry 4.0) marked the capability to connect machines and further enhance data extraction, which allowed companies to use a new data-driven approach into their specific problems. Nevertheless, with a wider flow of data being generated, understanding data became a priority for maintenance-related decision-making processes. Therefore, the correct elaboration of a roadmap to apply predictive maintenance (PM) is a key step for companies. A roadmap can allow a safe approach, where resources may be placed strategically with a ratio of low risk and high reward. By analysing multiple approaches to PM, a generic model is proposed, which contains an array of guidelines. This combination aims to assist maintenance departments that wish to understand the feasibility of implementing a predictive maintenance solution in their company. To understand the utility of the developed artefact, a practical application was conducted to a production line of HFA, a Portuguese Small and Medium Enterprise.Através da evolução dos sistemas de informação (SI), o fluxo de dados atingiu novos limites, permitindo assim às empresas desenvolver diferentes focos e aplicar novas perspetivas nos departamentos fulcrais à sua atividade, tais como produção, logística e, mais especificamente, a manutenção. Esta última componente evolui paralelamente à indústria, evidenciando novos desenvolvimentos em cada iteração da mesma. Particularmente, a quarta revolução industrial destacou-se pela capacidade de conectar máquinas entre si e pela evolução posterior do processo de extração de dados. Assim, surgiu uma nova perspetiva focada na utilização dos dados extraídos para resolução de problemas. Consequentemente, esta inovação fomentou uma redefinição das prioridades nas decisões tomadas relativas à manutenção, dando primazia à compreensão dos dados gerados. Por conseguinte, a correta elaboração de um plano de implementação de manutenção preditiva (MP) destaca-se como um passo fulcral para as empresas. Este plano tem como objetivo permitir uma abordagem mais segura, possibilitando assim alocar os recursos estrategicamente, reduzindo o risco e potenciando a recompensa. Mediante a análise de múltiplas abordagens de MP, é proposto um modelo genérico que reúne um conjunto diretrizes. Este tem intuito de auxiliar os departamentos de manutenção que pretendem compreender a viabilidade da instalação de uma solução de MP na empresa. A fim de perceber a utilidade dos artefactos desenvolvidos, foi realizada uma aplicação prática do modelo numa pequena e média empresa (PME)
PORÓWNANIE SKUTECZNOŚCI ALGORYTMÓW UCZENIA MASZYNOWEGO DLA KONSERWACJI PREDYKCYJNEJ
The consequences of failures and unscheduled maintenance are the reasons why engineers have been trying to increase the reliability of industrial equipment for years. In modern solutions, predictive maintenance is a frequently used method. It allows to forecast failures and alert about their possibility. This paper presents a summary of the machine learning algorithms that can be used in predictive maintenance and comparison of their performance. The analysis was made on the basis of data set from Microsoft Azure AI Gallery. The paper presents a comprehensive approach to the issue including feature engineering, preprocessing, dimensionality reduction techniques, as well as tuning of model parameters in order to obtain the highest possible performance. The conducted research allowed to conclude that in the analysed case , the best algorithm achieved 99.92% accuracy out of over 122 thousand test data records. In conclusion, predictive maintenance based on machine learning represents the future of machine reliability in industry.Skutki związane z awariami oraz niezaplanowaną konserwacją to powody, dla których od lat inżynierowie próbują zwiększyć niezawodność osprzętu przemysłowego. W nowoczesnych rozwiązaniach obok tradycyjnych metod stosowana jest również tzw. konserwacja predykcyjna, która pozwala przewidywać awarie i alarmować o możliwości ich powstawania. W niniejszej pracy przedstawiono zestawienie algorytmów uczenia maszynowego, które można zastosować w konserwacji predykcyjnej oraz porównanie ich skuteczności. Analizy dokonano na podstawie zbioru danych Azure AI Gallery udostępnionych przez firmę Microsoft. Praca przedstawia kompleksowe podejście do analizowanego zagadnienia uwzględniające wydobywanie cech charakterystycznych, wstępne przygotowanie danych, zastosowanie technik redukcji wymiarowości, a także dostrajanie parametrów poszczególnych modeli w celu uzyskania najwyższej możliwej skuteczności. Przeprowadzone badania pozwoliły wskazać najlepszy algorytm, który uzyskał dokładność na poziomie 99,92%, spośród ponad 122 tys. rekordów danych testowych. Na podstawie tego można stwierdzić, że konserwacja predykcyjna prowadzona w oparciu o uczenie maszynowe stanowi przyszłość w zakresie podniesienia niezawodności maszyn w przemyśle
Towards Automated Performance Bug Identification in Python
Context: Software performance is a critical non-functional requirement,
appearing in many fields such as mission critical applications, financial, and
real time systems. In this work we focused on early detection of performance
bugs; our software under study was a real time system used in the
advertisement/marketing domain.
Goal: Find a simple and easy to implement solution, predicting performance
bugs.
Method: We built several models using four machine learning methods, commonly
used for defect prediction: C4.5 Decision Trees, Na\"{\i}ve Bayes, Bayesian
Networks, and Logistic Regression.
Results: Our empirical results show that a C4.5 model, using lines of code
changed, file's age and size as explanatory variables, can be used to predict
performance bugs (recall=0.73, accuracy=0.85, and precision=0.96). We show that
reducing the number of changes delivered on a commit, can decrease the chance
of performance bug injection.
Conclusions: We believe that our approach can help practitioners to eliminate
performance bugs early in the development cycle. Our results are also of
interest to theoreticians, establishing a link between functional bugs and
(non-functional) performance bugs, and explicitly showing that attributes used
for prediction of functional bugs can be used for prediction of performance
bugs
Recommended from our members
Temporal Bayesian classifiers for modelling muscular dystrophy expression data
The analysis of microarray data from time-series experiments requires specialised algorithms, which take the temporal ordering of the data into account. In this paper we explore a new architecture of Bayesian classifier that can be used to understand how biological mechanisms differ with respect to time. We show that this classifier improves the classification of microarray data and at the same time ensures that the models can easily be analysed by biologists by incorporating time transparently. In this paper we focus on data that has been generated to explore different types of muscular dystrophy
- …