6,512 research outputs found

    Data-driven Models for Remaining Useful Life Estimation of Aircraft Engines and Hard Disk Drives

    Get PDF
    Failure of physical devices can cause inconvenience, loss of money, and sometimes even deaths. To improve the reliability of these devices, we need to know the remaining useful life (RUL) of a device at a given point in time. Data-driven approaches use data from a physical device to build a model that can estimate the RUL. They have shown great performance and are often simpler than traditional model-based approaches. Typical statistical and machine learning approaches are often not suited for sequential data prediction. Recurrent Neural Networks are designed to work with sequential data but suffer from the vanishing gradient problem over time. Therefore, I explore the use of Long Short-Term Memory (LSTM) networks for RUL prediction. I perform two experiments. First, I train bidirectional LSTM networks on the Backblaze hard-disk drive dataset. I achieve an accuracy of 96.4\% on a 60 day time window, state-of-the-art performance. Additionally, I use a unique standardization method that standardizes each hard drive instance independently and explore the benefits and downsides of this approach. Finally, I train LSTM models on the NASA N-CMAPSS dataset to predict aircraft engine remaining useful life. I train models on each of the eight sub-datasets, achieving a RMSE of 6.304 on one of the sub-datasets, the second-best in the current literature. I also compare an LSTM network\u27s performance to the performance of a Random Forest and Temporal Convolutional Neural Network model, demonstrating the LSTM network\u27s superior performance. I find that LSTM networks are capable predictors for device remaining useful life and show a thorough model development process that can be reproduced to develop LSTM models for various RUL prediction tasks. These models will be able to improve the reliability of devices such as aircraft engines and hard-disk drives

    A survey of machine learning techniques applied to self organizing cellular networks

    Get PDF
    In this paper, a survey of the literature of the past fifteen years involving Machine Learning (ML) algorithms applied to self organizing cellular networks is performed. In order for future networks to overcome the current limitations and address the issues of current cellular systems, it is clear that more intelligence needs to be deployed, so that a fully autonomous and flexible network can be enabled. This paper focuses on the learning perspective of Self Organizing Networks (SON) solutions and provides, not only an overview of the most common ML techniques encountered in cellular networks, but also manages to classify each paper in terms of its learning solution, while also giving some examples. The authors also classify each paper in terms of its self-organizing use-case and discuss how each proposed solution performed. In addition, a comparison between the most commonly found ML algorithms in terms of certain SON metrics is performed and general guidelines on when to choose each ML algorithm for each SON function are proposed. Lastly, this work also provides future research directions and new paradigms that the use of more robust and intelligent algorithms, together with data gathered by operators, can bring to the cellular networks domain and fully enable the concept of SON in the near future

    HETEROGENEOUS DATA AND PROBABILISTIC SYSTEM MODEL ANALYSES FOR ENHANCED SITUATIONAL AWARENESS AND RESILIENCE OF CRITICAL INFRASTRUCTURE SYSTEMS

    Get PDF
    The protection and resilience of critical infrastructure systems (CIS) are essential for public safety in daily operations and times of crisis and for community preparedness to hazard events. Increasing situational awareness and resilience of CIS includes both comprehensive monitoring of CIS and their surroundings, as well as evaluating CIS behaviors in changing conditions and with different system configurations. Two frameworks for increasing the monitoring capabilities of CIS are presented. The proposed frameworks are (1) a process for classifying social media big data for monitoring CIS and hazard events and (2) a framework for integrating heterogeneous data sources, including social media, using Bayesian inference to update prior probabilities of event occurrence. Applications of both frameworks are presented, including building and evaluating text-based machine learning classifiers for identifying CIS damages and integrating disparate data sources to estimate hazards and CIS damages. Probabilistic analyses of CIS vulnerabilities with varying system parameters and topologies are also presented. In a water network, the impact of varying parameters on component performance is evaluated. In multiple, small-size water networks, the impacts of system topology are assessed to identify characteristics of more resilient networks. This body of work contributes insights and methods for monitoring CIS and assessing their performance. Integrating heterogeneous data sources increases situational awareness of CIS, especially during or after failure events, and evaluating the sensitivity of CIS outcomes to changes in the network facilitates decisions for CIS investments and emergency response.Ph.D

    Data analytics 2016: proceedings of the fifth international conference on data analytics

    Get PDF

    An early-stage decision-support framework for the implementation of intelligent automation

    Get PDF
    The constant pressure on manufacturing companies to improve productivity, reduce the lead time and progress in quality requires new technological developments and adoption.The rapid development of smart technology and robotics and autonomous systems (RAS) technology has a profound impact on manufacturing automation and might determine winners and losers of the next generation’s manufacturing competition. Simultaneously, recent smart technology developments in the areas enable an automation response to new production paradigms such as mass customisation and product-lifecycle considerations in the context of Industry 4.0. New paradigms, like mass customisation, increased both the complexity of the tasks and the risk due to smart technology integration. From a manufacturing automation perspective, intelligent automation has been identified as a possible response to arising demands. The presented research aims to support the industrial uptake of intelligent automation into manufacturing businesses by quantifying risks at the early design stage and business case development. An early-stage decision-support framework for the implementation of intelligent automation in manufacturing businesses is presented in this thesis.The framework is informed by an extensive literature review, updated and verified with surveys and workshops to add to the knowledge base due to the rapid development of the associated technologies. A paradigm shift from cost to a risk-modelling perspective is proposed to provide a more flexible and generic approach applicable throughout the current technology landscape. The proposed probabilistic decision-support framework consists of three parts:• A clustering algorithm to identify the manufacturing functions in manual processes from task analysis to mitigate early-stage design uncertainties• A Bayesian Belief Network (BBN) informed by an expert elicitation via the DELPHI method, where the identified functions become the unit of analysis.• A Markov-Chain Monte-Carlo method modelling the effects of uncertainties on the critical success factors to address issues of factor interdependencies after expert elicitation.Based on the overall decision framework a toolbox was developed in Microsoft Excel. Five different case studies are used to test and validate the framework. Evaluation of the results derived from the toolbox from the industrial feedback suggests a positive validation for commercial use. The main contributions to knowledge in the presented thesis arise from the following four points:• Early-stage decision-support framework for business case evaluation of intelligent automation.• Translating manual tasks to automation function via a novel clustering approach• Application of a Markov-Chain Monte-Carlo Method to simulate correlation between decision criteria• Causal relationship among Critical Success Factors has been established from business and technical perspectives.The implications on practise might be promising. The feedback arising from the created tool was promising from the industry, and a practical realisation of the decision-support tool seems to be desired from an industrial point of view.With respect to further work, the decision-support tool might have established a ground to analyse a human task automatically for automation purposes. The established clustering mechanisms and the related attributes could be connected to sensorial data and analyse a manufacturing task autonomously without the subjective input of task analysis experts. To enable such an autonomous process, however, the psychophysiological understanding must be increased in the future.</div

    Performance Evaluation of Smart Decision Support Systems on Healthcare

    Get PDF
    Medical activity requires responsibility not only from clinical knowledge and skill but also on the management of an enormous amount of information related to patient care. It is through proper treatment of information that experts can consistently build a healthy wellness policy. The primary objective for the development of decision support systems (DSSs) is to provide information to specialists when and where they are needed. These systems provide information, models, and data manipulation tools to help experts make better decisions in a variety of situations. Most of the challenges that smart DSSs face come from the great difficulty of dealing with large volumes of information, which is continuously generated by the most diverse types of devices and equipment, requiring high computational resources. This situation makes this type of system susceptible to not recovering information quickly for the decision making. As a result of this adversity, the information quality and the provision of an infrastructure capable of promoting the integration and articulation among different health information systems (HIS) become promising research topics in the field of electronic health (e-health) and that, for this same reason, are addressed in this research. The work described in this thesis is motivated by the need to propose novel approaches to deal with problems inherent to the acquisition, cleaning, integration, and aggregation of data obtained from different sources in e-health environments, as well as their analysis. To ensure the success of data integration and analysis in e-health environments, it is essential that machine-learning (ML) algorithms ensure system reliability. However, in this type of environment, it is not possible to guarantee a reliable scenario. This scenario makes intelligent SAD susceptible to predictive failures, which severely compromise overall system performance. On the other hand, systems can have their performance compromised due to the overload of information they can support. To solve some of these problems, this thesis presents several proposals and studies on the impact of ML algorithms in the monitoring and management of hypertensive disorders related to pregnancy of risk. The primary goals of the proposals presented in this thesis are to improve the overall performance of health information systems. In particular, ML-based methods are exploited to improve the prediction accuracy and optimize the use of monitoring device resources. It was demonstrated that the use of this type of strategy and methodology contributes to a significant increase in the performance of smart DSSs, not only concerning precision but also in the computational cost reduction used in the classification process. The observed results seek to contribute to the advance of state of the art in methods and strategies based on AI that aim to surpass some challenges that emerge from the integration and performance of the smart DSSs. With the use of algorithms based on AI, it is possible to quickly and automatically analyze a larger volume of complex data and focus on more accurate results, providing high-value predictions for a better decision making in real time and without human intervention.A atividade médica requer responsabilidade não apenas com base no conhecimento e na habilidade clínica, mas também na gestão de uma enorme quantidade de informações relacionadas ao atendimento ao paciente. É através do tratamento adequado das informações que os especialistas podem consistentemente construir uma política saudável de bem-estar. O principal objetivo para o desenvolvimento de sistemas de apoio à decisão (SAD) é fornecer informações aos especialistas onde e quando são necessárias. Esses sistemas fornecem informações, modelos e ferramentas de manipulação de dados para ajudar os especialistas a tomar melhores decisões em diversas situações. A maioria dos desafios que os SAD inteligentes enfrentam advêm da grande dificuldade de lidar com grandes volumes de dados, que é gerada constantemente pelos mais diversos tipos de dispositivos e equipamentos, exigindo elevados recursos computacionais. Essa situação torna este tipo de sistemas suscetível a não recuperar a informação rapidamente para a tomada de decisão. Como resultado dessa adversidade, a qualidade da informação e a provisão de uma infraestrutura capaz de promover a integração e a articulação entre diferentes sistemas de informação em saúde (SIS) tornam-se promissores tópicos de pesquisa no campo da saúde eletrônica (e-saúde) e que, por essa mesma razão, são abordadas nesta investigação. O trabalho descrito nesta tese é motivado pela necessidade de propor novas abordagens para lidar com os problemas inerentes à aquisição, limpeza, integração e agregação de dados obtidos de diferentes fontes em ambientes de e-saúde, bem como sua análise. Para garantir o sucesso da integração e análise de dados em ambientes e-saúde é importante que os algoritmos baseados em aprendizagem de máquina (AM) garantam a confiabilidade do sistema. No entanto, neste tipo de ambiente, não é possível garantir um cenário totalmente confiável. Esse cenário torna os SAD inteligentes suscetíveis à presença de falhas de predição que comprometem seriamente o desempenho geral do sistema. Por outro lado, os sistemas podem ter seu desempenho comprometido devido à sobrecarga de informações que podem suportar. Para tentar resolver alguns destes problemas, esta tese apresenta várias propostas e estudos sobre o impacto de algoritmos de AM na monitoria e gestão de transtornos hipertensivos relacionados com a gravidez (gestação) de risco. O objetivo das propostas apresentadas nesta tese é melhorar o desempenho global de sistemas de informação em saúde. Em particular, os métodos baseados em AM são explorados para melhorar a precisão da predição e otimizar o uso dos recursos dos dispositivos de monitorização. Ficou demonstrado que o uso deste tipo de estratégia e metodologia contribui para um aumento significativo do desempenho dos SAD inteligentes, não só em termos de precisão, mas também na diminuição do custo computacional utilizado no processo de classificação. Os resultados observados buscam contribuir para o avanço do estado da arte em métodos e estratégias baseadas em inteligência artificial que visam ultrapassar alguns desafios que advêm da integração e desempenho dos SAD inteligentes. Como o uso de algoritmos baseados em inteligência artificial é possível analisar de forma rápida e automática um volume maior de dados complexos e focar em resultados mais precisos, fornecendo previsões de alto valor para uma melhor tomada de decisão em tempo real e sem intervenção humana
    corecore