Novel deep cross-domain framework for fault diagnosis of rotary machinery in prognostics and health management

Author: Rezaeianjouybari Behnoush
Publication venue: 'University of Missouri Libraries'

Improving the reliability of engineered systems is a crucial problem in many engineering fields, such as the aerospace, nuclear energy, and water desalination industries. This requires efficient and effective system health monitoring methods, including processing and analyzing massive machinery data to detect anomalies and to perform diagnosis and prognosis. In recent years, deep learning has grown rapidly and has shown promising results for Prognostics and Health Management (PHM) in interpreting condition monitoring signals such as vibration, acoustic emission, and pressure, owing to its capacity to mine complex representations from raw data. This doctoral research provides a systematic review of state-of-the-art deep learning-based PHM frameworks, an empirical analysis on bearing fault diagnosis benchmarks, and a novel multi-source domain adaptation framework. The review emphasizes the most recent trends in the field and presents the benefits and potential of state-of-the-art deep neural networks for system health management. It also discusses the limitations and challenges of existing technologies, which point to opportunities for future research. The empirical study reports evaluation results of existing models on bearing fault diagnosis benchmark datasets in terms of performance metrics such as accuracy and training time; these results serve as a baseline for comparing and testing new models. A novel multi-source domain adaptation framework for fault diagnosis of rotary machinery is also proposed, which aligns the domains at both the feature level and the task level. The proposed framework transfers knowledge from multiple labeled source domains to a single unlabeled target domain by reducing the feature distribution discrepancy between the target domain and each source domain. The model can be easily reduced to the single-source domain adaptation setting, and it can be readily adapted to unsupervised domain adaptation problems in other fields such as image classification and image segmentation. Further, the proposed model is extended with a novel conditional weighting mechanism that aligns the class-conditional probabilities of the domains and reduces the effect of irrelevant source domains, a critical issue in multi-source domain adaptation algorithms. The experimental results show the superiority of the proposed framework over state-of-the-art multi-source domain adaptation models.
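
The idea of measuring a feature distribution discrepancy between the target domain and each source domain, and of down-weighting irrelevant sources, can be illustrated with a minimal sketch. This is not the thesis's method: the abstract does not name its discrepancy measure or its weighting rule. The sketch assumes a linear-kernel maximum mean discrepancy (the squared distance between mean feature vectors) and a softmax over negative discrepancies, and the names `linear_mmd` and `source_weights` are hypothetical.

```python
import math


def mean_vector(features):
    """Column-wise mean of a list of equal-length feature vectors."""
    n = len(features)
    return [sum(col) / n for col in zip(*features)]


def linear_mmd(source, target):
    """Linear-kernel MMD^2: squared distance between the domain means.

    Assumption: the source abstract does not specify its discrepancy measure.
    """
    mu_s, mu_t = mean_vector(source), mean_vector(target)
    return sum((a - b) ** 2 for a, b in zip(mu_s, mu_t))


def source_weights(sources, target):
    """Softmax over negative discrepancies, so closer sources weigh more.

    Assumption: a simple stand-in for the conditional weighting mechanism.
    """
    discrepancies = [linear_mmd(s, target) for s in sources]
    exps = [math.exp(-d) for d in discrepancies]
    total = sum(exps)
    return [e / total for e in exps]


# Toy features: one source close to the target, one far from it.
target = [[0.0, 1.0], [0.2, 0.8]]
near_source = [[0.1, 0.9], [0.1, 1.1]]
far_source = [[3.0, -2.0], [2.8, -1.8]]
weights = source_weights([near_source, far_source], target)
```

In this toy setting the distant source receives a much smaller weight than the nearby one, which mirrors the stated goal of reducing the influence of irrelevant source domains.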

University of Missouri: MOspace

How to Do Machine Learning with Small Data? -- A Review from an Industrial Perspective

Author: Ivanov Dmitrij
Ju Yong Chul
Kraljevski Ivan
Tschöpe Constanze
Wolff Matthias
Publication date: 13/11/2023

Artificial intelligence has experienced a technological breakthrough in science, industry, and everyday life in recent decades. The advancements can be credited to the ever-increasing availability and miniaturization of computational resources, which resulted in exponential data growth. However, when the available data are insufficient, employing machine learning to solve complex tasks is not straightforward and may not even be possible. As a result, machine learning with small data is gaining importance in data science and in applications across several fields. The authors focus on interpreting the general term "small data" and its role in engineering and industrial applications. They give a brief overview of the most important industrial applications of machine learning and small data. Small data is defined in terms of various characteristics compared to big data, and a machine learning formalism is introduced. Five critical challenges of machine learning with small data in industrial applications are presented: unlabeled data, imbalanced data, missing data, insufficient data, and rare events. Based on those definitions, an overview of the considerations in domain representation and data acquisition is given, along with a taxonomy of machine learning approaches in the context of small data.

arXiv.org e-Print Archive

A Deep Learning Approach for Fusing Sensor Data from Screw Compressors

Author: Alonso Castro Serafín
Diaz Blanco Ignacio
Domínguez González Manuel
Fuertes Martínez Juan José
Morán Álvarez Antonio
Pérez López Daniel
Publication venue: MDPI
Publication date: 16/04/2024

Chillers are commonly used for thermal regulation to maintain indoor comfort in medium and large buildings. However, inefficiencies in this process produce significant losses, and optimization tasks are limited by restricted access to the system. Data analysis techniques transform measurements coming from several sensors into useful information. Recent deep learning approaches have achieved excellent results in many applications. These techniques can be used to compute new data representations that provide comprehensive information about the device. This allows real-time monitoring, in which that information is checked against the current operating state to detect any type of anomaly in the process. In this work, a model based on a 1D convolutional neural network is proposed for fusing data in order to predict four different control stages of a screw compressor in a chiller. The method was evaluated using real data from a chiller in a hospital building. Results show satisfactory performance and an acceptable training time in comparison with other recent methods. In addition, the model is capable of predicting the control states of screw compressors other than the one used in training. Furthermore, two failure cases are simulated, in which an early alarm is raised when the model produces a sustained run of wrong classifications. This research was funded by the Spanish Ministry of Science and Innovation and the European Regional Development Fund under project DPI2015-69891-C2-1-R/2-R.
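
The early alarm described at the end of the abstract, raised when the model keeps misclassifying the control stage, can be sketched as a run-length check. This is an illustrative sketch only: the function name `first_alarm_index`, the `patience` threshold, and the toy stage sequences are assumptions, and the paper's actual alarm rule is not given in the abstract.

```python
def first_alarm_index(predicted, expected, patience=3):
    """Return the first index at which `patience` consecutive mismatches
    between predicted and expected control stages have occurred, or None
    if no such run exists.

    Assumption: `expected` is the control stage reported by the controller.
    """
    run = 0
    for i, (p, e) in enumerate(zip(predicted, expected)):
        run = run + 1 if p != e else 0  # reset the run on any correct prediction
        if run >= patience:
            return i
    return None


# Toy sequences over four control stages (0..3).
expected = [0, 0, 1, 1, 2, 2, 2, 3, 3]
predicted = [0, 0, 1, 0, 2, 1, 1, 1, 3]
alarm_at = first_alarm_index(predicted, expected, patience=3)
```

An isolated mismatch, such as the one at index 3 above, resets without raising an alarm; only a sustained run does. A larger `patience` trades detection delay for fewer false alarms.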

Leon University (Spain)

Application of data analytics for predictive maintenance in aerospace: an approach to imbalanced learning.

Author: Dangut Maren David
Publication date: 15/07/2022

The use of aircraft operational logs to predict potential failures that may lead to disruption poses many challenges and has yet to be fully explored. These logs are captured during each flight and contain streamed data from various aircraft subsystems relating to status and warning indicators. They may, therefore, be regarded as complex multivariate time-series data. Given that aircraft are high-integrity assets, failures are extremely rare, and hence the distribution of relevant data containing prior indicators will be highly skewed toward the normal (healthy) case. This presents a significant challenge for data-driven techniques that must learn the relationships and patterns that depict fault scenarios, since the model will be biased toward the heavily weighted no-fault outcomes. This thesis aims to develop a predictive model for aircraft component failure utilising data from the aircraft central maintenance system (ACMS). The initial objective is to determine the suitability of the ACMS data for predictive maintenance modelling. An exploratory analysis of the data revealed several inherent irregularities, including an extreme data imbalance problem, irregular patterns and trends, class overlapping, and small class disjunct, all of which are significant drawbacks for traditional machine learning algorithms, resulting in low-performance models. Four novel advanced imbalanced classification techniques are developed to handle the identified data irregularities. The first algorithm focuses on pattern extraction and uses bootstrapping to oversample the minority class; the second algorithm employs the balanced calibrated hybrid ensemble technique to overcome class overlapping and small class disjunct; the third algorithm uses a derived loss function and a new network architecture to handle extremely imbalanced ratios in deep neural networks; and finally, a deep reinforcement learning approach for imbalanced classification problems in log-based datasets is developed. An ACMS dataset and its accompanying maintenance records were used to validate the proposed algorithms. The overall finding indicates that an advanced method for handling extremely imbalanced problems using the log-based ACMS datasets is viable for developing robust data-driven predictive maintenance models for aircraft component failure. When the four implementations were compared, deep reinforcement learning (DRL) strategies, specifically the proposed double deep state-action-reward-state-action with prioritised experience replay memory (DDSARSA+PER), outperformed the other methods in terms of false-positive and false-negative rates for all the components considered. The validation result further suggests that the DDSARSA+PER model is capable of predicting around 90% of aircraft component replacements with a 0.005 false-negative rate in both the A330 and A320 aircraft families studied in this research.

PhD in Transport System
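
The first of the four techniques uses bootstrapping to oversample the minority class. A minimal sketch of plain bootstrap oversampling (random resampling with replacement) is given here for orientation. It covers only the resampling step under a binary-label assumption and omits the thesis's pattern extraction; the function name `bootstrap_oversample` and the toy data are hypothetical.

```python
import random


def bootstrap_oversample(samples, labels, minority_label, seed=0):
    """Resample minority-class examples with replacement until the minority
    class is as large as the rest of the data.

    Assumption: binary labels; this is generic random oversampling, not the
    thesis's full algorithm.
    """
    rng = random.Random(seed)
    minority = [s for s, y in zip(samples, labels) if y == minority_label]
    if not minority:
        raise ValueError("no minority-class samples to resample")
    majority_count = sum(1 for y in labels if y != minority_label)
    deficit = max(0, majority_count - len(minority))
    extra = [rng.choice(minority) for _ in range(deficit)]
    return samples + extra, labels + [minority_label] * deficit


# Toy data: five healthy records (label 0) and one failure record (label 1).
samples = [[0.1], [0.2], [0.3], [0.4], [0.5], [9.0]]
labels = [0, 0, 0, 0, 0, 1]
balanced_x, balanced_y = bootstrap_oversample(samples, labels, minority_label=1)
```

Resampling should be applied to the training split only, so that duplicated minority records do not leak into validation or test data.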

Cranfield CERES

Ensemble Feature Learning-Based Event Classification for Cyber-Physical Security of the Smart Grid

Author: Hu Chengming
Publication date: 01/09/2019

Power grids are transforming into the cyber-physical smart grid, with increasing two-way communications and abundant data flows. Despite the efficiency and reliability promised by this transformation, the growing threats and incidents of cyber attacks targeting the physical power systems have exposed severe vulnerabilities. To tackle such vulnerabilities, intrusion detection systems (IDS) are proposed to monitor threats for the cyber-physical security of electrical power and energy systems in the smart grid with increasing machine-to-machine communication. However, the multi-sourced, correlated, and often noisy data, which record various concurrent cyber and physical events, pose significant challenges for an IDS in accurately distinguishing inadvertent events from malicious ones. Hence, in this research, an ensemble learning-based feature learning and classification approach for the cyber-physical smart grid is designed and implemented. The contributions of this research are (i) the design, implementation and evaluation of an ensemble learning-based attack classifier using extreme gradient boosting (XGBoost) to effectively detect and identify attack threats from the heterogeneous cyber-physical information in the smart grid; (ii) the design, implementation and evaluation of a stacked denoising autoencoder (SDAE) to extract a highly representative feature space that allows reconstruction of a noise-free input from noise-corrupted perturbations; (iii) the design, implementation and evaluation of novel ensemble learning-based feature extractors that combine multiple autoencoder (AE) feature extractors and random forest base classifiers, so as to enable accurate reconstruction of each feature and reliable classification of malicious events. The simulation results validate the usefulness of the ensemble learning approach in detecting malicious events in the cyber-physical smart grid.

Concordia University Research Repository

Survey on highly imbalanced multi-class data

Author: Abdul Hamid Mohd Hakim
Mohamed Azlinah
Yusoff Marina
Publication venue: 'The Science and Information Organization'
Publication date: 01/01/2022

Machine learning technology has a massive impact on society because it offers solutions to many complicated problems such as classification, clustering analysis, and prediction, especially during the COVID-19 pandemic. Data distribution in machine learning has been an essential aspect in providing unbiased solutions. From the earliest literature published on highly imbalanced data until recently, machine learning research has focused mostly on binary classification problems. Research on highly imbalanced multi-class data remains largely unexplored, even though handling Big Data requires better analysis and predictions. This study reviews the models and techniques for handling highly imbalanced multi-class data, along with their strengths and weaknesses and related domains. Furthermore, the paper uses a statistical method to explore a case study with a severely imbalanced dataset. This article aims to (1) understand the trend of highly imbalanced multi-class data through analysis of the related literature; (2) analyze the previous and current methods of handling highly imbalanced multi-class data; (3) construct a framework for highly imbalanced multi-class data. The chosen highly imbalanced multi-class dataset is also analyzed and adapted to current machine learning methods and techniques, followed by discussions on open challenges and the future direction of highly imbalanced multi-class data. Finally, this paper presents a novel framework for highly imbalanced multi-class data. We hope this research can provide insights into the potential development of better methods or techniques to handle and manipulate highly imbalanced multi-class data.

Universiti Teknikal Malaysia Melaka (UTeM) Repository

A Novel Multiview Sampling-based Meta Self-Paced Learning Approach for Class-imbalanced Intelligent Fault Diagnosis

Author: Liu Chao
Lyu Pin
Xia Min
Yu Wenbing
Zheng Pai
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 13/10/2022

In practical machine fault diagnosis, the data samples obtained under faulty conditions are usually far fewer than those obtained under normal conditions, resulting in a class-imbalanced dataset. The existing solutions for class-imbalanced scenarios include data-level and model-level strategies, which are either subject to overgeneralization or time-consuming. To address this, this article proposes a novel multiview sampling-based meta self-paced learning approach. First, signal processing methods, such as time-domain (TD), frequency-domain (FD), and time-frequency domain (TFD) methods, are used to extract statistical features from the original data to form diverse views. Next, the meta self-paced learning technology is applied to select high-quality samples from the multiview feature data to generate a class-balanced dataset. Finally, a fault diagnosis model is trained with the obtained class-balanced dataset. The main contribution of this research is twofold: 1) the introduced multiview sampling method adaptively learns the weight in the sampling process and automatically deletes the noise samples with large loss values to improve the performance of the fault diagnosis model; and 2) the proposed meta self-paced learning approach eliminates the error caused by setting parameters manually and ensures the quality of the extracted samples. To validate its performance, a comparative study is conducted on a public dataset and on one collected from an industrial motor test platform. Five baseline methods are compared with the proposed one based on the convolutional neural network (CNN) model. Moreover, three traditional machine learning models are used to verify the quality of the generated samples. The proposed approach achieves above 90% diagnosis accuracy in the experiments, offering a new intelligent approach for the modular service application of class-imbalanced fault diagnosis.
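
The sampling idea described above, keeping low-loss samples and discarding noisy samples with large loss values so that each class contributes equally, can be sketched with the classic hard-threshold form of self-paced selection. This is a simplified stand-in, not the paper's meta self-paced method, which learns its weights adaptively. The function name `select_balanced`, the fixed `max_loss` threshold, and the toy losses are assumptions.

```python
from collections import defaultdict


def select_balanced(losses, labels, per_class, max_loss):
    """Return sorted indices of up to `per_class` lowest-loss samples per
    class, after dropping samples whose loss exceeds `max_loss`.

    Assumption: a fixed threshold replaces the paper's learned weighting.
    """
    by_class = defaultdict(list)
    for idx, (loss, label) in enumerate(zip(losses, labels)):
        if loss <= max_loss:  # treat large-loss samples as noise
            by_class[label].append((loss, idx))
    selected = []
    for label in sorted(by_class):
        easiest = sorted(by_class[label])[:per_class]
        selected.extend(idx for _, idx in easiest)
    return sorted(selected)


# Toy per-sample losses: four normal samples (one noisy) and three fault samples.
losses = [0.10, 0.30, 0.20, 5.00, 0.40, 0.15, 0.25]
labels = ["normal", "normal", "normal", "normal", "fault", "fault", "fault"]
kept = select_balanced(losses, labels, per_class=2, max_loss=1.0)
```

Here the noisy normal sample at index 3 is dropped and each class contributes two samples, which yields a small class-balanced subset.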

Aston Publications Explorer

Lancaster E-Prints