2 research outputs found
An Adaptive Oversampling Learning Method for Class-Imbalanced Fault Diagnostics and Prognostics
Data-driven fault diagnostics and prognostics suffers from class-imbalance
problem in industrial systems and it raises challenges to common machine
learning algorithms as it becomes difficult to learn the features of the
minority class samples. Synthetic oversampling methods are commonly used to
tackle these problems by generating the minority class samples to balance the
distributions between majority and minority classes. However, many of
oversampling methods are inappropriate that they cannot generate effective and
useful minority class samples according to different distributions of data,
which further complicate the process of learning samples. Thus, this paper
proposes a novel adaptive oversampling technique: EM-based Weighted Minority
Oversampling TEchnique (EWMOTE) for industrial fault diagnostics and
prognostics. The methods comprises a weighted minority sampling strategy to
identify hard-to-learn informative minority fault samples and Expectation
Maximization (EM) based imputation algorithm to generate fault samples. To
validate the performance of the proposed methods, experiments are conducted in
two real datasets. The results show that the method could achieve better
performance on not only binary class, but multi-class imbalance learning task
in different imbalance ratios than other oversampling-based baseline models.Comment: 8 page
Oversampling Adversarial Network for Class-Imbalanced Fault Diagnosis
The collected data from industrial machines are often imbalanced, which poses
a negative effect on learning algorithms. However, this problem becomes more
challenging for a mixed type of data or while there is overlapping between
classes. Class-imbalance problem requires a robust learning system which can
timely predict and classify the data. We propose a new adversarial network for
simultaneous classification and fault detection. In particular, we restore the
balance in the imbalanced dataset by generating faulty samples from the
proposed mixture of data distribution. We designed the discriminator of our
model to handle the generated faulty samples to prevent outlier and
overfitting. We empirically demonstrate that; (i) the discriminator trained
with a generator to generates samples from a mixture of normal and faulty data
distribution which can be considered as a fault detector; (ii), the quality of
the generated faulty samples outperforms the other synthetic resampling
techniques. Experimental results show that the proposed model performs well
when comparing to other fault diagnosis methods across several evaluation
metrics; in particular, coalescing of generative adversarial network (GAN) and
feature matching function is effective at recognizing faulty samples