3,988 research outputs found
Wavelet Integrated CNNs for Noise-Robust Image Classification
Convolutional Neural Networks (CNNs) are generally prone to noise
interruptions, i.e., small image noise can cause drastic changes in the output.
To suppress the effect of noise on the final prediction, we enhance CNNs by
replacing max-pooling, strided-convolution, and average-pooling with Discrete
Wavelet Transform (DWT). We present general DWT and Inverse DWT (IDWT) layers
applicable to various wavelets such as Haar, Daubechies, and Cohen, and
design wavelet integrated CNNs (WaveCNets) using these layers for image
classification. In WaveCNets, feature maps are decomposed into the
low-frequency and high-frequency components during the down-sampling. The
low-frequency component stores main information including the basic object
structures, which is transmitted into the subsequent layers to extract robust
high-level features. The high-frequency components, containing most of the data
noise, are dropped during inference to improve the noise-robustness of the
WaveCNets. Our experimental results on ImageNet and ImageNet-C (the noisy
version of ImageNet) show that WaveCNets, the wavelet integrated versions of
VGG, ResNets, and DenseNet, achieve higher accuracy and better noise-robustness
than their vanilla versions.
Comment: CVPR accepted paper
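As a rough illustration of the down-sampling idea in the abstract above, the following NumPy sketch performs a one-level 2-D Haar DWT on a single feature map and keeps only the low-frequency band. It is not the WaveCNets implementation; the function names `haar_dwt2`, `haar_idwt2`, and `wavelet_downsample` are hypothetical, and the orthonormal Haar scaling is an assumption chosen so that the inverse transform reconstructs the input exactly.

```python
import numpy as np

def haar_dwt2(x):
    """One-level 2-D orthonormal Haar DWT (hypothetical helper).

    x must have even height and width. Returns the low-frequency band
    and a tuple of three high-frequency (detail) bands, each half-size.
    """
    a = x[0::2, 0::2]  # top-left sample of every 2x2 block
    b = x[0::2, 1::2]  # top-right
    c = x[1::2, 0::2]  # bottom-left
    d = x[1::2, 1::2]  # bottom-right
    low = (a + b + c + d) / 2.0
    det1 = (a + b - c - d) / 2.0
    det2 = (a - b + c - d) / 2.0
    det3 = (a - b - c + d) / 2.0
    return low, (det1, det2, det3)

def haar_idwt2(low, details):
    """Inverse of haar_dwt2: rebuilds the full-resolution array."""
    det1, det2, det3 = details
    h, w = low.shape
    x = np.empty((2 * h, 2 * w), dtype=float)
    x[0::2, 0::2] = (low + det1 + det2 + det3) / 2.0
    x[0::2, 1::2] = (low + det1 - det2 - det3) / 2.0
    x[1::2, 0::2] = (low - det1 + det2 - det3) / 2.0
    x[1::2, 1::2] = (low - det1 - det2 + det3) / 2.0
    return x

def wavelet_downsample(x):
    """Down-sample by keeping the low-frequency band and dropping the details."""
    low, _ = haar_dwt2(x)
    return low

rng = np.random.default_rng(0)
feature_map = rng.standard_normal((8, 6))
low, details = haar_dwt2(feature_map)
print(low.shape)                                             # half-size band
print(np.allclose(haar_idwt2(low, details), feature_map))   # exact reconstruction
```

Dropping `details` is what stands in for pooling here: the output has half the resolution in each dimension, and the discarded bands are the ones the abstract describes as carrying most of the noise.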
Deep Structured Features for Semantic Segmentation
We propose a highly structured neural network architecture for semantic
segmentation with an extremely small model size, suitable for low-power
embedded and mobile platforms. Specifically, our architecture combines i) a
Haar wavelet-based tree-like convolutional neural network (CNN), ii) a random
layer realizing a radial basis function kernel approximation, and iii) a linear
classifier. While stages i) and ii) are completely pre-specified, only the
linear classifier is learned from data. We apply the proposed architecture to
outdoor scene and aerial image semantic segmentation and show that the accuracy
of our architecture is competitive with conventional pixel classification CNNs.
Furthermore, we demonstrate that the proposed architecture is data efficient in
the sense of matching the accuracy of pixel classification CNNs when trained on
a much smaller data set.
Comment: EUSIPCO 2017, 5 pages, 2 figures
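The abstract above does not say how its random layer approximates the radial basis function kernel. One widely used construction is random Fourier features, sketched below in NumPy purely as an assumption about what such a fixed, data-independent layer could look like. The function name `random_fourier_features` and the parameters `n_features` and `gamma` are hypothetical; the kernel assumed is k(x, y) = exp(-gamma * ||x - y||^2).

```python
import numpy as np

def random_fourier_features(x, n_features, gamma, rng):
    """Map rows of x to random features whose inner products approximate
    the RBF kernel exp(-gamma * ||x - y||^2) (hypothetical helper).

    The weights and phases are drawn once at random and never trained.
    """
    d = x.shape[1]
    # Frequencies drawn from the kernel's spectral density N(0, 2*gamma*I).
    w = rng.normal(scale=np.sqrt(2.0 * gamma), size=(d, n_features))
    # Random phases, uniform on [0, 2*pi).
    b = rng.uniform(0.0, 2.0 * np.pi, size=n_features)
    return np.sqrt(2.0 / n_features) * np.cos(x @ w + b)

rng = np.random.default_rng(0)
points = rng.standard_normal((5, 3))
gamma = 0.5

z = random_fourier_features(points, n_features=20000, gamma=gamma, rng=rng)
approx_kernel = z @ z.T

# Exact RBF kernel for comparison.
sq_dists = ((points[:, None, :] - points[None, :, :]) ** 2).sum(axis=-1)
exact_kernel = np.exp(-gamma * sq_dists)

print(np.abs(approx_kernel - exact_kernel).max())  # shrinks as n_features grows
```

Because the feature map is fixed, a linear classifier trained on `z` behaves approximately like a kernel classifier, which matches the abstract's description of learning only the final linear stage.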
A convolutional neural network based deep learning methodology for recognition of partial discharge patterns from high voltage cables
It is a great challenge to differentiate partial discharge (PD) induced by different types of insulation defects in high-voltage cables. Some types of PD signals have very similar characteristics and are particularly difficult to differentiate, even for the most experienced specialists. To overcome this challenge, a convolutional neural network (CNN)-based deep learning methodology for PD pattern recognition is presented in this paper. First, PD testing for five types of artificial defects in ethylene-propylene-rubber cables is carried out in a high-voltage laboratory to generate signals containing PD data. Second, 3500 sets of PD transient pulses are extracted, and then 33 kinds of PD features are established. The third stage applies a CNN to the data; a typical CNN architecture and the key factors that affect the CNN-based pattern recognition accuracy are described. Factors discussed include the number of network layers, convolutional kernel size, activation function, and pooling method. This paper presents a flowchart of the CNN-based PD pattern recognition method and an evaluation with 3500 sets of PD samples. Finally, the CNN-based pattern recognition results are shown and the proposed method is compared with two more traditional analysis methods, i.e., support vector machine (SVM) and back propagation neural network (BPNN). The results show that the proposed CNN method has higher pattern recognition accuracy than SVM and BPNN, and that the novel method is especially effective for PD type recognition when signals are highly similar, which makes it suitable for industrial applications.
Classification of Arrhythmia by Using Deep Learning with 2-D ECG Spectral Image Representation
The electrocardiogram (ECG) is one of the most extensively employed signals
used in the diagnosis and prediction of cardiovascular diseases (CVDs). The ECG
signals can capture the heart's rhythmic irregularities, commonly known as
arrhythmias. A careful study of ECG signals is crucial for precise diagnoses of
patients' acute and chronic heart conditions. In this study, we propose a
two-dimensional (2-D) convolutional neural network (CNN) model for the
classification of ECG signals into eight classes; namely, normal beat,
premature ventricular contraction beat, paced beat, right bundle branch block
beat, left bundle branch block beat, atrial premature contraction beat,
ventricular flutter wave beat, and ventricular escape beat. The one-dimensional
ECG time series signals are transformed into 2-D spectrograms through
short-time Fourier transform. The 2-D CNN model consisting of four
convolutional layers and four pooling layers is designed for extracting robust
features from the input spectrograms. Our proposed methodology is evaluated on
a publicly available MIT-BIH arrhythmia dataset. We achieved a state-of-the-art
average classification accuracy of 99.11%, which is better than those of
recently reported results in classifying similar types of arrhythmias. The
performance is significant in other indices as well, including sensitivity and
specificity, which indicates the success of the proposed method.
Comment: 14 pages, 5 figures, accepted for future publication in Remote
Sensing MDPI Journal
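The transformation of a 1-D time series into a 2-D spectrogram described in the abstract above can be sketched with a short-time Fourier transform in plain NumPy. This is only an illustration on a synthetic sine wave, not the authors' preprocessing pipeline; the function name `stft_spectrogram`, the Hann window, and the `frame_len` and `hop` values are all assumptions.

```python
import numpy as np

def stft_spectrogram(signal, frame_len=64, hop=16):
    """Power spectrogram of a 1-D signal via a short-time Fourier transform.

    Hypothetical helper: slices the signal into overlapping frames, applies
    a Hann window, and takes the FFT of each frame.
    Returns an array of shape (frame_len // 2 + 1, n_frames).
    """
    window = np.hanning(frame_len)
    n_frames = 1 + (len(signal) - frame_len) // hop
    frames = np.stack([
        signal[i * hop : i * hop + frame_len] * window
        for i in range(n_frames)
    ])
    spectrum = np.fft.rfft(frames, axis=1)  # one row of frequency bins per frame
    return (np.abs(spectrum) ** 2).T        # frequency on rows, time on columns

# Synthetic stand-in for a signal: a sine with 8 cycles per 64-sample frame.
n = np.arange(512)
signal = np.sin(2.0 * np.pi * 8.0 * n / 64.0)

spec = stft_spectrogram(signal)
print(spec.shape)                # (frequency bins, time frames)
print(spec.argmax(axis=0)[:5])   # dominant frequency bin in the first frames
```

The resulting 2-D array can be treated as a single-channel image, which is the form a 2-D CNN with convolutional and pooling layers expects as input.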