4,109 research outputs found
Iteratively Optimized Patch Label Inference Network for Automatic Pavement Disease Detection
We present a novel deep learning framework named the Iteratively Optimized
Patch Label Inference Network (IOPLIN) for automatically detecting various
pavement diseases that are not solely limited to specific ones, such as cracks
and potholes. IOPLIN can be iteratively trained with only the image label via
the Expectation-Maximization Inspired Patch Label Distillation (EMIPLD)
strategy, and accomplish this task well by inferring the labels of patches from
the pavement images. IOPLIN enjoys many desirable properties over the
state-of-the-art single branch CNN models such as GoogLeNet and EfficientNet.
It is able to handle images in different resolutions, and sufficiently utilize
image information particularly for the high-resolution ones, since IOPLIN
extracts the visual features from unrevised image patches instead of the
resized entire image. Moreover, it can roughly localize the pavement distress
without using any prior localization information in the training phase. In
order to better evaluate the effectiveness of our method in practice, we
construct a large-scale Bituminous Pavement Disease Detection dataset named
CQU-BPDD consisting of 60,059 high-resolution pavement images, which are
acquired from different areas at different times. Extensive results on this
dataset demonstrate the superiority of IOPLIN over the state-of-the-art image
classification approaches in automatic pavement disease detection. The source
codes of IOPLIN are released on \url{https://github.com/DearCaat/ioplin}.Comment: Revision on IEEE Trans on IT
Driver Distraction Identification with an Ensemble of Convolutional Neural Networks
The World Health Organization (WHO) reported 1.25 million deaths yearly due
to road traffic accidents worldwide and the number has been continuously
increasing over the last few years. Nearly fifth of these accidents are caused
by distracted drivers. Existing work of distracted driver detection is
concerned with a small set of distractions (mostly, cell phone usage).
Unreliable ad-hoc methods are often used.In this paper, we present the first
publicly available dataset for driver distraction identification with more
distraction postures than existing alternatives. In addition, we propose a
reliable deep learning-based solution that achieves a 90% accuracy. The system
consists of a genetically-weighted ensemble of convolutional neural networks,
we show that a weighted ensemble of classifiers using a genetic algorithm
yields in a better classification confidence. We also study the effect of
different visual elements in distraction detection by means of face and hand
localizations, and skin segmentation. Finally, we present a thinned version of
our ensemble that could achieve 84.64% classification accuracy and operate in a
real-time environment.Comment: arXiv admin note: substantial text overlap with arXiv:1706.0949
Reduced Memory Region Based Deep Convolutional Neural Network Detection
Accurate pedestrian detection has a primary role in automotive safety: for
example, by issuing warnings to the driver or acting actively on car's brakes,
it helps decreasing the probability of injuries and human fatalities. In order
to achieve very high accuracy, recent pedestrian detectors have been based on
Convolutional Neural Networks (CNN). Unfortunately, such approaches require
vast amounts of computational power and memory, preventing efficient
implementations on embedded systems. This work proposes a CNN-based detector,
adapting a general-purpose convolutional network to the task at hand. By
thoroughly analyzing and optimizing each step of the detection pipeline, we
develop an architecture that outperforms methods based on traditional image
features and achieves an accuracy close to the state-of-the-art while having
low computational complexity. Furthermore, the model is compressed in order to
fit the tight constrains of low power devices with a limited amount of embedded
memory available. This paper makes two main contributions: (1) it proves that a
region based deep neural network can be finely tuned to achieve adequate
accuracy for pedestrian detection (2) it achieves a very low memory usage
without reducing detection accuracy on the Caltech Pedestrian dataset.Comment: IEEE 2016 ICCE-Berli
- …