2,841 research outputs found
POL-LWIR Vehicle Detection: Convolutional Neural Networks Meet Polarised Infrared Sensors
For vehicle autonomy, driver assistance and situational awareness, it is
necessary to operate at day and night, and in all weather conditions. In
particular, long wave infrared (LWIR) sensors that receive predominantly
emitted radiation have the capability to operate at night as well as during the
day. In this work, we employ a polarised LWIR (POL-LWIR) camera to acquire data
from a mobile vehicle, to compare and contrast four different convolutional
neural network (CNN) configurations to detect other vehicles in video
sequences. We evaluate two distinct and promising approaches, two-stage
detection (Faster-RCNN) and one-stage detection (SSD), in four different
configurations. We also employ two different image decompositions: the first
based on the polarisation ellipse and the second on the Stokes parameters
themselves. To evaluate our approach, the experimental trials were quantified
by mean average precision (mAP) and processing time, showing a clear trade-off
between the two factors. For example, the best mAP result of 80.94% was
achieved using Faster-RCNN, but at a frame rate of 6.4 fps. In contrast,
MobileNet SSD achieved only 64.51% mAP, but at 53.4 fps.Comment: Computer Vision and Pattern Recognition Workshop 201
Automatic vehicle tracking and recognition from aerial image sequences
This paper addresses the problem of automated vehicle tracking and
recognition from aerial image sequences. Motivated by its successes in the
existing literature focus on the use of linear appearance subspaces to describe
multi-view object appearance and highlight the challenges involved in their
application as a part of a practical system. A working solution which includes
steps for data extraction and normalization is described. In experiments on
real-world data the proposed methodology achieved promising results with a high
correct recognition rate and few, meaningful errors (type II errors whereby
genuinely similar targets are sometimes being confused with one another).
Directions for future research and possible improvements of the proposed method
are discussed
Recommended from our members
Design and performance assessment of correlation filters for the detection of objects in high clutter thermal imagery
The research reported in this thesis has examined means of enhancing the performance of the Optimal Trade-off Maximum Average Correlation Height (OT-MACH) filter for target detection in Forward Looking Infra-Red (FLIR) imagery acquired from a helicopter and border security FLIR camera in northern Kuwait. The data acquired with these FLIR sensors allows real-world evaluation of the comparative performance of the various filters that have been developed in the thesis. The results obtained have been quantified using well known performance measures such as Peak to Side-lobe Ratio (PSR) and Total Detection Error (TDE). The initial focus was to study the effect of modifying the OT-MACH parameters on the correlation metrics. A new optimisation technique has been presented, which computes statistically the filter alpha parameter associated with controlling the response of the filter to clutter noise. A further modification of the OT-MACH filter performance using the Difference of Gaussian bandpass filter (named the D-MACH filter) as a pre-processing stage has been described. The D-MACH has been applied to several test images containing single and multiple targets in the scene. Enhanced performance of the modified filter is demonstrated with improved metrics being obtained with less false side peaks in the correlation plane, especially when multiple targets are present in the test images.
A further pre-processing technique was investigated using the Rayleigh distribution as a pre-processing filter (named the R-MACH filter). The R-MACH filter has been applied
to multiple target types with tests conducted across various image data sets. The filter demonstrated an improvement over the Difference of Gaussian filter in terms of 6 reducing the number of parameters needing to be tuned whilst producing further enhanced correlation plane metrics.
Finally, recommendations for future work has been made to improve the use of the OT-MACH filter in target detection and identification. A novel training image representation is proposed for further investigation, which will minimise the computational intensity of using the MACH filter for unconstrained object recognition
Target classification in multimodal video
The presented thesis focuses on enhancing scene segmentation and target recognition methodologies via the mobilisation of contextual information. The algorithms developed to achieve this goal utilise multi-modal sensor information collected across varying scenarios,
from controlled indoor sequences to challenging rural locations. Sensors are chiefly colour band and long wave infrared (LWIR), enabling persistent surveillance capabilities across all environments. In the drive to develop effectual algorithms towards the outlined goals, key obstacles are identified and examined: the recovery of background scene structure from foreground object ’clutter’, employing contextual foreground knowledge to circumvent training a classifier when labeled data is not readily available, creating a labeled LWIR dataset to train a convolutional neural network (CNN) based object classifier and the viability of spatial context to address long range target classification when big data solutions are not enough. For an environment displaying frequent foreground clutter, such as a busy train station, we propose an algorithm exploiting foreground object presence to segment underlying scene structure that is not often visible. If such a location is outdoors and surveyed by an infra-red (IR) and visible band camera set-up, scene context and contextual knowledge transfer allows reasonable class predictions for thermal signatures within the scene to be determined. Furthermore, a labeled LWIR image corpus is created to train an infrared object classifier, using a CNN approach. The trained network demonstrates effective classification accuracy of 95% over 6 object classes. However, performance is not sustainable for IR targets acquired at long range due to low signal quality and classification accuracy drops. This is addressed by mobilising spatial context to affect network class scores, restoring robust classification capability
Comparison of fusion methods for thermo-visual surveillance tracking
In this paper, we evaluate the appearance tracking performance of multiple fusion schemes that combine information from standard CCTV and thermal infrared spectrum video for the tracking of surveillance objects, such as people, faces, bicycles and vehicles. We show results on numerous real world multimodal surveillance sequences, tracking challenging objects whose appearance changes rapidly. Based on these results we can determine the most promising fusion scheme
Stacked Denoising Autoencoders and Transfer Learning for Immunogold Particles Detection and Recognition
In this paper we present a system for the detection of immunogold particles
and a Transfer Learning (TL) framework for the recognition of these immunogold
particles. Immunogold particles are part of a high-magnification method for the
selective localization of biological molecules at the subcellular level only
visible through Electron Microscopy. The number of immunogold particles in the
cell walls allows the assessment of the differences in their compositions
providing a tool to analise the quality of different plants. For its
quantization one requires a laborious manual labeling (or annotation) of images
containing hundreds of particles. The system that is proposed in this paper can
leverage significantly the burden of this manual task.
For particle detection we use a LoG filter coupled with a SDA. In order to
improve the recognition, we also study the applicability of TL settings for
immunogold recognition. TL reuses the learning model of a source problem on
other datasets (target problems) containing particles of different sizes. The
proposed system was developed to solve a particular problem on maize cells,
namely to determine the composition of cell wall ingrowths in endosperm
transfer cells. This novel dataset as well as the code for reproducing our
experiments is made publicly available.
We determined that the LoG detector alone attained more than 84\% of accuracy
with the F-measure. Developing immunogold recognition with TL also provided
superior performance when compared with the baseline models augmenting the
accuracy rates by 10\%
Recommended from our members
Local feature based automatic target recognition for future 3D active homing seeker missiles
We propose an architecture appropriate for future Light Detection and Ranging (LIDAR) active homing seeker missiles with Automatic Target Recognition (ATR) capabilities. Our proposal enhances military targeting performance by extending ATR into the 3rd dimension. From a military and aerospace industry point of view, this is appealing as weapon effectiveness against camouflage, concealment and deception techniques can be substantially improved.
Specifically, we present a missile seeker 3D ATR architecture that relies on the 3D local feature based SHOT descriptor and a dual-role pipeline with a number of pre and post-processing operations. We evaluate our architecture on a number of missile engagement scenarios in various environmental setups with the missile being under various altitudes, obliquities, distances to the target and scene resolutions. Under these demanding conditions, the recognition performance gained is highly promising. Even in the extreme case of reducing the database entries to a single template per target, our interchangeable ATR architecture still provides a highly acceptable performance.
Although we focus on future intelligent missile systems, our approach can be implemented to a great range of time-critical complex systems for space, air and ground environments for military, law-enforcement, commercial and research purposes
- …