1,177 research outputs found
Vehicle-Rear: A New Dataset to Explore Feature Fusion for Vehicle Identification Using Convolutional Neural Networks
This work addresses the problem of vehicle identification through
non-overlapping cameras. As our main contribution, we introduce a novel dataset
for vehicle identification, called Vehicle-Rear, that contains more than three
hours of high-resolution videos, with accurate information about the make,
model, color and year of nearly 3,000 vehicles, in addition to the position and
identification of their license plates. To explore our dataset we design a
two-stream CNN that simultaneously uses two of the most distinctive and
persistent features available: the vehicle's appearance and its license plate.
This is an attempt to tackle a major problem: false alarms caused by vehicles
with similar designs or by very close license plate identifiers. In the first
network stream, shape similarities are identified by a Siamese CNN that uses a
pair of low-resolution vehicle patches recorded by two different cameras. In
the second stream, we use a CNN for OCR to extract textual information,
confidence scores, and string similarities from a pair of high-resolution
license plate patches. Then, features from both streams are merged by a
sequence of fully connected layers for decision. In our experiments, we
compared the two-stream network against several well-known CNN architectures
using single or multiple vehicle features. The architectures, trained models,
and dataset are publicly available at https://github.com/icarofua/vehicle-rear
License Plate Recognition using Convolutional Neural Networks Trained on Synthetic Images
In this thesis, we propose a license plate recognition system and study the feasibility
of using synthetic training samples to train convolutional neural networks for a
practical application.
First we develop a modular framework for synthetic license plate generation; to
generate different license plate types (or other objects) only the first module needs
to be adapted. The other modules apply variations to the training samples such as
background, occlusions, camera perspective projection, object noise and camera
acquisition noise, with the aim to achieve enough variation of the object that the
trained networks will also recognize real objects of the same class.
Then we design two convolutional neural networks of low-complexity for license
plate detection and character recognition. Both are designed for simultaneous
classification and localization by branching the networks into a classification and a
regression branch and are trained end-to-end simultaneously over both branches, on
only our synthetic training samples.
To recognize real license plates, we design a pipeline for scale invariant license
plate detection with a scale pyramid and a fully convolutional application of the
license plate detection network in order to detect any number of license plates and
of any scale in an image. Before character classification is applied, potential plate
regions are un-skewed based on the detected plate location in order to achieve an as
optimal representation of the characters as possible. The character classification is
also performed with a fully convolutional sweep to simultaneously find all characters
at once.
Both the plate and the character stages apply a refinement classification where
initial classifications are first centered and rescaled. We show that this simple, yet
effective trick greatly improves the accuracy of our classifications, and at a small
increase of complexity. To our knowledge, this trick has not been exploited before.
To show the effectiveness of our system we first apply it on a dataset of photos
of Italian license plates to evaluate the different stages of our system and which
effect the classification thresholds have on the accuracy. We also find robust training
parameters and thresholds that are reliable for classification without any need for
calibration on a validation set of real annotated samples (which may not always be
available) and achieve a balanced precision and recall on the set of Italian license
plates, both in excess of 98%.
Finally, to show that our system generalizes to new plate types, we compare our
system to two reference system on a dataset of Taiwanese license plates. For this, we
only modify the first module of the synthetic plate generation algorithm to produce
Taiwanese license plates and adjust parameters regarding plate dimensions, then we
train our networks and apply the classification pipeline, using the robust parameters,
on the Taiwanese reference dataset. We achieve state-of-the-art performance on plate
detection (99.86% precision and 99.1% recall), single character detection (99.6%)
and full license reading (98.7%)
An advanced combination of semi-supervised Normalizing Flow & Yolo (YoloNF) to detect and recognize vehicle license plates
Fully Automatic License Plate Recognition (ALPR) has been a frequent research
topic due to several practical applications. However, many of the current
solutions are still not robust enough in real situations, commonly depending on
many constraints. This paper presents a robust and efficient ALPR system based
on the state-of-the-art YOLO object detector and Normalizing flows. The model
uses two new strategies. Firstly, a two-stage network using YOLO and a
normalization flow-based model for normalization to detect Licenses Plates (LP)
and recognize the LP with numbers and Arabic characters. Secondly, Multi-scale
image transformations are implemented to provide a solution to the problem of
the YOLO cropped LP detection including significant background noise.
Furthermore, extensive experiments are led on a new dataset with realistic
scenarios, we introduce a larger public annotated dataset collected from
Moroccan plates. We demonstrate that our proposed model can learn on a small
number of samples free of single or multiple characters. The dataset will also
be made publicly available to encourage further studies and research on plate
detection and recognition.Comment: arXiv admin note: text overlap with arXiv:1802.09567 by other
authors; text overlap with arXiv:2012.06737 by other authors without
attributio
- …