26 research outputs found
Vehicle license plate detection and recognition
"December 2013.""A Thesis presented to the Faculty of the Graduate School at the University of Missouri In Partial Fulfillment of the Requirements for the Degree Master of Science."Thesis supervisor: Dr. Zhihai He.In this work, we develop a license plate detection method using a SVM (Support Vector Machine) classifier with HOG (Histogram of Oriented Gradients) features. The system performs window searching at different scales and analyzes the HOG feature using a SVM and locates their bounding boxes using a Mean Shift method. Edge information is used to accelerate the time consuming scanning process. Our license plate detection results show that this method is relatively insensitive to variations in illumination, license plate patterns, camera perspective and background variations. We tested our method on 200 real life images, captured on Chinese highways under different weather conditions and lighting conditions. And we achieved a detection rate of 100%. After detecting license plates, alignment is then performed on the plate candidates. Conceptually, this alignment method searches neighbors of the bounding box detected, and finds the optimum edge position where the outside regions are very different from the inside regions of the license plate, from color's perspective in RGB space. This method accurately aligns the bounding box to the edges of the plate so that the subsequent license plate segmentation and recognition can be performed accurately and reliably. The system performs license plate segmentation using global alignment on the binary license plate. A global model depending on the layout of license plates is proposed to segment the plates. This model searches for the optimum position where the characters are all segmented but not chopped into pieces. At last, the characters are recognized by another SVM classifier, with a feature size of 576, including raw features, vertical and horizontal scanning features. Our character recognition results show that 99% of the digits are successfully recognized, while the letters achieve an recognition rate of 95%. The license plate recognition system was then incorporated into an embedded system for parallel computing. Several TS7250 and an auxiliary board are used to simulIncludes bibliographical references (pages 67-73)
License Plate Recognition using Convolutional Neural Networks Trained on Synthetic Images
In this thesis, we propose a license plate recognition system and study the feasibility
of using synthetic training samples to train convolutional neural networks for a
practical application.
First we develop a modular framework for synthetic license plate generation; to
generate different license plate types (or other objects) only the first module needs
to be adapted. The other modules apply variations to the training samples such as
background, occlusions, camera perspective projection, object noise and camera
acquisition noise, with the aim to achieve enough variation of the object that the
trained networks will also recognize real objects of the same class.
Then we design two convolutional neural networks of low-complexity for license
plate detection and character recognition. Both are designed for simultaneous
classification and localization by branching the networks into a classification and a
regression branch and are trained end-to-end simultaneously over both branches, on
only our synthetic training samples.
To recognize real license plates, we design a pipeline for scale invariant license
plate detection with a scale pyramid and a fully convolutional application of the
license plate detection network in order to detect any number of license plates and
of any scale in an image. Before character classification is applied, potential plate
regions are un-skewed based on the detected plate location in order to achieve an as
optimal representation of the characters as possible. The character classification is
also performed with a fully convolutional sweep to simultaneously find all characters
at once.
Both the plate and the character stages apply a refinement classification where
initial classifications are first centered and rescaled. We show that this simple, yet
effective trick greatly improves the accuracy of our classifications, and at a small
increase of complexity. To our knowledge, this trick has not been exploited before.
To show the effectiveness of our system we first apply it on a dataset of photos
of Italian license plates to evaluate the different stages of our system and which
effect the classification thresholds have on the accuracy. We also find robust training
parameters and thresholds that are reliable for classification without any need for
calibration on a validation set of real annotated samples (which may not always be
available) and achieve a balanced precision and recall on the set of Italian license
plates, both in excess of 98%.
Finally, to show that our system generalizes to new plate types, we compare our
system to two reference system on a dataset of Taiwanese license plates. For this, we
only modify the first module of the synthetic plate generation algorithm to produce
Taiwanese license plates and adjust parameters regarding plate dimensions, then we
train our networks and apply the classification pipeline, using the robust parameters,
on the Taiwanese reference dataset. We achieve state-of-the-art performance on plate
detection (99.86% precision and 99.1% recall), single character detection (99.6%)
and full license reading (98.7%)
Recommended from our members
On Building Generalizable Learning Agents
It has been a long-standing goal in Artificial Intelligence (AI) to build machines that can solve tasks that humans can. Thanks to the recent rapid progress in data-driven methods, which train agents to solve tasks by learning from massive training data, there have been many successes in applying such learning approaches to handle and even solve a number of extremely challenging tasks, including image classification, language generation, robotics control, and several multi-player games. The key factor for all these data-driven successes is that the trained agents can generalize to test scenarios that are unseen during training. This generalization capability is the foundation for building any practical AI system. This thesis studies generalization, the fundamental challenge in AI, and proposes solutions to improve the generalization performances of learning agents in a variety of problems. We start by providing a formal formulation of the generalization problem in the context of reinforcement learning and proposing 4 principles within this formulation to guide the design of training techniques for improved generalization. We validate the effectiveness of our proposed principles by considering 4 different domains, from simple to complex, and developing domain-specific techniques following these principles. Particularly, we begin with the simplest domain, i.e., path-finding on graphs (Part I), and then consider visual navigation in a 3D world (Part II) and competition in complex multi-agent games (Part III), and lastly tackle some natural language processing tasks (Part IV). Empirical evidences demonstrate that the proposed principles can generally lead to much improved generalization performances in a wide range of problems