285 research outputs found
An advanced combination of semi-supervised Normalizing Flow & Yolo (YoloNF) to detect and recognize vehicle license plates
Fully Automatic License Plate Recognition (ALPR) has been a frequent research
topic due to several practical applications. However, many of the current
solutions are still not robust enough in real situations, commonly depending on
many constraints. This paper presents a robust and efficient ALPR system based
on the state-of-the-art YOLO object detector and Normalizing flows. The model
uses two new strategies. Firstly, a two-stage network using YOLO and a
normalization flow-based model for normalization to detect Licenses Plates (LP)
and recognize the LP with numbers and Arabic characters. Secondly, Multi-scale
image transformations are implemented to provide a solution to the problem of
the YOLO cropped LP detection including significant background noise.
Furthermore, extensive experiments are led on a new dataset with realistic
scenarios, we introduce a larger public annotated dataset collected from
Moroccan plates. We demonstrate that our proposed model can learn on a small
number of samples free of single or multiple characters. The dataset will also
be made publicly available to encourage further studies and research on plate
detection and recognition.Comment: arXiv admin note: text overlap with arXiv:1802.09567 by other
authors; text overlap with arXiv:2012.06737 by other authors without
attributio
Unified Chinese License Plate Detection and Recognition with High Efficiency
Recently, deep learning-based methods have reached an excellent performance
on License Plate (LP) detection and recognition tasks. However, it is still
challenging to build a robust model for Chinese LPs since there are not enough
large and representative datasets. In this work, we propose a new dataset named
Chinese Road Plate Dataset (CRPD) that contains multi-objective Chinese LP
images as a supplement to the existing public benchmarks. The images are mainly
captured with electronic monitoring systems with detailed annotations. To our
knowledge, CRPD is the largest public multi-objective Chinese LP dataset with
annotations of vertices. With CRPD, a unified detection and recognition network
with high efficiency is presented as the baseline. The network is end-to-end
trainable with totally real-time inference efficiency (30 fps with 640p). The
experiments on several public benchmarks demonstrate that our method has
reached competitive performance. The code and dataset will be publicly
available at https://github.com/yxgong0/CRPD
ADD: An Automatic Desensitization Fisheye Dataset for Autonomous Driving
Autonomous driving systems require many images for analyzing the surrounding
environment. However, there is fewer data protection for private information
among these captured images, such as pedestrian faces or vehicle license
plates, which has become a significant issue. In this paper, in response to the
call for data security laws and regulations and based on the advantages of
large Field of View(FoV) of the fisheye camera, we build the first Autopilot
Desensitization Dataset, called ADD, and formulate the first
deep-learning-based image desensitization framework, to promote the study of
image desensitization in autonomous driving scenarios. The compiled dataset
consists of 650K images, including different face and vehicle license plate
information captured by the surround-view fisheye camera. It covers various
autonomous driving scenarios, including diverse facial characteristics and
license plate colors. Then, we propose an efficient multitask desensitization
network called DesCenterNet as a benchmark on the ADD dataset, which can
perform face and vehicle license plate detection and desensitization tasks.
Based on ADD, we further provide an evaluation criterion for desensitization
performance, and extensive comparison experiments have verified the
effectiveness and superiority of our method on image desensitization
Towards End-to-end Car License Plate Location and Recognition in Unconstrained Scenarios
Benefiting from the rapid development of convolutional neural networks, the
performance of car license plate detection and recognition has been largely
improved. Nonetheless, challenges still exist especially for real-world
applications. In this paper, we present an efficient and accurate framework to
solve the license plate detection and recognition tasks simultaneously. It is a
lightweight and unified deep neural network, that can be optimized end-to-end
and work in real-time. Specifically, for unconstrained scenarios, an
anchor-free method is adopted to efficiently detect the bounding box and four
corners of a license plate, which are used to extract and rectify the target
region features. Then, a novel convolutional neural network branch is designed
to further extract features of characters without segmentation. Finally,
recognition task is treated as sequence labelling problems, which are solved by
Connectionist Temporal Classification (CTC) directly. Several public datasets
including images collected from different scenarios under various conditions
are chosen for evaluation. A large number of experiments indicate that the
proposed method significantly outperforms the previous state-of-the-art methods
in both speed and precision
A deep learning approach for complex microstructure inference
Automated, reliable, and objective microstructure inference from micrographs is essential for a comprehensive understanding of process-microstructure-property relations and tailored materials development. However, such inference, with the increasing complexity of microstructures, requires advanced segmentation methodologies. While deep learning offers new opportunities, an intuition about the required data quality/quantity and a methodological guideline for microstructure quantification is still missing. This, along with deep learning’s seemingly intransparent decision-making process, hampers its breakthrough in this field. We apply a multidisciplinary deep learning approach, devoting equal attention to specimen preparation and imaging, and train distinct U-Net architectures with 30–50 micrographs of different imaging modalities and electron backscatter diffraction-informed annotations. On the challenging task of lath-bainite segmentation in complex-phase steel, we achieve accuracies of 90% rivaling expert segmentations. Further, we discuss the impact of image context, pre-training with domain-extrinsic data, and data augmentation. Network visualization techniques demonstrate plausible model decisions based on grain boundary morphology
- …