Search CORE

1,612 research outputs found

Vehicle-Rear: A New Dataset to Explore Feature Fusion for Vehicle Identification Using Convolutional Neural Networks

Author: de Oliveira Icaro O.
Fonseca Keiko V. O.
Laroca Rayson
Menotti David
Minetto Rodrigo
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2021
Field of study

This work addresses the problem of vehicle identification through non-overlapping cameras. As our main contribution, we introduce a novel dataset for vehicle identification, called Vehicle-Rear, that contains more than three hours of high-resolution videos, with accurate information about the make, model, color and year of nearly 3,000 vehicles, in addition to the position and identification of their license plates. To explore our dataset we design a two-stream CNN that simultaneously uses two of the most distinctive and persistent features available: the vehicle's appearance and its license plate. This is an attempt to tackle a major problem: false alarms caused by vehicles with similar designs or by very close license plate identifiers. In the first network stream, shape similarities are identified by a Siamese CNN that uses a pair of low-resolution vehicle patches recorded by two different cameras. In the second stream, we use a CNN for OCR to extract textual information, confidence scores, and string similarities from a pair of high-resolution license plate patches. Then, features from both streams are merged by a sequence of fully connected layers for decision. In our experiments, we compared the two-stream network against several well-known CNN architectures using single or multiple vehicle features. The architectures, trained models, and dataset are publicly available at https://github.com/icarofua/vehicle-rear

arXiv.org e-Print Archive

Directory of Open Access Journals

Comparative Study of Different Methods in Vibration-Based Terrain Classification for Wheeled Robots with Shock Absorbers

Author: Chang Ji
Li Xiaochuan
Li Yuling
Li Zerui
Lv Wenjun
Mei Mingliang
Publication venue: 'MDPI AG'
Publication date: 01/03/2019
Field of study

open access articleAutonomous robots that operate in the field can enhance their security and efficiency by accurate terrain classification, which can be realized by means of robot-terrain interaction-generated vibration signals. In this paper, we explore the vibration-based terrain classification (VTC), in particular for a wheeled robot with shock absorbers. Because the vibration sensors are usually mounted on the main body of the robot, the vibration signals are dampened significantly, which results in the vibration signals collected on different terrains being more difficult to discriminate. Hence, the existing VTC methods applied to a robot with shock absorbers may degrade. The contributions are two-fold: (1) Several experiments are conducted to exhibit the performance of the existing feature-engineering and feature-learning classification methods; and (2) According to the long short-term memory (LSTM) network, we propose a one-dimensional convolutional LSTM (1DCL)-based VTC method to learn both spatial and temporal characteristics of the dampened vibration signals. The experiment results demonstrate that: (1) The feature-engineering methods, which are efficient in VTC of the robot without shock absorbers, are not so accurate in our project; meanwhile, the feature-learning methods are better choices; and (2) The 1DCL-based VTC method outperforms the conventional methods with an accuracy of 80.18%, which exceeds the second method (LSTM) by 8.23%

Multidisciplinary Digital Publishing Institute

Directory of Open Access Journals

De Montfort University Open Research Archive

Handcrafted and Transfer Learned Feature Techniques for Vehicle Make and Model Recognition on Nigerian Road

Author: Adeola O Kolawole
Irhebhude Martins E.
Michael Chinonye Izuegbu
Publication venue: Universiti Sultan Zainal Abidin
Publication date: 31/10/2023
Field of study

The vehicle makes and model recognition (VMMR) is a challenging task due to the wide range of vehicle categories and similarities between different classes. Studies have shown that works have recognized vehicles of different countries' make and models. Popular vehicles on Nigerian roads may include products like; Toyota, Honda, Peugeot, Benz, Innoson Vehicle Manufacturing (IVM), etc. The VMMR is important in the intelligent transport system hence, this paper presents a handcrafted and transfer learning model to detect stationary vehicles and classify them based on brand, make, and model. A new dataset was introduced consisting of selected images of popular brands of vehicles driven on Nigerian roads. Framework for a vehicle make and model recognition was developed by extracting features using EfficientNet and HOG models and evaluated on the locally gathered datasets. For classification, a linear Support Machine Vector (SVM) was used. Experimental results showed 94.5% on HOG, 97% with EfficientNet, and 98.1% accuracy when HOG and EfficientNet features were concatenation.  The proposed concatenated model outperformed HOG and EfficientNet extracted features by providing higher accuracy and confusion matrix with the highest number of classified images. The study shows the advantages of the proposed model in terms of its accuracy in terms of identifying the vehicle make and model

Malaysian Journal of Applied Sciences (Journal of UniSZA - Universiti Sultan Zainal Abidin)

End-to-End Learning of Representations for Asynchronous Event-Based Data

Author: Derpanis Konstantinos G.
Gehrig Daniel
Loquercio Antonio
Scaramuzza Davide
Publication venue
Publication date: 20/08/2019
Field of study

Event cameras are vision sensors that record asynchronous streams of per-pixel brightness changes, referred to as "events". They have appealing advantages over frame-based cameras for computer vision, including high temporal resolution, high dynamic range, and no motion blur. Due to the sparse, non-uniform spatiotemporal layout of the event signal, pattern recognition algorithms typically aggregate events into a grid-based representation and subsequently process it by a standard vision pipeline, e.g., Convolutional Neural Network (CNN). In this work, we introduce a general framework to convert event streams into grid-based representations through a sequence of differentiable operations. Our framework comes with two main advantages: (i) allows learning the input event representation together with the task dedicated network in an end to end manner, and (ii) lays out a taxonomy that unifies the majority of extant event representations in the literature and identifies novel ones. Empirically, we show that our approach to learning the event representation end-to-end yields an improvement of approximately 12% on optical flow estimation and object recognition over state-of-the-art methods.Comment: To appear at ICCV 201

arXiv.org e-Print Archive

Crossref

ZORA

Practical classification of different moving targets using automotive radar and deep neural networks

Author: Angelov Aleksandar
Fioranelli Francesco
Murray-Smith Roderick
Robertson Andrew
Publication venue: 'Institution of Engineering and Technology (IET)'
Publication date: 01/10/2018
Field of study

In this work, the authors present results for classification of different classes of targets (car, single and multiple people, bicycle) using automotive radar data and different neural networks. A fast implementation of radar algorithms for detection, tracking, and micro-Doppler extraction is proposed in conjunction with the automotive radar transceiver TEF810X and microcontroller unit SR32R274 manufactured by NXP Semiconductors. Three different types of neural networks are considered, namely a classic convolutional network, a residual network, and a combination of convolutional and recurrent network, for different classification problems across the four classes of targets recorded. Considerable accuracy (close to 100% in some cases) and low latency of the radar pre-processing prior to classification (∼0.55 s to produce a 0.5 s long spectrogram) are demonstrated in this study, and possible shortcomings and outstanding issues are discussed

Enlighten