Benchmark Analysis of Representative Deep Neural Network Architectures

Author: Bianco Simone
Cadene Remi
Celona Luigi
Napoletano Paolo
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 19/10/2018

This work presents an in-depth analysis of the majority of the deep neural networks (DNNs) proposed in the state of the art for image recognition. For each DNN, multiple performance indices are observed, such as recognition accuracy, model complexity, computational complexity, memory usage, and inference time. The behavior of these performance indices, and of some combinations of them, is analyzed and discussed. To measure the indices, we experiment with the use of DNNs on two different computer architectures: a workstation equipped with an NVIDIA Titan X Pascal and an embedded system based on an NVIDIA Jetson TX1 board. This experimentation allows a direct comparison between DNNs running on machines with very different computational capacity. This study is useful for researchers, who gain a complete view of which solutions have been explored so far and which research directions are worth exploring in the future, and for practitioners, who can select the DNN architecture(s) that best fit the resource constraints of practical deployments and applications. To complete this work, all the DNNs, as well as the software used for the analysis, are available online. Comment: Will appear in IEEE Access

arXiv.org e-Print Archive

LCrowdV: Generating Labeled Videos for Simulation-based Crowd Behavior Learning

Author: A Chan
A. Bruderlin
B Solmaz
B Ulicny
B Zhou
D Helbing
F Lamarche
F Zhu
G Antonini
G Le Bon
Hans J. Eysenck
J Barraquand
J James
J van den Berg
J Xu
K Zhang
KK Reddy
L Pervin
Mehdi Moussaïd
R Geraerts
S Ali
S Ali
S Curtis
T Li
X Song
X Wang
Y Tsuduki
Publication date: 04/07/2016

We present a novel procedural framework to generate an arbitrary number of labeled crowd videos (LCrowdV). The resulting crowd video datasets are used to design accurate algorithms or training models for crowded scene understanding. Our overall approach is composed of two components: a procedural simulation framework for generating crowd movements and behaviors, and a procedural rendering framework to generate different videos or images. Each video or image is automatically labeled based on the environment, number of pedestrians, density, behavior, flow, lighting conditions, viewpoint, noise, etc. Furthermore, we can increase the realism by combining synthetically generated behaviors with real-world background videos. We demonstrate the benefits of LCrowdV over prior labeled crowd datasets by improving the accuracy of pedestrian detection and crowd behavior classification algorithms. LCrowdV would be released on the WWW.

arXiv.org e-Print Archive