Search CORE

63,611 research outputs found

RGBD Datasets: Past, Present and Future

Author: Firman Michael
Publication venue
Publication date: 13/04/2016
Field of study

Since the launch of the Microsoft Kinect, scores of RGBD datasets have been released. These have propelled advances in areas from reconstruction to gesture recognition. In this paper we explore the field, reviewing datasets across eight categories: semantics, object pose estimation, camera tracking, scene reconstruction, object tracking, human actions, faces and identification. By extracting relevant information in each category we help researchers to find appropriate data for their needs, and we consider which datasets have succeeded in driving computer vision forward and why. Finally, we examine the future of RGBD datasets. We identify key areas which are currently underexplored, and suggest that future directions may include synthetic data and dense reconstructions of static and dynamic scenes.Comment: 8 pages excluding references (CVPR style

arXiv.org e-Print Archive

Crossref

LCrowdV: Generating Labeled Videos for Simulation-based Crowd Behavior Learning

Author: A Chan
A. Bruderlin
B Solmaz
B Ulicny
B Zhou
D Helbing
F Lamarche
F Zhu
G Antonini
G Le Bon
Hans J. Eysenck
J Barraquand
J James
J van den Berg
J Xu
K Zhang
KK Reddy
L Pervin
Mehdi Moussaïd
R Geraerts
S Ali
S Ali
S Curtis
T Li
X Song
X Wang
Y Tsuduki
Publication venue
Publication date: 04/07/2016
Field of study

We present a novel procedural framework to generate an arbitrary number of labeled crowd videos (LCrowdV). The resulting crowd video datasets are used to design accurate algorithms or training models for crowded scene understanding. Our overall approach is composed of two components: a procedural simulation framework for generating crowd movements and behaviors, and a procedural rendering framework to generate different videos or images. Each video or image is automatically labeled based on the environment, number of pedestrians, density, behavior, flow, lighting conditions, viewpoint, noise, etc. Furthermore, we can increase the realism by combining synthetically-generated behaviors with real-world background videos. We demonstrate the benefits of LCrowdV over prior lableled crowd datasets by improving the accuracy of pedestrian detection and crowd behavior classification algorithms. LCrowdV would be released on the WWW

arXiv.org e-Print Archive

Crossref

Survey on Vision-based Path Prediction

Author: A Lerner
A Robicquet
CG Keller
D Helbing
D Munoz
D Weinland
E Shelhamer
H Zhu
JANE BROMLEY
JFP Kooij
KM Kitani
L Ballan
Nicolas Schneider
R Benenson
S Huang
S Singh
S Yi
SZ Bokhari
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/11/2018
Field of study

Path prediction is a fundamental task for estimating how pedestrians or vehicles are going to move in a scene. Because path prediction as a task of computer vision uses video as input, various information used for prediction, such as the environment surrounding the target and the internal state of the target, need to be estimated from the video in addition to predicting paths. Many prediction approaches that include understanding the environment and the internal state have been proposed. In this survey, we systematically summarize methods of path prediction that take video as input and and extract features from the video. Moreover, we introduce datasets used to evaluate path prediction methods quantitatively.Comment: DAPI 201

arXiv.org e-Print Archive

Crossref