Search CORE

4,289 research outputs found

Box-level Segmentation Supervised Deep Neural Networks for Accurate and Real-time Multispectral Pedestrian Detection

Author: Cao Yanlong
Cao Yanpeng
Guan Dayan
Wu Yulun
Yang Jiangxin
Yang Michael Ying
Publication date: 14/02/2019

Effective fusion of complementary information captured by multi-modal sensors (visible and infrared cameras) enables robust pedestrian detection under various surveillance situations (e.g. daytime and nighttime). In this paper, we present a novel box-level segmentation supervised learning framework for accurate and real-time multispectral pedestrian detection by incorporating features extracted in visible and infrared channels. Specifically, our method takes pairs of aligned visible and infrared images with easily obtained bounding box annotations as input and estimates accurate prediction maps to highlight the existence of pedestrians. It offers two major advantages over the existing anchor box based multispectral detection methods. Firstly, it overcomes the hyperparameter setting problem that arises during the training phase of anchor box based detectors and can obtain more accurate detection results, especially for small and occluded pedestrian instances. Secondly, it is capable of generating accurate detection results using small-size input images, which improves computational efficiency for real-time autonomous driving applications. Experimental results on the KAIST multispectral dataset show that our proposed method outperforms state-of-the-art approaches in terms of both accuracy and speed.
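
The abstract above says training uses bounding box annotations to supervise prediction maps. The exact target construction is not given here, so the following is only a minimal sketch of one plausible way to rasterise boxes into a binary box-level mask. The function name `boxes_to_mask` and the box convention (exclusive maximum coordinates) are assumptions made for illustration, not the authors' implementation.

```python
from typing import List, Tuple

# Assumed convention: (x_min, y_min, x_max, y_max) with exclusive maxima.
Box = Tuple[int, int, int, int]


def boxes_to_mask(boxes: List[Box], height: int, width: int) -> List[List[int]]:
    """Rasterise bounding boxes into a binary mask (1 inside any box, 0 elsewhere)."""
    mask = [[0] * width for _ in range(height)]
    for x_min, y_min, x_max, y_max in boxes:
        # Clip each box to the image so out-of-range annotations do not raise.
        x0, x1 = max(0, x_min), min(width, x_max)
        y0, y1 = max(0, y_min), min(height, y_max)
        for y in range(y0, y1):
            for x in range(x0, x1):
                mask[y][x] = 1
    return mask


# Two hypothetical pedestrian boxes on a 5 x 6 image; the second box
# extends past the right edge and is clipped.
example_mask = boxes_to_mask([(1, 1, 3, 4), (4, 0, 8, 2)], height=5, width=6)
for row in example_mask:
    print("".join(str(v) for v in row))
```

A mask of this kind could serve as the per-pixel target for a segmentation-style loss, which avoids choosing anchor box sizes and aspect ratios.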

arXiv.org e-Print Archive

University of Twente Research Information

Scanning from heating: 3D shape estimation of transparent objects from local surface heating

Author: A. Teoman Naskali
Aytul Ercil
David Fofi
Fabrice Meriaudeau
Frederic Truchetet
Gonen Eren
Jiao
L.A. Sanchez Secades
Miyazaki
Olivier Aubreton
Pelletier
Publication venue: 'The Optical Society'
Publication date: 01/01/2009

Today, with quality becoming increasingly important, each product requires three-dimensional in-line quality control. On the other hand, the 3D reconstruction of transparent objects is a very difficult problem in computer vision due to transparency and specularity of the surface. This paper proposes a new method, called Scanning From Heating (SFH), to determine the surface shape of transparent objects using laser surface heating and thermal imaging. Furthermore, the application to transparent glass is discussed and results on different surface shapes are presented.

HAL-uB

Crossref

Sabanci University Research Database

Thermo-visual feature fusion for object tracking using multiple spatiogram trackers

Author: Alan Smeaton
C. Yang
Ciarán Ó Conaire
D. Comaniciu
G. Fumera
M. Spengler
Noel E. O’Connor
P. Pérez
R.E. Bellman
R.T. Collins
V. Comaniciu
W. Abd-Almageed
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 11/03/2007

In this paper, we propose a framework that can efficiently combine features for robust tracking based on fusing the outputs of multiple spatiogram trackers. This is achieved without the exponential increase in storage and processing that other multimodal tracking approaches suffer from. The framework allows the features to be split arbitrarily between the trackers, as well as providing the flexibility to add, remove or dynamically weight features. We derive a mean-shift type algorithm for the framework that allows efficient object tracking with very low computational overhead. We especially target the fusion of thermal infrared and visible spectrum features as the most useful features for automated surveillance applications. Results are shown on multimodal video sequences clearly illustrating the benefits of combining multiple features using our framework.
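
The abstract above does not give the fusion rule, so the sketch below only illustrates the general idea under stated assumptions. It assumes the "exponential increase in storage" refers to a single joint histogram over all features, and it fuses per-tracker position estimates with a simple weighted mean. The names `fuse_estimates`, `joint_histogram_bins` and `separate_histogram_bins` are hypothetical, and the paper's mean-shift derivation is not reproduced.

```python
from typing import List, Tuple

Point = Tuple[float, float]


def joint_histogram_bins(bins_per_feature: int, num_features: int) -> int:
    """Bin count of one joint histogram over all features (grows exponentially)."""
    return bins_per_feature ** num_features


def separate_histogram_bins(bins_per_feature: int, num_features: int) -> int:
    """Total bin count when each feature has its own tracker (grows linearly)."""
    return bins_per_feature * num_features


def fuse_estimates(estimates: List[Point], weights: List[float]) -> Point:
    """Weighted mean of per-tracker position estimates (an assumed fusion rule)."""
    if not estimates or len(estimates) != len(weights):
        raise ValueError("need one weight per estimate")
    total = sum(weights)
    if total <= 0:
        raise ValueError("weights must sum to a positive value")
    x = sum(w * p[0] for p, w in zip(estimates, weights)) / total
    y = sum(w * p[1] for p, w in zip(estimates, weights)) / total
    return (x, y)


# Hypothetical outputs: one tracker on thermal infrared, one on visible colour.
thermal_estimate = (10.0, 20.0)
visible_estimate = (14.0, 24.0)
fused = fuse_estimates([thermal_estimate, visible_estimate], weights=[3.0, 1.0])
print(fused)
print(joint_histogram_bins(16, 4), separate_histogram_bins(16, 4))
```

Setting a weight to zero removes that feature from the fused estimate, which is one simple way to picture adding, removing or reweighting features at run time.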

Crossref

Irish Universities

DCU Online Research Access Service

Pedestrian Attribute Recognition: A Survey

Author: Luo Bin
Tang Jin
Wang Xiao
Yang Rui
Zheng Shaofei
Publication date: 22/01/2019

Recognizing pedestrian attributes is an important task in the computer vision community because it plays an important role in video surveillance. Many algorithms have been proposed to handle this task. The goal of this paper is to review existing works, both those using traditional methods and those based on deep learning networks. Firstly, we introduce the background of pedestrian attribute recognition (PAR, for short), including the fundamental concepts of pedestrian attributes and the corresponding challenges. Secondly, we introduce existing benchmarks, including popular datasets and evaluation criteria. Thirdly, we analyse the concepts of multi-task learning and multi-label learning, and explain the relations between these two learning algorithms and pedestrian attribute recognition. We also review some popular network architectures which have been widely applied in the deep learning community. Fourthly, we analyse popular solutions for this task, such as attribute-group-based and part-based methods. Fifthly, we show some applications which take pedestrian attributes into consideration and achieve better performance. Finally, we summarize this paper and give several possible research directions for pedestrian attribute recognition. The project page of this paper can be found at the following website: https://sites.google.com/view/ahu-pedestrianattributes/. Comment: Check our project page for the high-resolution version of this survey: https://sites.google.com/view/ahu-pedestrianattributes
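
The abstract above relates pedestrian attribute recognition to multi-label learning. As a rough illustration only, the sketch below treats each attribute as an independent binary decision with a sigmoid output and averages the binary cross-entropy over attributes. The attribute names, logits and the 0.5 threshold are made up for the example and are not taken from the survey.

```python
import math
from typing import List


def sigmoid(z: float) -> float:
    """Numerically stable logistic function."""
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)


def multilabel_bce(logits: List[float], targets: List[int]) -> float:
    """Mean binary cross-entropy over independent attribute predictions."""
    if not logits or len(logits) != len(targets):
        raise ValueError("need one target per logit")
    eps = 1e-12  # guards against log(0)
    losses = []
    for z, t in zip(logits, targets):
        p = sigmoid(z)
        losses.append(-(t * math.log(p + eps) + (1 - t) * math.log(1.0 - p + eps)))
    return sum(losses) / len(losses)


# Hypothetical attributes and raw network scores for one pedestrian image.
attributes = ["backpack", "hat", "long_sleeves"]
logits = [2.0, -1.0, 0.0]
targets = [1, 0, 1]

# Unlike single-label classification, several attributes can be active at once.
predicted = [name for name, z in zip(attributes, logits) if sigmoid(z) >= 0.5]
loss = multilabel_bce(logits, targets)
print(predicted, round(loss, 4))
```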

arXiv.org e-Print Archive

An evaluation of the pedestrian classification in a multi-domain multi-modality setup

Author: Abdelaziz Bensrhair
Alberto Broggi
Alexandrina Rogozan
Alina Miron
Apatean
Bajracharya
Bertozzi
Besbes
Brown
Dalal
Davis
Dollar
Enzweiler
Enzweiler
Fan
Gandhi
Gavrila
Geiger
Geronimo
Jun
Krotosky
Labayrade
Nedevschi
Olmeda
Rohrbach
Samia Ainouz
Walk
Publication venue: 'MDPI AG'
Publication date: 01/01/2015

The objective of this article is to study the problem of pedestrian classification across different light spectrum domains (visible and far-infrared (FIR)) and modalities (intensity, depth and motion). In recent years, there have been a number of approaches for classifying and detecting pedestrians in both FIR and visible images, but the methods are difficult to compare, because either the datasets are not publicly available or they do not offer a comparison between the two domains. Our two primary contributions are the following: (1) we propose a public dataset, named RIFIR, containing both FIR and visible images collected in an urban environment from a moving vehicle during daytime; and (2) we compare the state-of-the-art features in a multi-modality setup: intensity, depth and flow, in far-infrared over visible domains. The experiments show that the feature families, intensity self-similarity (ISS), local binary patterns (LBP), local gradient patterns (LGP) and histogram of oriented gradients (HOG), computed from FIR and visible domains are highly complementary, but their relative performance varies across different modalities. In our experiments, the FIR domain has proven superior to the visible one for the task of pedestrian classification, but the overall best results are obtained by a multi-domain multi-modality multi-feature fusion.
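
The abstract above names local binary patterns (LBP) among the compared features and reports that fusing domains works best. The sketch below shows only the basic 3x3 LBP histogram and one simple fusion choice, concatenating the FIR and visible descriptors. Concatenation is an assumption for illustration; the article's actual feature parameters and fusion scheme are not stated in the abstract.

```python
from typing import List

Image = List[List[int]]

# Clockwise 8-neighbour offsets (row, col), starting at the top-left pixel.
OFFSETS = [(-1, -1), (-1, 0), (-1, 1), (0, 1), (1, 1), (1, 0), (1, -1), (0, -1)]


def lbp_histogram(image: Image) -> List[float]:
    """Normalised 256-bin histogram of basic 3x3 LBP codes over interior pixels."""
    hist = [0] * 256
    rows, cols = len(image), len(image[0])
    for r in range(1, rows - 1):
        for c in range(1, cols - 1):
            code = 0
            for bit, (dr, dc) in enumerate(OFFSETS):
                # A neighbour at least as bright as the centre sets its bit.
                if image[r + dr][c + dc] >= image[r][c]:
                    code |= 1 << bit
            hist[code] += 1
    total = sum(hist)
    return [h / total for h in hist] if total else [0.0] * 256


# Tiny hypothetical patches: a warm spot in FIR, a flat region in visible.
fir_patch = [[1, 1, 1], [1, 9, 1], [1, 1, 1]]
visible_patch = [[5, 5, 5], [5, 5, 5], [5, 5, 5]]

# Assumed fusion: concatenate the per-domain descriptors into one vector.
fused_descriptor = lbp_histogram(fir_patch) + lbp_histogram(visible_patch)
print(len(fused_descriptor))
```

The fused vector could then be passed to any classifier; which classifier the article uses is not stated in the abstract.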

HAL - Normandie Université

Multidisciplinary Digital Publishing Institute

Central Archive at the University of Reading

Crossref

Archivio istituzionale della Ricerca - Università degli Studi di Parma

Directory of Open Access Journals

PubMed Central

Hal-Diderot

Contrast invariant features for human detection in far infrared images

Author: Armingol Moreno José María
Escalera Hueso Arturo de la
Olmeda Reino Daniel
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2012

Proceeding of: 2012 IEEE Intelligent Vehicles Symposium (IV), Alcalá de Henares, Spain, June 3-7, 2012. In this paper a new contrast invariant descriptor for human detection in long-wave infrared images is proposed. It exploits local information in the form of a histogram of orientations of phase coherence. Contrast in infrared images depends on the temperature of the object and the background, which makes gradient based descriptors less robust, especially in daylight conditions. The objective is to obtain a scale, brightness and contrast invariant descriptor that can successfully detect pedestrians in images taken with a cheap, temperature-sensitive, uncooled microbolometer. The descriptor, packed into grids, is fed to a Support Vector Machine classifier. The algorithm has been tested in night and day sequences and its performance is compared with a day-only descriptor: the histogram of oriented gradients (HOG). This work was supported by the Spanish Government through the Cicyt projects FEDORA (GRANT TRA2010-20225-C03-01) and VIDAS-Driver (GRANT TRA2010-21371-C03-02), and the Comunidad de Madrid through the project SEGVAUTO (S2009/DPI-1509). Publicad

Crossref

Universidad Carlos III de Madrid e-Archivo