Search CORE

9,059 research outputs found

Deep View-Sensitive Pedestrian Attribute Inference in an end-to-end Model

Author: Sarfraz M. Saquib
Schumann Arne
Stiefelhagen Rainer
Wang Yan
Publication venue
Publication date: 01/01/2017
Field of study

Pedestrian attribute inference is a demanding problem in visual surveillance that can facilitate person retrieval, search and indexing. To exploit semantic relations between attributes, recent research treats it as a multi-label image classification task. The visual cues hinting at attributes can be strongly localized and inference of person attributes such as hair, backpack, shorts, etc., are highly dependent on the acquired view of the pedestrian. In this paper we assert this dependence in an end-to-end learning framework and show that a view-sensitive attribute inference is able to learn better attribute predictions. Our proposed model jointly predicts the coarse pose (view) of the pedestrian and learns specialized view-specific multi-label attribute predictions. We show in an extensive evaluation on three challenging datasets (PETA, RAP and WIDER) that our proposed end-to-end view-aware attribute prediction model provides competitive performance and improves on the published state-of-the-art on these datasets.Comment: accepted BMVC 201

arXiv.org e-Print Archive

Crossref

Fraunhofer-ePrints

Pose-Normalized Image Generation for Person Re-identification

Author: E Ristani
F Xiong
GE Hinton
H Shi
N Martinel
RR Varior
RR Varior
Shaogang Gong
SZ Chen
W Li
Publication venue
Publication date: 25/04/2018
Field of study

Person Re-identification (re-id) faces two major challenges: the lack of cross-view paired training data and learning discriminative identity-sensitive and view-invariant features in the presence of large pose variations. In this work, we address both problems by proposing a novel deep person image generation model for synthesizing realistic person images conditional on the pose. The model is based on a generative adversarial network (GAN) designed specifically for pose normalization in re-id, thus termed pose-normalization GAN (PN-GAN). With the synthesized images, we can learn a new type of deep re-id feature free of the influence of pose variations. We show that this feature is strong on its own and complementary to features learned with the original images. Importantly, under the transfer learning setting, we show that our model generalizes well to any new re-id dataset without the need for collecting any training data for model fine-tuning. The model thus has the potential to make re-id model truly scalable.Comment: 10 pages, 5 figure

arXiv.org e-Print Archive

Crossref