967 research outputs found
Hierarchical Bi-Directional Feature Perception Network for Person Re-Identification
Previous Person Re-Identification (Re-ID) models aim to focus on the most
discriminative region of an image, while its performance may be compromised
when that region is missing caused by camera viewpoint changes or occlusion. To
solve this issue, we propose a novel model named Hierarchical Bi-directional
Feature Perception Network (HBFP-Net) to correlate multi-level information and
reinforce each other. First, the correlation maps of cross-level feature-pairs
are modeled via low-rank bilinear pooling. Then, based on the correlation maps,
Bi-directional Feature Perception (BFP) module is employed to enrich the
attention regions of high-level feature, and to learn abstract and specific
information in low-level feature. And then, we propose a novel end-to-end
hierarchical network which integrates multi-level augmented features and inputs
the augmented low- and middle-level features to following layers to retrain a
new powerful network. What's more, we propose a novel trainable generalized
pooling, which can dynamically select any value of all locations in feature
maps to be activated. Extensive experiments implemented on the mainstream
evaluation datasets including Market-1501, CUHK03 and DukeMTMC-ReID show that
our method outperforms the recent SOTA Re-ID models.Comment: Accepted by ACM MM202
Part-Based Deep Hashing for Large-Scale Person Re-Identification
© 1992-2012 IEEE. Large-scale is a trend in person re-identi-fication (re-id). It is important that real-time search be performed in a large gallery. While previous methods mostly focus on discriminative learning, this paper makes the attempt in integrating deep learning and hashing into one framework to evaluate the efficiency and accuracy for large-scale person re-id. We integrate spatial information for discriminative visual representation by partitioning the pedestrian image into horizontal parts. Specifically, Part-based Deep Hashing (PDH) is proposed, in which batches of triplet samples are employed as the input of the deep hashing architecture. Each triplet sample contains two pedestrian images (or parts) with the same identity and one pedestrian image (or part) of the different identity. A triplet loss function is employed with a constraint that the Hamming distance of pedestrian images (or parts) with the same identity is smaller than ones with the different identity. In the experiment, we show that the proposed PDH method yields very competitive re-id accuracy on the large-scale Market-1501 and Market-1501+500K datasets
- …