20 research outputs found
Self-Supervised Gait Encoding with Locality-Aware Attention for Person Re-Identification
Gait-based person re-identification (Re-ID) is valuable for safety-critical
applications, and using only 3D skeleton data to extract discriminative gait
features for person Re-ID is an emerging open topic. Existing methods either
adopt hand-crafted features or learn gait features by traditional supervised
learning paradigms. Unlike previous methods, we for the first time propose a
generic gait encoding approach that can utilize unlabeled skeleton data to
learn gait representations in a self-supervised manner. Specifically, we first
propose to introduce self-supervision by learning to reconstruct input skeleton
sequences in reverse order, which facilitates learning richer high-level
semantics and better gait representations. Second, inspired by the fact that
motion's continuity endows temporally adjacent skeletons with higher
correlations ("locality"), we propose a locality-aware attention mechanism that
encourages learning larger attention weights for temporally adjacent skeletons
when reconstructing current skeleton, so as to learn locality when encoding
gait. Finally, we propose Attention-based Gait Encodings (AGEs), which are
built using context vectors learned by locality-aware attention, as final gait
representations. AGEs are directly utilized to realize effective person Re-ID.
Our approach typically improves existing skeleton-based methods by 10-20%
Rank-1 accuracy, and it achieves comparable or even superior performance to
multi-modal methods with extra RGB or depth information. Our codes are
available at https://github.com/Kali-Hac/SGE-LA.Comment: Accepted at IJCAI 2020 Main Track. Sole copyright holder is IJCAI.
Codes are available at https://github.com/Kali-Hac/SGE-L
PSDiff: Diffusion Model for Person Search with Iterative and Collaborative Refinement
Dominant Person Search methods aim to localize and recognize query persons in
a unified network, which jointly optimizes two sub-tasks, \ie, detection and
Re-IDentification (ReID). Despite significant progress, two major challenges
remain: 1) Detection-prior modules in previous methods are suboptimal for the
ReID task. 2) The collaboration between two sub-tasks is ignored. To alleviate
these issues, we present a novel Person Search framework based on the Diffusion
model, PSDiff. PSDiff formulates the person search as a dual denoising process
from noisy boxes and ReID embeddings to ground truths. Unlike existing methods
that follow the Detection-to-ReID paradigm, our denoising paradigm eliminates
detection-prior modules to avoid the local-optimum of the ReID task. Following
the new paradigm, we further design a new Collaborative Denoising Layer (CDL)
to optimize detection and ReID sub-tasks in an iterative and collaborative way,
which makes two sub-tasks mutually beneficial. Extensive experiments on the
standard benchmarks show that PSDiff achieves state-of-the-art performance with
fewer parameters and elastic computing overhead