Search CORE

1,156 research outputs found

Recommended from our members

Sparse representation matching for person re-identification

Author: An Le
Bhanu Bir
Chen Xiaojing
Yang Songfan
Publication venue: eScholarship, University of California
Publication date: 01/01/2016
Field of study

eScholarship - University of California

Beyond Intra-modality: A Survey of Heterogeneous Person Re-identification

Author: Satoh Shin'ichi
Wang Zheng
Wang Zhixiang
Wu Yang
Zeng Wenjun
Zheng Yinqiang
Publication venue
Publication date: 27/04/2020
Field of study

An efficient and effective person re-identification (ReID) system relieves the users from painful and boring video watching and accelerates the process of video analysis. Recently, with the explosive demands of practical applications, a lot of research efforts have been dedicated to heterogeneous person re-identification (Hetero-ReID). In this paper, we provide a comprehensive review of state-of-the-art Hetero-ReID methods that address the challenge of inter-modality discrepancies. According to the application scenario, we classify the methods into four categories -- low-resolution, infrared, sketch, and text. We begin with an introduction of ReID, and make a comparison between Homogeneous ReID (Homo-ReID) and Hetero-ReID tasks. Then, we describe and compare existing datasets for performing evaluations, and survey the models that have been widely employed in Hetero-ReID. We also summarize and compare the representative approaches from two perspectives, i.e., the application scenario and the learning pipeline. We conclude by a discussion of some future research directions. Follow-up updates are avaible at: https://github.com/lightChaserX/Awesome-Hetero-reIDComment: Accepted by IJCAI 2020. Project url: https://github.com/lightChaserX/Awesome-Hetero-reI

arXiv.org e-Print Archive

Crossref

Multi-modal Image Processing based on Coupled Dictionary Learning

Author: Rodrigues Miguel R. D.
Song Pingfan
Publication venue
Publication date: 01/01/2018
Field of study

In real-world scenarios, many data processing problems often involve heterogeneous images associated with different imaging modalities. Since these multimodal images originate from the same phenomenon, it is realistic to assume that they share common attributes or characteristics. In this paper, we propose a multi-modal image processing framework based on coupled dictionary learning to capture similarities and disparities between different image modalities. In particular, our framework can capture favorable structure similarities across different image modalities such as edges, corners, and other elementary primitives in a learned sparse transform domain, instead of the original pixel domain, that can be used to improve a number of image processing tasks such as denoising, inpainting, or super-resolution. Practical experiments demonstrate that incorporating multimodal information using our framework brings notable benefits.Comment: SPAWC 2018, 19th IEEE International Workshop On Signal Processing Advances In Wireless Communication

arXiv.org e-Print Archive

Crossref

UCL Discovery

How Does the Low-Rank Matrix Decomposition Help Internal and External Learnings for Super-Resolution

Author: Ji Peiyuan
Jiao Licheng
Liang Xuefeng
Wang Shuang
Yue Bo
Publication venue
Publication date: 30/06/2017
Field of study

Wisely utilizing the internal and external learning methods is a new challenge in super-resolution problem. To address this issue, we analyze the attributes of two methodologies and find two observations of their recovered details: 1) they are complementary in both feature space and image plane, 2) they distribute sparsely in the spatial space. These inspire us to propose a low-rank solution which effectively integrates two learning methods and then achieves a superior result. To fit this solution, the internal learning method and the external learning method are tailored to produce multiple preliminary results. Our theoretical analysis and experiment prove that the proposed low-rank solution does not require massive inputs to guarantee the performance, and thereby simplifying the design of two learning methods for the solution. Intensive experiments show the proposed solution improves the single learning method in both qualitative and quantitative assessments. Surprisingly, it shows more superior capability on noisy images and outperforms state-of-the-art methods

arXiv.org e-Print Archive

Inter-task association critic for cross-resolution person re-identification

Author: Cheng Z
Dong Q
Gong S
IEEE Conference on Computer Vision and Pattern Recognition
Zhu X
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2020
Field of study

Person images captured by unconstrained surveillance cameras often have low resolutions (LR). This causes the resolution mismatch problem when matched against the high-resolution (HR) gallery images, negatively affecting the performance of person re-identification (re-id). An effective approach is to leverage image super-resolution (SR) along with person re-id in a joint learning manner. However, this scheme is limited due to dramatically more difficult gradients backpropagation during training. In this paper, we introduce a novel model training regularisation method, called Inter-Task Association Critic (INTACT), to address this fundamental problem. Specifically, INTACT discovers the underlying association knowledge between image SR and person re-id, and leverages it as an extra learning constraint for enhancing the compatibility of SR model with person re-id in HR image space. This is realised by parameterising the association constraint which enables it to be automatically learned from the training data. Extensive experiments validate the superiority of INTACT over the state-of-the-art approaches on the cross-resolution re-id task using five standard person re-id datasets

Queen Mary Research Online