Dense Feature Aggregation and Pruning for RGBT Tracking
How to perform effective information fusion of different modalities is a core
factor in boosting the performance of RGBT tracking. This paper presents a
novel deep fusion algorithm based on the representations from an end-to-end
trained convolutional neural network. To deploy the complementarity of features
of all layers, we propose a recursive strategy to densely aggregate these
features that yield robust representations of target objects in each modality.
We further propose to prune the densely aggregated features of all modalities
in a collaborative way. Specifically, we employ the operations
of global average pooling and weighted random selection to perform channel
scoring and selection, which could remove redundant and noisy features to
achieve more robust feature representation. Experimental results on two RGBT
tracking benchmark datasets suggest that our tracker achieves clear
state-of-the-art performance against other RGB and RGBT tracking methods.
Comment: arXiv admin note: text overlap with arXiv:1811.0985
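The channel pruning step described above (global average pooling for channel scoring, then weighted random selection) can be sketched roughly as follows. This is a minimal illustrative sketch assuming per-channel scores drive a sampling-without-replacement step; the function name, shapes, and the `keep` parameter are hypothetical and not taken from the paper.

```python
import numpy as np

def prune_channels(features, keep=4, seed=0):
    """Score channels by global average pooling, then retain a subset
    via weighted random selection (probability proportional to score).
    `features`: array of shape (C, H, W). Illustrative sketch only,
    not the authors' exact implementation."""
    rng = np.random.default_rng(seed)
    num_channels = features.shape[0]
    scores = features.mean(axis=(1, 2))   # global average pooling per channel
    probs = scores / scores.sum()         # normalize scores to a distribution
    kept = rng.choice(num_channels, size=keep, replace=False, p=probs)
    return features[np.sort(kept)]        # drop low-scoring/noisy channels

# Toy feature map: 8 channels of 5x5 activations (non-negative so scores
# form a valid probability distribution).
feats = np.abs(np.random.default_rng(1).normal(size=(8, 5, 5)))
pruned = prune_channels(feats, keep=4)
print(pruned.shape)  # (4, 5, 5)
```

The randomness in selection (rather than a hard top-k) is one plausible reading of "weighted random selection": high-scoring channels are favored but not guaranteed, which can act as a regularizer.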
RGB-T Tracking Based on Mixed Attention
RGB-T tracking involves the use of images from both visible and thermal
modalities. The primary objective is to adaptively leverage the relatively
dominant modality in varying conditions to achieve more robust tracking
compared to single-modality tracking. An RGB-T tracker based on mixed attention
mechanism to achieve complementary fusion of modalities (referred to as MACFT)
is proposed in this paper. In the feature extraction stage, we utilize
different transformer backbone branches to extract specific and shared
information from different modalities. Mixed attention operations in the
backbone enable information interaction and self-enhancement between the
template and search images, constructing a robust feature representation
that better captures the high-level semantic features of the target. Then,
in the feature fusion stage, a modality-adaptive fusion is achieved through a
mixed attention-based modality fusion network, which suppresses the low-quality
modality noise while enhancing the information of the dominant modality.
Evaluation on multiple RGB-T public datasets demonstrates that our proposed
tracker outperforms other RGB-T trackers on general evaluation metrics while
also being able to adapt to long-term tracking scenarios.
Comment: 14 pages, 10 figures
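The cross-modal interaction at the heart of the mixed-attention fusion can be illustrated with plain scaled dot-product attention, where tokens of one modality attend to the other. This is a simplified sketch, not the MACFT fusion network itself; the shapes and the residual-style combination are assumptions for illustration.

```python
import numpy as np

def softmax(x, axis=-1):
    """Numerically stable softmax."""
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def cross_attention(query, context):
    """Scaled dot-product attention: `query` tokens of one modality
    attend to `context` tokens of the other. Shapes: (tokens, dim).
    Illustrative only; real mixed attention also includes projections,
    multiple heads, and self-attention branches."""
    dim = query.shape[-1]
    attn = softmax(query @ context.T / np.sqrt(dim))  # (tokens, tokens)
    return attn @ context                             # context-weighted sum

rng = np.random.default_rng(0)
rgb = rng.normal(size=(6, 16))      # toy RGB search-region tokens
thermal = rng.normal(size=(6, 16))  # toy thermal tokens
# Enhance the RGB branch with thermal cues via a residual combination.
fused = rgb + cross_attention(rgb, thermal)
print(fused.shape)  # (6, 16)
```

In a modality-adaptive design, the attention weights themselves down-weight tokens from a low-quality modality, which is one way the dominant modality can be emphasized.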
Articulated motion and deformable objects
This guest editorial introduces the twenty-two papers accepted for this Special Issue on Articulated Motion and Deformable Objects (AMDO). They are grouped into four main categories within the field of AMDO: human motion analysis (action/gesture), human pose estimation, deformable shape segmentation, and face analysis. For each of the four topics, a survey of the recent developments in the field is presented. The accepted papers are briefly introduced in the context of this survey. They contribute novel methods, algorithms with improved performance as measured on benchmarking datasets, as well as two new datasets for hand action detection and human posture analysis. The special issue should be of high relevance to readers interested in AMDO recognition and should promote future research directions in the field.
RGBT Salient Object Detection: A Large-scale Dataset and Benchmark
Salient object detection in complex scenes and environments is a challenging
research topic. Most works focus on RGB-based salient object detection, which
limits their performance in real-life applications when confronted with adverse
conditions such as dark environments and complex backgrounds. Taking advantage
of RGB and thermal infrared images has recently become a new research direction
for detecting salient objects in complex scenes, as thermal infrared
spectrum imaging provides complementary information and has been applied to
many computer vision tasks. However, current research for RGBT salient object
detection is limited by the lack of a large-scale dataset and comprehensive
benchmark. This work contributes such an RGBT image dataset named VT5000,
including 5000 spatially aligned RGBT image pairs with ground truth
annotations. VT5000 has 11 challenges collected in different scenes and
environments for exploring the robustness of algorithms. With this dataset, we
propose a powerful baseline approach, which extracts multi-level features
within each modality and aggregates these features of all modalities with the
attention mechanism, for accurate RGBT salient object detection. Extensive
experiments show that the proposed baseline approach outperforms the
state-of-the-art methods on the VT5000 dataset and two other public datasets. In
addition, we carry out a comprehensive analysis of different algorithms of RGBT
salient object detection on the VT5000 dataset, and then draw several valuable
conclusions and provide some potential research directions for RGBT salient
object detection.
Comment: 12 pages, 10 figures
https://github.com/lz118/RGBT-Salient-Object-Detectio
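The baseline's idea of aggregating multi-level features with an attention mechanism can be sketched as an attention-weighted sum over feature levels. This is a hypothetical simplification assuming all levels have already been resized to a common shape; the scoring rule (softmax over each level's mean response) stands in for the learned attention of the actual VT5000 baseline.

```python
import numpy as np

def aggregate_levels(levels):
    """Fuse multi-level features with a scalar attention weight per level,
    derived here from each level's global average response. Hypothetical
    sketch of attention-based aggregation, not the VT5000 baseline itself.
    `levels`: list of arrays sharing one shape, e.g. (C, H, W)."""
    scores = np.array([f.mean() for f in levels])
    e = np.exp(scores - scores.max())
    weights = e / e.sum()                        # softmax over levels
    return sum(w * f for w, f in zip(weights, levels))

rng = np.random.default_rng(0)
# Three toy feature levels, already resized to a common (C, H, W) shape.
levels = [rng.normal(size=(4, 8, 8)) for _ in range(3)]
fused = aggregate_levels(levels)
print(fused.shape)  # (4, 8, 8)
```

In a real network the per-level weights would be predicted by a small attention module (and could be spatial rather than scalar), but the aggregation pattern, weighting each level before summing, is the same.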