2 research outputs found
Robust Template Update Strategy for Efficient Visual Object Tracking
Real-time visual object tracking is an open problem in computer vision, with multiple applications in the industry, such as autonomous vehicles, human-machine interaction, intelligent cinematography, automated surveillance, and autonomous social navigation. The challenge of tracking a target of interest is critical to all of these applications. Recently, tracking algorithms that use siamese neural networks trained offline on large-scale datasets of image pairs have achieved the best performance exceeding real-time speed on multiple benchmarks. Results show that siamese approaches can be applied to enhance the tracking capabilities by learning deeper features of the object’s appearance. SiamMask utilized the power of siamese networks and supervised learning approaches to solve the problem of arbitrary object tracking in real-time speed. However, its practical applications are limited due to failures encountered during testing. In order to improve the robustness of the tracker and make it applicable for the intended real-world application, two improvements have been incorporated, each addressing a different aspect of the tracking task. The first one is a data augmentation strategy to consider both motion-blur and low-resolution during training. It aims to increase the robustness of the tracker against a motion-blurred and low-resolution frames during inference. The second improvement is a target template update strategy that utilizes both the initial ground truth template and a supplementary updatable template, which considers the score of the predicted target for an efficient template update strategy by avoiding template updates during severe occlusion. All of the improvements were extensively evaluated and have achieved state-of-the-art performance in the VOT2018 and VOT2019 benchmarks. Our method (VPU-SiamM) has been submitted to the VOT-ST 2020 challenge, and it is ranked 16th out of 38 submitted tracking methods according to the Expected average overlap (EAO) metrics. VPU_SiamM Implementation can be found from the VOT2020 Trackers repository 1
Teachers in concordance for pseudo-labeling of 3D sequential data
Automatic pseudo-labeling is a powerful tool to tap into large amounts of
sequential unlabeled data. It is specially appealing in safety-critical
applications of autonomous driving, where performance requirements are extreme,
datasets are large, and manual labeling is very challenging. We propose to
leverage sequences of point clouds to boost the pseudolabeling technique in a
teacher-student setup via training multiple teachers, each with access to
different temporal information. This set of teachers, dubbed Concordance,
provides higher quality pseudo-labels for student training than standard
methods. The output of multiple teachers is combined via a novel pseudo label
confidence-guided criterion. Our experimental evaluation focuses on the 3D
point cloud domain and urban driving scenarios. We show the performance of our
method applied to 3D semantic segmentation and 3D object detection on three
benchmark datasets. Our approach, which uses only 20% manual labels,
outperforms some fully supervised methods. A notable performance boost is
achieved for classes rarely appearing in training data.Comment: This work has been submitted to the IEEE for publicatio