41,277 research outputs found
Long-Term Visual Object Tracking Benchmark
We propose a new long video dataset (called Track Long and Prosper - TLP) and
benchmark for single object tracking. The dataset consists of 50 HD videos from
real world scenarios, encompassing a duration of over 400 minutes (676K
frames), making it more than 20 folds larger in average duration per sequence
and more than 8 folds larger in terms of total covered duration, as compared to
existing generic datasets for visual tracking. The proposed dataset paves a way
to suitably assess long term tracking performance and train better deep
learning architectures (avoiding/reducing augmentation, which may not reflect
real world behaviour). We benchmark the dataset on 17 state of the art trackers
and rank them according to tracking accuracy and run time speeds. We further
present thorough qualitative and quantitative evaluation highlighting the
importance of long term aspect of tracking. Our most interesting observations
are (a) existing short sequence benchmarks fail to bring out the inherent
differences in tracking algorithms which widen up while tracking on long
sequences and (b) the accuracy of trackers abruptly drops on challenging long
sequences, suggesting the potential need of research efforts in the direction
of long-term tracking.Comment: ACCV 2018 (Oral
CDTB: A Color and Depth Visual Object Tracking Dataset and Benchmark
A long-term visual object tracking performance evaluation methodology and a
benchmark are proposed. Performance measures are designed by following a
long-term tracking definition to maximize the analysis probing strength. The
new measures outperform existing ones in interpretation potential and in better
distinguishing between different tracking behaviors. We show that these
measures generalize the short-term performance measures, thus linking the two
tracking problems. Furthermore, the new measures are highly robust to temporal
annotation sparsity and allow annotation of sequences hundreds of times longer
than in the current datasets without increasing manual annotation labor. A new
challenging dataset of carefully selected sequences with many target
disappearances is proposed. A new tracking taxonomy is proposed to position
trackers on the short-term/long-term spectrum. The benchmark contains an
extensive evaluation of the largest number of long-term tackers and comparison
to state-of-the-art short-term trackers. We analyze the influence of tracking
architecture implementations to long-term performance and explore various
re-detection strategies as well as influence of visual model update strategies
to long-term tracking drift. The methodology is integrated in the VOT toolkit
to automate experimental analysis and benchmarking and to facilitate future
development of long-term trackers
Learning Spatial Distribution of Long-Term Trackers Scores
Long-Term tracking is a hot topic in Computer Vision. In this context,
competitive models are presented every year, showing a constant growth rate in
performances, mainly measured in standardized protocols as Visual Object
Tracking (VOT) and Object Tracking Benchmark (OTB). Fusion-trackers strategy
has been applied over last few years for overcoming the known re-detection
problem, turning out to be an important breakthrough. Following this approach,
this work aims to generalize the fusion concept to an arbitrary number of
trackers used as baseline trackers in the pipeline, leveraging a learning phase
to better understand how outcomes correlate with each other, even when no
target is present. A model and data independence conjecture will be evidenced
in the manuscript, yielding a recall of 0.738 on LTB-50 dataset when learning
from VOT-LT2022, and 0.619 by reversing the two datasets. In both cases,
results are strongly competitive with state-of-the-art and recall turns out to
be the first on the podium.Comment: 20 pages, 11 figures, 3 table
Long-term Tracking in the Wild: A Benchmark
We introduce the OxUvA dataset and benchmark for evaluating single-object
tracking algorithms. Benchmarks have enabled great strides in the field of
object tracking by defining standardized evaluations on large sets of diverse
videos. However, these works have focused exclusively on sequences that are
just tens of seconds in length and in which the target is always visible.
Consequently, most researchers have designed methods tailored to this
"short-term" scenario, which is poorly representative of practitioners' needs.
Aiming to address this disparity, we compile a long-term, large-scale tracking
dataset of sequences with average length greater than two minutes and with
frequent target object disappearance. The OxUvA dataset is much larger than the
object tracking datasets of recent years: it comprises 366 sequences spanning
14 hours of video. We assess the performance of several algorithms, considering
both the ability to locate the target and to determine whether it is present or
absent. Our goal is to offer the community a large and diverse benchmark to
enable the design and evaluation of tracking methods ready to be used "in the
wild". The project website is http://oxuva.netComment: To appear at ECCV 201
- …