Online Domain Adaptation for Multi-Object Tracking

Authors: Gaidon Adrien; Vig Eleonora
Publication date: 01/01/2015

Automatically detecting, labeling, and tracking objects in videos depends first and foremost on accurate category-level object detectors. These might, however, not always be available in practice, as acquiring high-quality, large-scale labeled training datasets is either too costly or impractical for all possible real-world application scenarios. A scalable solution consists of reusing object detectors pre-trained on generic datasets. This work is the first to investigate the problem of on-line domain adaptation of object detectors for causal multi-object tracking (MOT). We propose to alleviate the dataset bias by adapting detectors from category to instances, and back: (i) we jointly learn all target models by adapting them from the pre-trained one, and (ii) we also adapt the pre-trained model on-line. We introduce an on-line multi-task learning algorithm to efficiently share parameters and reduce drift, while gradually improving recall. Our approach is applicable to any linear object detector, and we evaluate both cheap "mini-Fisher Vectors" and expensive "off-the-shelf" ConvNet features. We quantitatively measure the benefit of our domain adaptation strategy on the KITTI tracking benchmark and on a new dataset (PASCAL-to-KITTI) that we introduce to study the domain mismatch problem in MOT.
Comment: To appear at BMVC 201

arXiv.org e-Print Archive

Crossref

Robust Distributed Fusion with Labeled Random Finite Sets

Authors: Battistelli Giorgio; Hoseinnezhad Reza; Kong Lingjiang; Li Suqi; Wang Bailu; Yi Wei
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 02/10/2017

This paper considers the problem of the distributed fusion of multi-object posteriors in the labeled random finite set filtering framework, using the Generalized Covariance Intersection (GCI) method. Our analysis shows that GCI fusion with labeled multi-object densities strongly relies on label consistencies between local multi-object posteriors at different sensor nodes, and hence suffers from a severe performance degradation when perfect label consistencies are violated. Moreover, we mathematically analyze this phenomenon from the perspective of the Principle of Minimum Discrimination Information and the so-called yes-object probability. Inspired by the analysis, we propose a novel and general solution for the distributed fusion with labeled multi-object densities that is robust to label inconsistencies between sensors. Specifically, the labeled multi-object posteriors are first marginalized to their unlabeled posteriors, which are then fused using the GCI method. We also introduce a principled method to construct the labeled fused density and produce tracks formally. Based on the developed theoretical framework, we present tractable algorithms for the family of generalized labeled multi-Bernoulli (GLMB) filters, including δ-GLMB, marginalized δ-GLMB and labeled multi-Bernoulli filters. The robustness and efficiency of the proposed distributed fusion algorithm are demonstrated in challenging tracking scenarios via numerical experiments.
Comment: 17 pages, 23 figures
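As a minimal illustration of the GCI rule named in the abstract, the sketch below fuses two scalar Gaussian estimates through a normalized weighted geometric mean, which for Gaussians reduces to covariance intersection. It covers only the single-object Gaussian case and is not the paper's labeled multi-object algorithm; the function name `gci_fuse_gaussian` and the weight parameter `w` are assumptions introduced for illustration.

```python
def gci_fuse_gaussian(mean_a, var_a, mean_b, var_b, w=0.5):
    """Fuse N(mean_a, var_a) and N(mean_b, var_b) with GCI weight w in [0, 1].

    The normalized product p_a(x)**w * p_b(x)**(1 - w) of two Gaussians is
    again Gaussian, with precision equal to the weighted sum of precisions.
    """
    if not 0.0 <= w <= 1.0:
        raise ValueError("w must lie in [0, 1]")
    if var_a <= 0.0 or var_b <= 0.0:
        raise ValueError("variances must be positive")
    # Weighted sum of the two precisions (inverse variances).
    precision = w / var_a + (1.0 - w) / var_b
    fused_var = 1.0 / precision
    # Precision-weighted combination of the two means.
    fused_mean = fused_var * (w * mean_a / var_a + (1.0 - w) * mean_b / var_b)
    return fused_mean, fused_var


if __name__ == "__main__":
    # Two hypothetical sensor estimates of the same scalar state.
    print(gci_fuse_gaussian(0.0, 1.0, 2.0, 1.0, w=0.5))
```

With equal variances and `w = 0.5`, the fused variance equals the input variance instead of shrinking. This reflects the conservative nature of GCI, which does not assume the two estimates are independent.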

arXiv.org e-Print Archive

Florence Research

RMIT Research Repository

Jointly Tracking and Separating Speech Sources Using Multiple Features and the Generalized Labeled Multi-Bernoulli Framework

Author: Lin Shoufeng
Publication date: 16/04/2018

This paper proposes a novel joint multi-speaker tracking-and-separation method based on the generalized labeled multi-Bernoulli (GLMB) multi-target tracking filter, using sound mixtures recorded by microphones. Standard multi-speaker tracking algorithms usually only track speaker locations, and ambiguity occurs when speakers are spatially close. The proposed multi-feature GLMB tracking filter treats the set of vectors of associated speaker features (location, pitch and sound) as the multi-target multi-feature observation and characterizes transitioning features with corresponding transition models and an overall likelihood function. It thus jointly tracks and separates each multi-feature speaker and addresses the spatial ambiguity problem. Numerical evaluation verifies that the proposed method can correctly track the locations of multiple speakers while separating their speech signals.

arXiv.org e-Print Archive

Crossref

Class-Agnostic Counting

Authors: A Lehmussola; C Arteta; C Zhang; H Idrees; L Bertinetto; O Russakovsky; S Cho; T Leung; TN Mundhenk; V Ranjan
Publication date: 01/11/2018

Nearly all existing counting methods are designed for a specific object class. Our work, however, aims to create a counting model able to count any class of object. To achieve this goal, we formulate counting as a matching problem, enabling us to exploit the image self-similarity property that naturally exists in object counting problems. We make the following three contributions: first, a Generic Matching Network (GMN) architecture that can potentially count any object in a class-agnostic manner; second, by reformulating the counting problem as one of matching objects, we can take advantage of the abundance of video data labeled for tracking, which contains natural repetitions suitable for training a counting model. Such data enables us to train the GMN. Third, to customize the GMN to different user requirements, an adapter module is used to specialize the model with minimal effort, i.e. using a few labeled examples, and adapting only a small fraction of the trained parameters. This is a form of few-shot learning, which is practical for domains where labels are limited due to requiring expert knowledge (e.g. microbiology). We demonstrate the flexibility of our method on a diverse set of existing counting benchmarks: specifically cells, cars, and human crowds. The model achieves competitive performance on cell and crowd counting datasets, and surpasses the state-of-the-art on the car dataset using only three training images. When training on the entire dataset, the proposed method outperforms all previous methods by a large margin.
Comment: Asian Conference on Computer Vision (ACCV), 201
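The idea of counting as matching can be illustrated with a toy sketch: given a feature vector for one exemplar object and feature vectors for candidate image patches, count the patches whose cosine similarity to the exemplar exceeds a threshold. This is not the Generic Matching Network described in the abstract, which learns the matching function; the hand-made features, the function names, and the threshold value are all assumptions made for illustration.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity of two equal-length vectors (0.0 if either is zero)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def count_by_matching(exemplar, patches, threshold=0.9):
    """Count patches whose features match the exemplar (hypothetical helper)."""
    return sum(
        1 for patch in patches
        if cosine_similarity(exemplar, patch) >= threshold
    )


if __name__ == "__main__":
    # Hypothetical 2-D features: three patches resemble the exemplar, one does not.
    exemplar = [1.0, 0.0]
    patches = [[1.0, 0.0], [0.9, 0.1], [0.0, 1.0], [2.0, 0.1]]
    print(count_by_matching(exemplar, patches))
```

Because the exemplar is supplied at query time, the same routine applies to any object class, which is the sense in which matching-based counting is class-agnostic.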

arXiv.org e-Print Archive

Crossref

Oxford University Research Archive

PPF - A Parallel Particle Filtering Library

Authors: Demirel Ömer; Meijering Erik; Niessen Wiro; Sbalzarini Ivo F.; Smal Ihor
Publication date: 01/01/2014

We present the parallel particle filtering (PPF) software library, which enables hybrid shared-memory/distributed-memory parallelization of particle filtering (PF) algorithms combining the Message Passing Interface (MPI) with multithreading for multi-level parallelism. The library is implemented in Java and relies on OpenMPI's Java bindings for inter-process communication. It includes dynamic load balancing, multi-thread balancing, and several algorithmic improvements for PF, such as input-space domain decomposition. The PPF library hides the difficulties of efficient parallel programming of PF algorithms and provides application developers with the necessary tools for parallel implementation of PF methods. We demonstrate the capabilities of the PPF library using two distributed PF algorithms in two scenarios with different numbers of particles. The PPF library runs a 38 million particle problem, corresponding to more than 1.86 GB of particle data, on 192 cores with 67% parallel efficiency. To the best of our knowledge, the PPF library is the first open-source software that offers a parallel framework for PF applications.
Comment: 8 pages, 8 figures; will appear in the proceedings of the IET Data Fusion & Target Tracking Conference 201
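The scaling figures quoted in the abstract can be unpacked with a little arithmetic. The sketch below assumes the usual definition of parallel efficiency as speedup divided by core count (the abstract does not state the baseline) and reads 1 GB as 10^9 bytes; both are assumptions, and the helper names are hypothetical.

```python
def implied_speedup(parallel_efficiency, cores):
    """Speedup implied by efficiency = speedup / cores (assumed definition)."""
    return parallel_efficiency * cores


def bytes_per_particle(total_bytes, particles):
    """Average storage per particle for a given total data size."""
    return total_bytes / particles


if __name__ == "__main__":
    # Figures quoted in the abstract: 67% efficiency on 192 cores,
    # 38 million particles, more than 1.86 GB of particle data.
    print(round(implied_speedup(0.67, 192), 2))
    print(round(bytes_per_particle(1.86e9, 38_000_000), 1))
```

Under these assumptions the run corresponds to a speedup of roughly 129 over the reference configuration and to about 49 bytes per particle. Since the abstract says "more than 1.86 GB", the per-particle figure is a lower bound.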

arXiv.org e-Print Archive

EUR Research Repository

MPG.PuRe