Search CORE

33 research outputs found

Extended Object Tracking: Introduction, Overview and Applications

Author: Baum Marcus
Granstrom Karl
Reuter Stephan
Publication venue
Publication date: 01/01/2017
Field of study

This article provides an elaborate overview of current research in extended object tracking. We provide a clear definition of the extended object tracking problem and discuss its delimitation to other types of object tracking. Next, different aspects of extended object modelling are extensively discussed. Subsequently, we give a tutorial introduction to two basic and well used extended object tracking approaches - the random matrix approach and the Kalman filter-based approach for star-convex shapes. The next part treats the tracking of multiple extended objects and elaborates how the large number of feasible association hypotheses can be tackled using both Random Finite Set (RFS) and Non-RFS multi-object trackers. The article concludes with a summary of current applications, where four example applications involving camera, X-band radar, light detection and ranging (lidar), red-green-blue-depth (RGB-D) sensors are highlighted.Comment: 30 pages, 19 figure

arXiv.org e-Print Archive

Chalmers Research

Chalmers Publication Library

Decentralized Poisson Multi-Bernoulli Filtering for Vehicle Tracking

Author: Fröhle Markus
Granström Karl
Wymeersch Henk
Publication venue
Publication date: 01/01/2020
Field of study

A decentralized Poisson multi-Bernoulli filter is proposed to track multiple vehicles using multiple high-resolution sensors. Independent filters estimate the vehicles' presence, state, and shape using a Gaussian process extent model; a decentralized filter is realized through fusion of the filters posterior densities. An efficient implementation is achieved by parametric state representation, utilization of single hypothesis tracks, and fusion of vehicle information based on a fusion mapping. Numerical results demonstrate the performance.Comment: 14 pages, 5 figure

arXiv.org e-Print Archive

Chalmers Research

Multi-object tracking in video using labeled random finite sets

Author: Rathnayake T
Publication venue: RMIT University
Publication date
Field of study

The safety of industrial mobile platforms (such as fork lifts and boom lifts) is of major concern in the world today as industry embraces the concepts of Industry 4.0. The existing safety methods are predominantly based on Radio Frequency Identification (RFID) technology and therefore can only determine the distance at which a pedestrian who is wearing an RFID tag is standing. Other methods use expensive laser scanners to map the surrounding and warn the driver accordingly. The aim of this research project is to improve the safety of industrial mobile platforms, by detecting and tracking pedestrians in the path of the mobile platform, using readily available cheap camera modules. In order to achieve this aim, this research focuses on multi-object tracking which is one of the most ubiquitously addressed problems in the field of \textit{Computer Vision}. Algorithms that can track targets under severe conditions, such as varying number of objects, occlusion, illumination changes and abrupt movements of the objects are investigated in this research project. Furthermore, a substantial focus is given to improving the accuracy and, performance and to handling misdetections and false alarms. In order to formulate these algorithms, the recently introduced concept of Random Finite Sets (RFS) is used as the underlying mathematical framework. The algorithms formulated to meet the above criteria were tested on standard visual tracking datasets as well as on a dataset which was created by our research group, for performance and accuracy using standard performance and accuracy metrics that are widely used in the computer vision literature. These results were compared with numerous state-of-the-art methods and are shown to outperform or perform favourably in terms of the metrics mentioned above

RMIT Research Repository

Traffic Scene Perception for Automated Driving with Top-View Grid Maps

Author: Wirges Sascha
Publication venue: KIT-Bibliothek, Karlsruhe
Publication date: 09/06/2021
Field of study

Ein automatisiertes Fahrzeug muss sichere, sinnvolle und schnelle Entscheidungen auf Basis seiner Umgebung treffen. Dies benötigt ein genaues und recheneffizientes Modell der Verkehrsumgebung. Mit diesem Umfeldmodell sollen Messungen verschiedener Sensoren fusioniert, gefiltert und nachfolgenden Teilsysteme als kompakte, aber aussagekräftige Information bereitgestellt werden. Diese Arbeit befasst sich mit der Modellierung der Verkehrsszene auf Basis von Top-View Grid Maps. Im Vergleich zu anderen Umfeldmodellen ermöglichen sie eine frühe Fusion von Distanzmessungen aus verschiedenen Quellen mit geringem Rechenaufwand sowie eine explizite Modellierung von Freiraum. Nach der Vorstellung eines Verfahrens zur Bodenoberflächenschätzung, das die Grundlage der Top-View Modellierung darstellt, werden Methoden zur Belegungs- und Elevationskartierung für Grid Maps auf Basis von mehreren, verrauschten, teilweise widersprüchlichen oder fehlenden Distanzmessungen behandelt. Auf der resultierenden, sensorunabhängigen Repräsentation werden anschließend Modelle zur Detektion von Verkehrsteilnehmern sowie zur Schätzung von Szenenfluss, Odometrie und Tracking-Merkmalen untersucht. Untersuchungen auf öffentlich verfügbaren Datensätzen und einem Realfahrzeug zeigen, dass Top-View Grid Maps durch on-board LiDAR Sensorik geschätzt und verlässlich sicherheitskritische Umgebungsinformationen wie Beobachtbarkeit und Befahrbarkeit abgeleitet werden können. Schließlich werden Verkehrsteilnehmer als orientierte Bounding Boxen mit semantischen Klassen, Geschwindigkeiten und Tracking-Merkmalen aus einem gemeinsamen Modell zur Objektdetektion und Flussschätzung auf Basis der Top-View Grid Maps bestimmt

KITopen

Online Audio-Visual Multi-Source Tracking and Separation: A Labeled Random Finite Set Approach

Author: Ong Jonah Soon Xuan
Publication venue: Curtin University
Publication date: 01/01/2021
Field of study

The dissertation proposes an online solution for separating an unknown and time-varying number of moving sources using audio and visual data. The random finite set framework is used for the modeling and fusion of audio and visual data. This enables an online tracking algorithm to estimate the source positions and identities for each time point. With this information, a set of beamformers can be designed to separate each desired source and suppress the interfering sources

espace@Curtin

Improved robustness and efficiency for automatic visual site monitoring

Author: Dalley Gerald Edwin
Publication venue: Massachusetts Institute of Technology
Publication date: 01/01/2009
Field of study

Thesis (Ph. D.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer Science, 2009.This electronic version was submitted by the student author. The certified thesis is available in the Institute Archives and Special Collections.Cataloged from student-submitted PDF version of thesis.Includes bibliographical references (p. 219-228).Knowing who people are, where they are, what they are doing, and how they interact with other people and things is valuable from commercial, security, and space utilization perspectives. Video sensors backed by computer vision algorithms are a natural way to gather this data. Unfortunately, key technical issues persist in extracting features and models that are simultaneously efficient to compute and robust to issues such as adverse lighting conditions, distracting background motions, appearance changes over time, and occlusions. In this thesis, we present a set of techniques and model enhancements to better handle these problems, focusing on contributions in four areas. First, we improve background subtraction so it can better handle temporally irregular dynamic textures. This allows us to achieve a 5.5% drop in false positive rate on the Wallflower waving trees video. Secondly, we adapt the Dalal and Triggs Histogram of Oriented Gradients pedestrian detector to work on large-scale scenes with dense crowds and harsh lighting conditions: challenges which prevent us from easily using a background subtraction solution. These scenes contain hundreds of simultaneously visible people. To make using the algorithm computationally feasible, we have produced a novel implementation that runs on commodity graphics hardware and is up to 76 faster than our CPU-only implementation. We demonstrate the utility of this detector by modeling scene-level activities with a Hierarchical Dirichlet Process.(cont.) Third, we show how one can improve the quality of pedestrian silhouettes for recognizing individual people. We combine general appearance information from a large population of pedestrians with semi-periodic shape information from individual silhouette sequences. Finally, we show how one can combine a variety of detection and tracking techniques to robustly handle a variety of event detection scenarios such as theft and left-luggage detection. We present the only complete set of results on a standardized collection of very challenging videos.by Gerald Edwin Dalley.Ph.D

CiteSeerX

DSpace@MIT

Gaussian Process Methods for Group, Extended and Point Target Tracking and Smoothing

Author: Aftab Waqas
Publication venue: 'University of Sheffield Conference Proceedings'
Publication date: 16/09/2020
Field of study

White Rose E-theses Online

Recommended from our members

Efficient Bayesian active learning and matrix modelling

Author: Houlsby Neil
Publication venue: University of Cambridge
Publication date: 11/11/2014
Field of study

With the advent of the Internet and growth of storage capabilities, large collections of unlabelled data are now available. However, collecting supervised labels can be costly. Active learning addresses this by selecting, sequentially, only the most useful data in light of the information collected so far. The online nature of such algorithms often necessitates efficient computations. Thus, we present a framework for information theoretic Bayesian active learning, named Bayesian Active Learning by Disagreement, that permits efficient and accurate computations of data utility. Using this framework we develop new techniques for active Gaussian process modelling and adaptive quantum tomography. The latter has been shown, in both simulation and laboratory experiments, to yield faster learning rates than any non-adaptive design. Numerous datasets can be represented as matrices. Bayesian models of matrices are becoming increasingly popular because they can handle noisy or missing elements, and are extensible to different data-types. However, efficient inference is crucial to allow these flexible probabilistic models to scale to large real-world datasets. Binary matrices are a ubiquitous datatype, so we present a stochastic inference algorithm for fast learning in this domain. Preference judgements are a common, implicit source of binary data. We present a hybrid matrix factorization/Gaussian process model for collaborative learning from multiple users' preferences. This model exploits both the structure of the matrix and can incorporate additional covariate information to make accurate predictions. We then combine matrix modelling with active learning and propose a new algorithm for cold-start learning with ordinal data, such as ratings. This algorithm couples Bayesian Active Learning by Disagreement with a heteroscedastic model to handle varying levels of noise. This ordinal matrix model is also used to analyze psychometric questionnaires; we analyze classical assumptions made in psychometrics and show that active learning methods can reduce questionnaire lengths substantially.This PhD was supported by the Google European Doctoral Fellowshi

Apollo (Cambridge)