Search CORE

29,573 research outputs found

Full-3D Edge Tracking with a Particle Filter

Author
Publication venue: 'British Machine Vision Association and Society for Pattern Recognition'
Publication date: 01/01/2006
Field of study

Single camera pose estimation using Bayesian filtering and Kinect motion priors

Author: Burke Michael
Lasenby Joan
Publication venue
Publication date: 17/06/2014
Field of study

Traditional approaches to upper body pose estimation using monocular vision rely on complex body models and a large variety of geometric constraints. We argue that this is not ideal and somewhat inelegant as it results in large processing burdens, and instead attempt to incorporate these constraints through priors obtained directly from training data. A prior distribution covering the probability of a human pose occurring is used to incorporate likely human poses. This distribution is obtained offline, by fitting a Gaussian mixture model to a large dataset of recorded human body poses, tracked using a Kinect sensor. We combine this prior information with a random walk transition model to obtain an upper body model, suitable for use within a recursive Bayesian filtering framework. Our model can be viewed as a mixture of discrete Ornstein-Uhlenbeck processes, in that states behave as random walks, but drift towards a set of typically observed poses. This model is combined with measurements of the human head and hand positions, using recursive Bayesian estimation to incorporate temporal information. Measurements are obtained using face detection and a simple skin colour hand detector, trained using the detected face. The suggested model is designed with analytical tractability in mind and we show that the pose tracking can be Rao-Blackwellised using the mixture Kalman filter, allowing for computational efficiency while still incorporating bio-mechanical properties of the upper body. In addition, the use of the proposed upper body model allows reliable three-dimensional pose estimates to be obtained indirectly for a number of joints that are often difficult to detect using traditional object recognition strategies. Comparisons with Kinect sensor results and the state of the art in 2D pose estimation highlight the efficacy of the proposed approach.Comment: 25 pages, Technical report, related to Burke and Lasenby, AMDO 2014 conference paper. Code sample: https://github.com/mgb45/SignerBodyPose Video: https://www.youtube.com/watch?v=dJMTSo7-uF

arXiv.org e-Print Archive

CiteSeerX

Physical simulation for monocular 3D model based tracking

Author: Damien Jade Duff
Jeremy Wyatt
Rustam Stolkin
Student Member
Thomas Mörwald
Publication venue
Publication date: 09/05/2011
Field of study

The problem of model-based object tracking in three dimensions is addressed. Most previous work on tracking assumes simple motion models, and consequently tracking typically fails in a variety of situations. Our insight is that incorporating physics models of object behaviour improves tracking performance in these cases. In particular it allows us to handle tracking in the face of rigid body interactions where there is also occlusion and fast object motion. We show how to incorporate rigid body physics simulation into a particle filter. We present two methods for this based on pose and force noise. The improvements are tested on four videos of a robot pushing an object, and results indicate that our approach performs considerably better than a plain particle filter tracker, with the force noise method producing the best results over the range of test videos

CiteSeerX

Crossref

University of Birmingham Research Portal

Event-based Vision: A Survey

Author: Bartolozzi Chiara
Censi Andrea
Conradt Joerg
Daniilidis Kostas
Davison Andrew
Delbruck Tobi
Gallego Guillermo
Leutenegger Stefan
Orchard Garrick
Scaramuzza Davide
Taba Brian
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2019
Field of study

Event cameras are bio-inspired sensors that differ from conventional frame cameras: Instead of capturing images at a fixed rate, they asynchronously measure per-pixel brightness changes, and output a stream of events that encode the time, location and sign of the brightness changes. Event cameras offer attractive properties compared to traditional cameras: high temporal resolution (in the order of microseconds), very high dynamic range (140 dB vs. 60 dB), low power consumption, and high pixel bandwidth (on the order of kHz) resulting in reduced motion blur. Hence, event cameras have a large potential for robotics and computer vision in challenging scenarios for traditional cameras, such as low-latency, high speed, and high dynamic range. However, novel methods are required to process the unconventional output of these sensors in order to unlock their potential. This paper provides a comprehensive overview of the emerging field of event-based vision, with a focus on the applications and the algorithms developed to unlock the outstanding properties of event cameras. We present event cameras from their working principle, the actual sensors that are available and the tasks that they have been used for, from low-level vision (feature detection and tracking, optic flow, etc.) to high-level vision (reconstruction, segmentation, recognition). We also discuss the techniques developed to process events, including learning-based techniques, as well as specialized processors for these novel sensors, such as spiking neural networks. Additionally, we highlight the challenges that remain to be tackled and the opportunities that lie ahead in the search for a more efficient, bio-inspired way for machines to perceive and interact with the world

arXiv.org e-Print Archive

Infoscience - École polytechnique fédérale de Lausanne

ZORA

Reasoning About Liquids via Closed-Loop Simulation

Author: Fox Dieter
Schenck Connor
Publication venue
Publication date: 09/06/2017
Field of study

Simulators are powerful tools for reasoning about a robot's interactions with its environment. However, when simulations diverge from reality, that reasoning becomes less useful. In this paper, we show how to close the loop between liquid simulation and real-time perception. We use observations of liquids to correct errors when tracking the liquid's state in a simulator. Our results show that closed-loop simulation is an effective way to prevent large divergence between the simulated and real liquid states. As a direct consequence of this, our method can enable reasoning about liquids that would otherwise be infeasible due to large divergences, such as reasoning about occluded liquid.Comment: Robotics: Science & Systems (RSS), July 12-16, 2017. Cambridge, MA, US

arXiv.org e-Print Archive

Crossref

First Demonstration of a Pixelated Charge Readout for Single-Phase Liquid Argon Time Projection Chambers

Author: Asaadi J.
Auger M.
Ereditato A.
Goeldi D.
Haenni R.
Kose U.
Kreslo I
Lorca D.
Luethi M.
Sinclair J. R.
Stocker F.
Tognina C.
Von Rohr C. Rudolf
Weber M.
Publication venue
Publication date: 26/01/2018
Field of study

Liquid Argon Time Projection Chambers (LArTPCs) have been selected for the future long-baseline Deep Underground Neutrino Experiment (DUNE). To allow LArTPCs to operate in the high-multiplicity near detector environment of DUNE, a new charge readout technology is required. Traditional charge readout technologies introduce intrinsic ambiguities, combined with a slow detector response, these ambiguities have limited the performance of LArTPCs, until now. Here, we present a novel pixelated charge readout that enables the full 3D tracking capabilities of LArTPCs. We characterise the signal to noise ratio of charge readout chain, to be about 14, and demonstrate track reconstruction on 3D space points produced by the pixel readout. This pixelated charge readout makes LArTPCs a viable option for the DUNE near detector complex.Comment: 13 pages, 9 figure

arXiv.org e-Print Archive

CERN Document Server