Search CORE

10,619 research outputs found

Event-based Vision: A Survey

Author: Bartolozzi Chiara
Censi Andrea
Conradt Joerg
Daniilidis Kostas
Davison Andrew
Delbruck Tobi
Gallego Guillermo
Leutenegger Stefan
Orchard Garrick
Scaramuzza Davide
Taba Brian
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2019
Field of study

Event cameras are bio-inspired sensors that differ from conventional frame cameras: Instead of capturing images at a fixed rate, they asynchronously measure per-pixel brightness changes, and output a stream of events that encode the time, location and sign of the brightness changes. Event cameras offer attractive properties compared to traditional cameras: high temporal resolution (in the order of microseconds), very high dynamic range (140 dB vs. 60 dB), low power consumption, and high pixel bandwidth (on the order of kHz) resulting in reduced motion blur. Hence, event cameras have a large potential for robotics and computer vision in challenging scenarios for traditional cameras, such as low-latency, high speed, and high dynamic range. However, novel methods are required to process the unconventional output of these sensors in order to unlock their potential. This paper provides a comprehensive overview of the emerging field of event-based vision, with a focus on the applications and the algorithms developed to unlock the outstanding properties of event cameras. We present event cameras from their working principle, the actual sensors that are available and the tasks that they have been used for, from low-level vision (feature detection and tracking, optic flow, etc.) to high-level vision (reconstruction, segmentation, recognition). We also discuss the techniques developed to process events, including learning-based techniques, as well as specialized processors for these novel sensors, such as spiking neural networks. Additionally, we highlight the challenges that remain to be tackled and the opportunities that lie ahead in the search for a more efficient, bio-inspired way for machines to perceive and interact with the world

arXiv.org e-Print Archive

Infoscience - École polytechnique fédérale de Lausanne

ZORA

GLOBAL CHANGE REACTIVE BACKGROUND SUBTRACTION

Author: Sathiyamoorthy Edwin Premkumar
Publication venue: UKnowledge
Publication date: 01/01/2011
Field of study

Background subtraction is the technique of segmenting moving foreground objects from stationary or dynamic background scenes. Background subtraction is a critical step in many computer vision applications including video surveillance, tracking, gesture recognition etc. This thesis addresses the challenges associated with the background subtraction systems due to the sudden illumination changes happening in an indoor environment. Most of the existing techniques adapt to gradual illumination changes, but fail to cope with the sudden illumination changes. Here, we introduce a Global change reactive background subtraction to model these changes as a regression function of spatial image coordinates. The regression model is learned from highly probable background regions and the background model is compensated for the illumination changes by the model parameters estimated. Experiments were performed in the indoor environment to show the effectiveness of our approach in modeling the sudden illumination changes by a higher order regression polynomial. The results of non-linear SVM regression were also presented to show the robustness of our regression model

University of Kentucky

Modelling of content-aware indicators for effective determination of shot boundaries in compressed MPEG videos

Author: A Hanjalic
BL Yeo
C Cotsaces
C Grana
G Boccignone
H Fang
J Bescos
J Cao
J Hoey
J Meng
J Ren
J Yuan
Jianmin Jiang
Jinchang Ren
Juan Chen
K Qiu
K-C Yang
M Cooper
O Urhan
R Lienhart
RM Ford
S Lefèvre
S Li
S Porter
S-C Pei
TY Liu
U Gargi
Z Rasheed
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 09/04/2010
Field of study

In this paper, a content-aware approach is proposed to design multiple test conditions for shot cut detection, which are organized into a multiple phase decision tree for abrupt cut detection and a finite state machine for dissolve detection. In comparison with existing approaches, our algorithm is characterized with two categories of content difference indicators and testing. While the first category indicates the content changes that are directly used for shot cut detection, the second category indicates the contexts under which the content change occurs. As a result, indications of frame differences are tested with context awareness to make the detection of shot cuts adaptive to both content and context changes. Evaluations announced by TRECVID 2007 indicate that our proposed algorithm achieved comparable performance to those using machine learning approaches, yet using a simpler feature set and straightforward design strategies. This has validated the effectiveness of modelling of content-aware indicators for decision making, which also provides a good alternative to conventional approaches in this topic

Crossref

University of Strathclyde Institutional Repository

Surrey Research Insight

Real-time detection and tracking of multiple objects with partial decoding in H.264/AVC bitstream domain

Author: Kim Munchurl
Sabirin M. S. Houari
You Wonsang
Publication venue: 'SPIE-Intl Soc Optical Eng'
Publication date: 21/02/2012
Field of study

In this paper, we show that we can apply probabilistic spatiotemporal macroblock filtering (PSMF) and partial decoding processes to effectively detect and track multiple objects in real time in H.264|AVC bitstreams with stationary background. Our contribution is that our method cannot only show fast processing time but also handle multiple moving objects that are articulated, changing in size or internally have monotonous color, even though they contain a chaotic set of non-homogeneous motion vectors inside. In addition, our partial decoding process for H.264|AVC bitstreams enables to improve the accuracy of object trajectories and overcome long occlusion by using extracted color information.Comment: SPIE Real-Time Image and Video Processing Conference 200

arXiv.org e-Print Archive

Crossref

Recommended from our members

Tracking a table tennis ball for umpiring purposes using a multi-agent system

Author: Dooley Laurence
Hopgood Adrian
Myint Hnin
Wong Patrick
Publication venue
Publication date: 01/01/2016
Field of study

Tracking a table tennis ball for umpiring purposes is a challenging task as, in real-match scenarios, the ball travels fast and can become occluded or merged with other background objects. This paper presents the design of a multi-view based tracking system that can overcome the challenges of tracking a ball in real match sequences. The system has been tested on a complete table tennis rally and the results are very promising. The system is able to continuously track the ball with only marginal variations in detection. Furthermore, the initialization of the multi-camera system means it is both a portable and cost-effective solution for umpiring purposes

Open Research Online (The Open University)

Portsmouth University Research Portal (Pure)

Open Repository and Bibliography - Liège

Calipso: Physics-based Image and Video Editing through CAD Model Proxies

Author: Cotin Stephane
Courtecuisse Hadrien
Haouchine Nazim
Nießner Matthias
Roy Frederick
Publication venue
Publication date: 12/08/2017
Field of study

We present Calipso, an interactive method for editing images and videos in a physically-coherent manner. Our main idea is to realize physics-based manipulations by running a full physics simulation on proxy geometries given by non-rigidly aligned CAD models. Running these simulations allows us to apply new, unseen forces to move or deform selected objects, change physical parameters such as mass or elasticity, or even add entire new objects that interact with the rest of the underlying scene. In Calipso, the user makes edits directly in 3D; these edits are processed by the simulation and then transfered to the target 2D content using shape-to-image correspondences in a photo-realistic rendering process. To align the CAD models, we introduce an efficient CAD-to-image alignment procedure that jointly minimizes for rigid and non-rigid alignment while preserving the high-level structure of the input shape. Moreover, the user can choose to exploit image flow to estimate scene motion, producing coherent physical behavior with ambient dynamics. We demonstrate Calipso's physics-based editing on a wide range of examples producing myriad physical behavior while preserving geometric and visual consistency.Comment: 11 page

arXiv.org e-Print Archive

INRIA a CCSD electronic archive server

HAL-Rennes 1