Vision and Learning for Deliberative Monocular Cluttered Flight

Author: Agcayazi M. Talha
Bagnell J. Andrew
Daftry Shreyansh
Dey Debadeepta
Eriksen Christopher
Hebert Martial
Mehta Rupesh
Shankar Kumar Shaurya
Zeng Sam
Publication date: 23/11/2014

Cameras provide a rich source of information while being passive, cheap and lightweight for small and medium Unmanned Aerial Vehicles (UAVs). In this work we present the first implementation of receding horizon control, which is widely used in ground vehicles, with monocular vision as the only sensing mode for autonomous UAV flight in dense clutter. We make it feasible on UAVs via a number of contributions: a novel coupling of perception and control via multiple relevant and diverse interpretations of the scene around the robot, leveraging recent advances in machine learning to showcase anytime budgeted cost-sensitive feature selection, and fast non-linear regression for monocular depth prediction. We empirically demonstrate the efficacy of our novel pipeline via real-world experiments covering more than 2 km through dense trees with a quadrotor built from off-the-shelf parts. Moreover, our pipeline is designed to also combine information from other modalities, such as stereo and lidar, when available.

arXiv.org e-Print Archive

CiteSeerX

A graphical model based solution to the facial feature point tracking problem

Author: Çetin Müjdat
Coşar Serhan
Publication venue: 'Elsevier BV'
Publication date: 01/01/2009

This paper proposes a facial feature point tracker motivated by applications such as human-computer interfaces and facial expression analysis systems. The proposed tracker is based on a graphical model framework. The facial features are tracked through video streams by incorporating statistical relations in time as well as spatial relations between feature points. By exploiting the spatial relationships between feature points, the proposed method provides robustness in real-world conditions such as arbitrary head movements and occlusions. A Gabor feature-based occlusion detector is developed and used to handle occlusions. The performance of the proposed tracker has been evaluated on real video data under various conditions, including occluded facial gestures and head movements. It is also compared to two popular methods, one based on Kalman filtering exploiting temporal relations, and the other based on active appearance models (AAM). Improvements provided by the proposed approach are demonstrated through both visual displays and quantitative analysis.

Sabanci University Research Database

Deformable Models for Eye Tracking

Author: Ersbøll Bjarne Kjær
Hansen Lars Kai
Leimberg Denis
Vester-Christensen Martin
Publication date: 01/01/2005

Online Research Database In Technology

EyeScout: Active Eye Tracking for Position and Movement Independent Gaze Interaction with Large Public Displays

Author: Alt Florian
Bulling Andreas
Hoesl Axel
Khamis Mohamed
Klimczak Alexander
Reiss Martin
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 20/10/2017

While gaze holds a lot of promise for hands-free interaction with public displays, remote eye trackers with their confined tracking box restrict users to a single stationary position in front of the display. We present EyeScout, an active eye tracking system that combines an eye tracker mounted on a rail system with a computational method to automatically detect and align the tracker with the user's lateral movement. EyeScout addresses key limitations of current gaze-enabled large public displays by offering two novel gaze-interaction modes for a single user: in "Walk then Interact" the user can walk up to an arbitrary position in front of the display and interact, while in "Walk and Interact" the user can interact even while on the move. We report on a user study that shows that EyeScout is well perceived by users, extends a public display's sweet spot into a sweet line, and reduces gaze interaction kick-off time to 3.5 seconds, a 62% improvement over state-of-the-art solutions. We discuss sample applications that demonstrate how EyeScout can enable position- and movement-independent gaze interaction with large public displays.

Crossref

Enlighten