Ego-motion and Surrounding Vehicle State Estimation Using a Monocular Camera

Author: Dariush Behzad
Hayakawa Jun
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 05/05/2020

Understanding ego-motion and surrounding vehicle state is essential to enable automated driving and advanced driving assistance technologies. Typical approaches to solve this problem use fusion of multiple sensors such as LiDAR, camera, and radar to recognize surrounding vehicle state, including position, velocity, and orientation. Such sensing modalities are overly complex and costly for production of personal-use vehicles. In this paper, we propose a novel machine learning method to estimate ego-motion and surrounding vehicle state using a single monocular camera. Our approach is based on a combination of three deep neural networks to estimate the 3D vehicle bounding box, depth, and optical flow from a sequence of images. The main contribution of this paper is a new framework and algorithm that integrates these three networks in order to estimate the ego-motion and surrounding vehicle state. To realize more accurate 3D position estimation, we address ground plane correction in real time. The efficacy of the proposed method is demonstrated through experimental evaluations that compare our results to ground truth data available from other sensors, including CAN bus and LiDAR.

arXiv.org e-Print Archive

Crossref

Multiframe Scene Flow with Piecewise Rigid Motion

Author: Golyanik Vladislav
Kautz Jan
Kim Kihwan
Maier Robert
Nießner Matthias
Stricker Didier
Publication date: 05/10/2017

We introduce a novel multiframe scene flow approach that jointly optimizes the consistency of the patch appearances and their local rigid motions from RGB-D image sequences. In contrast to the competing methods, we take advantage of an oversegmentation of the reference frame and robust optimization techniques. We formulate scene flow recovery as a global non-linear least squares problem which is iteratively solved by a damped Gauss-Newton approach. As a result, we obtain a qualitatively new level of accuracy in RGB-D based scene flow estimation which can potentially run in real time. Our method can handle challenging cases with rigid, piecewise rigid, articulated and moderate non-rigid motion, and does not rely on prior knowledge about the types of motions and deformations. Extensive experiments on synthetic and real data show that our method outperforms the state of the art. Comment: International Conference on 3D Vision (3DV), Qingdao, China, October 201
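The damped Gauss-Newton iteration mentioned in this abstract can be illustrated on a toy problem. The sketch below is not the authors' scene flow solver: it fits a two-parameter curve y = a * exp(b * x) to noiseless samples, and the function name `damped_gauss_newton`, the damping schedule, and the test data are all assumptions chosen for illustration.

```python
import math


def damped_gauss_newton(residual, jacobian, params, damping=1e-2, iters=200):
    """Minimize sum(r_i^2) for a two-parameter model (illustrative only).

    Each step solves (J^T J + damping * I) delta = -J^T r. The damping is
    halved after a step that lowers the cost and multiplied by 10 otherwise.
    """
    a, b = params
    cost = sum(r * r for r in residual(a, b))
    for _ in range(iters):
        r = residual(a, b)
        J = jacobian(a, b)  # list of (dr/da, dr/db) rows
        # Damped normal equations, assembled by hand for the 2x2 case.
        h11 = sum(ja * ja for ja, _ in J) + damping
        h12 = sum(ja * jb for ja, jb in J)
        h22 = sum(jb * jb for _, jb in J) + damping
        g1 = sum(ja * ri for (ja, _), ri in zip(J, r))
        g2 = sum(jb * ri for (_, jb), ri in zip(J, r))
        det = h11 * h22 - h12 * h12
        da = -(h22 * g1 - h12 * g2) / det
        db = -(h11 * g2 - h12 * g1) / det
        new_cost = sum(x * x for x in residual(a + da, b + db))
        if new_cost < cost:
            a, b, cost = a + da, b + db, new_cost
            damping *= 0.5   # trust the Gauss-Newton direction more
        else:
            damping *= 10.0  # fall back toward a short gradient step
        if cost < 1e-20:
            break
    return a, b


# Hypothetical toy data: samples of y = 2 * exp(0.5 * x).
xs = [0.0, 0.5, 1.0, 1.5, 2.0]
ys = [2.0 * math.exp(0.5 * x) for x in xs]


def residual(a, b):
    return [a * math.exp(b * x) - y for x, y in zip(xs, ys)]


def jacobian(a, b):
    return [(math.exp(b * x), a * x * math.exp(b * x)) for x in xs]


a_hat, b_hat = damped_gauss_newton(residual, jacobian, (1.0, 0.0))
```

Starting from (1.0, 0.0), the first few undamped-looking steps overshoot and are rejected, which raises the damping until a step reduces the cost; after that the iteration behaves more and more like plain Gauss-Newton.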

arXiv.org e-Print Archive

Crossref

Multiframe Scene Flow with Piecewise Rigid Motion

Author: Christophe Quesnel
Clarisse Blayau
Francis Bonnet
Guillaume Arlet
Jean-Luc Mainardi
Jean-Pierre Fulgencio
Marc Garnier
Mehdi Hafiani
Muriel Fartoukh
Sacha Rozencwajg
Salah Gallah
Sophie Vimont
Tài Pham
Publication date: 01/06/2017

arXiv.org e-Print Archive

University of Toronto Research Repository

Directory of Open Access Journals

HAL Descartes

Hal-Diderot

Disparity and Optical Flow Partitioning Using Extended Potts Priors

Author: Cai Xiaohao
Fitschen Jan Henrik
Nikolova Mila
Steidl Gabriele
Storath Martin
Publication venue: 'Oxford University Press (OUP)'
Publication date: 07/05/2014

This paper addresses the problems of disparity and optical flow partitioning based on the brightness invariance assumption. We investigate new variational approaches to these problems with Potts priors and possibly box constraints. For the optical flow partitioning, our model includes vector-valued data and an adapted Potts regularizer. Using the notion of asymptotically level stable functions, we prove the existence of global minimizers of our functionals. We propose a modified alternating direction method of multipliers. This iterative algorithm requires the computation of global minimizers of classical univariate Potts problems, which can be done efficiently by dynamic programming. We prove that the algorithm converges both for the constrained and unconstrained problems. Numerical examples demonstrate the very good performance of our partitioning method.
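The dynamic-programming step for a classical univariate Potts problem can be sketched as follows. This is a minimal illustration, assuming the standard formulation with a squared-error data term, gamma * (number of jumps) + sum_i (u_i - f_i)^2; it is not the paper's full iterative algorithm, and the function name `potts_1d` and the sample signal are hypothetical.

```python
def potts_1d(f, gamma):
    """Global minimizer of gamma * (#jumps of u) + sum((u_i - f_i)^2).

    Plain O(n^2) dynamic program over the position of the last jump.
    """
    n = len(f)
    # Prefix sums of values and squares give each segment error in O(1).
    s1 = [0.0] * (n + 1)
    s2 = [0.0] * (n + 1)
    for i, v in enumerate(f):
        s1[i + 1] = s1[i] + v
        s2[i + 1] = s2[i] + v * v

    def seg_err(l, r):
        """Squared deviation of f[l:r] from its mean (r exclusive)."""
        s = s1[r] - s1[l]
        return (s2[r] - s2[l]) - s * s / (r - l)

    # best[r] is the optimal value for the prefix f[:r]. Setting
    # best[0] = -gamma means the first segment is not charged a jump.
    best = [float("inf")] * (n + 1)
    best[0] = -gamma
    start = [0] * (n + 1)
    for r in range(1, n + 1):
        for l in range(r):
            cand = best[l] + gamma + seg_err(l, r)
            if cand < best[r]:
                best[r], start[r] = cand, l

    # Backtrack over segment starts and fill each segment with its mean.
    u = [0.0] * n
    r = n
    while r > 0:
        l = start[r]
        mean = (s1[r] - s1[l]) / (r - l)
        for i in range(l, r):
            u[i] = mean
        r = l
    return u


signal = [1.0, 1.1, 0.9, 5.0, 5.1, 4.9]
two_levels = potts_1d(signal, gamma=1.0)    # one jump is worth its cost
one_level = potts_1d(signal, gamma=100.0)   # jumps are too expensive
```

With a small jump penalty the result keeps the step between the third and fourth samples; with a large penalty the minimizer collapses to the overall mean.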

arXiv.org e-Print Archive

Southampton (e-Prints Soton)

Low Power Depth Estimation of Rigid Objects for Time-of-Flight Imaging

Author: Noraky James
Sze Vivienne
Publication date: 25/03/2019

Depth sensing is useful in a variety of applications that range from augmented reality to robotics. Time-of-flight (TOF) cameras are appealing because they obtain dense depth measurements with minimal latency. However, for many battery-powered devices, the illumination source of a TOF camera is power-hungry and can limit the battery life of the device. To address this issue, we present an algorithm that lowers the power for depth sensing by reducing the usage of the TOF camera and estimating depth maps using concurrently collected images. Our technique also adaptively controls the TOF camera and enables it when an accurate depth map cannot be estimated. To ensure that the overall system power for depth sensing is reduced, we design our algorithm to run on a low-power embedded platform, where it outputs 640x480 depth maps at 30 frames per second. We evaluate our approach on several RGB-D datasets, where it produces depth maps with an overall mean relative error of 0.96% and reduces the usage of the TOF camera by 85%. When used with commercial TOF cameras, we estimate that our algorithm can lower the total power for depth sensing by up to 73%.
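For readers unfamiliar with the metric quoted above, one common way to compute a mean relative error between an estimated and a reference depth map is sketched below. The abstract does not state the exact definition the authors use, so the formula, the function name `mean_relative_error`, and the convention of skipping pixels without a valid reference depth are assumptions.

```python
def mean_relative_error(estimated, reference):
    """Average of |est - ref| / ref over pixels with a valid reference.

    Both inputs are flat sequences of depth values. Pixels whose reference
    depth is not positive are treated as invalid and skipped (an assumption).
    """
    total = 0.0
    count = 0
    for est, ref in zip(estimated, reference):
        if ref > 0:
            total += abs(est - ref) / ref
            count += 1
    if count == 0:
        raise ValueError("no valid reference depths")
    return total / count


# Hypothetical values: the third pixel has no valid reference depth.
mre = mean_relative_error([1.01, 2.0, 3.0], [1.0, 2.0, 0.0])
```

Multiplying the result by 100 expresses it as a percentage, the form in which the abstract reports its figure.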

arXiv.org e-Print Archive

DSpace@MIT

Vision and Learning for Deliberative Monocular Cluttered Flight

Author: Agcayazi M. Talha
Bagnell J. Andrew
Daftry Shreyansh
Dey Debadeepta
Eriksen Christopher
Hebert Martial
Mehta Rupesh
Shankar Kumar Shaurya
Zeng Sam
Publication date: 23/11/2014

Cameras provide a rich source of information while being passive, cheap and lightweight for small and medium Unmanned Aerial Vehicles (UAVs). In this work we present the first implementation of receding horizon control, which is widely used in ground vehicles, with monocular vision as the only sensing mode for autonomous UAV flight in dense clutter. We make it feasible on UAVs via a number of contributions: novel coupling of perception and control via relevant and diverse multiple interpretations of the scene around the robot, leveraging recent advances in machine learning to showcase anytime budgeted cost-sensitive feature selection, and fast non-linear regression for monocular depth prediction. We empirically demonstrate the efficacy of our novel pipeline via real-world experiments of more than 2 km through dense trees with a quadrotor built from off-the-shelf parts. Moreover, our pipeline is designed to combine information from other modalities such as stereo and lidar, if available.

arXiv.org e-Print Archive

CiteSeerX

An evaluation framework for stereo-based driver assistance

Author: Banitsas KA
Gehrig S
Pfeiffer D
Schneider N
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2012

This is the post-print version of the Article - Copyright @ 2012 Springer Verlag. The accuracy of stereo algorithms or optical flow methods is commonly assessed by comparing the results against the Middlebury database. However, equivalent data for automotive or robotics applications rarely exist as they are difficult to obtain. As our main contribution, we introduce an evaluation framework tailored for stereo-based driver assistance able to deliver excellent performance measures while circumventing manual label effort. Within this framework one can combine several ways of ground-truthing, different comparison metrics, and use large image databases. Using our framework we show examples on several types of ground-truthing techniques: implicit ground-truthing (e.g. sequences recorded without a crash occurring), robotic vehicles with high-precision sensors, and, to a small extent, manual labeling. To show the effectiveness of our evaluation framework we compare three different stereo algorithms on pixel and object level. In more detail, we evaluate an intermediate representation called the Stixel World. Besides evaluating the accuracy of the Stixels, we investigate the completeness (equivalent to the detection rate) of the Stixel World vs. the number of phantom Stixels. Among many findings, using this framework enables us to reduce the number of phantom Stixels by a factor of three compared to the base parametrization. This base parametrization has already been optimized by test driving vehicles for distances exceeding 10,000 km.

Brunel University Research Archive