Search CORE

5 research outputs found

Deep Depth From Focus

Author: A Thelen
C Hazirbas
E Adelson
F Liu
J Park
K Honauer
M Mahmood
M Moeller
O Russakovsky
R Garg
S Pertuz
V Badrinarayanan
Y Bok
Publication venue
Publication date: 28/10/2018
Field of study

Depth from focus (DFF) is one of the classical ill-posed inverse problems in computer vision. Most approaches recover the depth at each pixel based on the focal setting which exhibits maximal sharpness. Yet, it is not obvious how to reliably estimate the sharpness level, particularly in low-textured areas. In this paper, we propose `Deep Depth From Focus (DDFF)' as the first end-to-end learning approach to this problem. One of the main challenges we face is the hunger for data of deep neural networks. In order to obtain a significant amount of focal stacks with corresponding groundtruth depth, we propose to leverage a light-field camera with a co-calibrated RGB-D sensor. This allows us to digitally create focal stacks of varying sizes. Compared to existing benchmarks our dataset is 25 times larger, enabling the use of machine learning for this inverse problem. We compare our results with state-of-the-art DFF methods and we also analyze the effect of several key deep architectural components. These experiments show that our proposed method `DDFFNet' achieves state-of-the-art performance in all scenes, reducing depth error by more than 75% compared to the classical DFF methods.Comment: accepted to Asian Conference on Computer Vision (ACCV) 201

arXiv.org e-Print Archive

Crossref

SNE-RoadSeg: Incorporating Surface Normal Information into Semantic Segmentation for Accurate Freespace Detection

Author: A Wedel
C Hazirbas
C Lu
JM Alvarez
L Caltagirone
LC Chen
O Ronneberger
P Cai
R Fan
S Hinterstoisser
V Badrinarayanan
Y Sun
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 25/08/2020
Field of study

Freespace detection is an essential component of visual perception for self-driving cars. The recent efforts made in data-fusion convolutional neural networks (CNNs) have significantly improved semantic driving scene segmentation. Freespace can be hypothesized as a ground plane, on which the points have similar surface normals. Hence, in this paper, we first introduce a novel module, named surface normal estimator (SNE), which can infer surface normal information from dense depth/disparity images with high accuracy and efficiency. Furthermore, we propose a data-fusion CNN architecture, referred to as RoadSeg, which can extract and fuse features from both RGB images and the inferred surface normal information for accurate freespace detection. For research purposes, we publish a large-scale synthetic freespace detection dataset, named Ready-to-Drive (R2D) road dataset, collected under different illumination and weather conditions. The experimental results demonstrate that our proposed SNE module can benefit all the state-of-the-art CNNs for freespace detection, and our SNE-RoadSeg achieves the best overall performance among different datasets.Comment: ECCV 202

arXiv.org e-Print Archive

Crossref

We Learn Better Road Pothole Detection:From Attention Aggregation to Adversarial Domain Adaptation

Author: C Hazirbas
C Koch
MR Jahanshahi
N Otsu
O Ronneberger
R Fan
R Fan
R Fan
R Fan
S Mathavan
Y LeCun
Y Sun
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 03/01/2022
Field of study

Crossref

Explore Bristol Research

We Learn Better Road Pothole Detection: from Attention Aggregation to Adversarial Domain Adaptation

Author: C Hazirbas
C Koch
MR Jahanshahi
N Otsu
O Ronneberger
R Fan
R Fan
R Fan
R Fan
S Mathavan
Y LeCun
Y Sun
Publication venue
Publication date: 11/12/2020
Field of study

Manual visual inspection performed by certified inspectors is still the main form of road pothole detection. This process is, however, not only tedious, time-consuming and costly, but also dangerous for the inspectors. Furthermore, the road pothole detection results are always subjective, because they depend entirely on the individual experience. Our recently introduced disparity (or inverse depth) transformation algorithm allows better discrimination between damaged and undamaged road areas, and it can be easily deployed to any semantic segmentation network for better road pothole detection results. To boost the performance, we propose a novel attention aggregation (AA) framework, which takes the advantages of different types of attention modules. In addition, we develop an effective training set augmentation technique based on adversarial domain adaptation, where the synthetic road RGB images and transformed road disparity (or inverse depth) images are generated to enhance the training of semantic segmentation networks. The experimental results demonstrate that, firstly, the transformed disparity (or inverse depth) images become more informative; secondly, AA-UNet and AA-RTFNet, our best performing implementations, respectively outperform all other state-of-the-art single-modal and data-fusion networks for road pothole detection; and finally, the training set augmentation technique based on adversarial domain adaptation not only improves the accuracy of the state-of-the-art semantic segmentation networks, but also accelerates their convergence.Comment: 16 pages, 7 figures and 2 tables. This paper is accepted by ECCV Workshops 202

arXiv.org e-Print Archive

Crossref

Explore Bristol Research

RGB-D Indoor Object Recognition Algorithm Based on Fusion Convolutional Neural Network

Author: Decheng Wang
Feng Zhao
Hazirbas C
Hoffman J
Hui Yi
Lu H T
Redmon J.
Runshun L
Silberman N
Simonyan K.
Socher R
Wei L.
Publication venue: 'IOP Publishing'
Publication date
Field of study

Crossref