Search CORE

1,421 research outputs found

Variational Disparity Estimation Framework for Plenoptic Image

Author: Simon Sven
Tran Trung-Hieu
Wang Zhe
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 18/04/2018
Field of study

This paper presents a computational framework for accurately estimating the disparity map of plenoptic images. The proposed framework is based on the variational principle and provides intrinsic sub-pixel precision. The light-field motion tensor introduced in the framework allows us to combine advanced robust data terms as well as provides explicit treatments for different color channels. A warping strategy is embedded in our framework for tackling the large displacement problem. We also show that by applying a simple regularization term and a guided median filtering, the accuracy of displacement field at occluded area could be greatly enhanced. We demonstrate the excellent performance of the proposed framework by intensive comparisons with the Lytro software and contemporary approaches on both synthetic and real-world datasets

arXiv.org e-Print Archive

Crossref

Real-Time Stereo Vision Applications

Author: Christos Georgoulas
Georgios Ch. Sirakoulis
Ioannis Andreadis
Publication venue: 'IntechOpen'
Publication date: 01/03/2010
Field of study

IntechOpen

Crossref

Cross-Scale Cost Aggregation for Stereo Matching

Author: Dongbo Min
Kang Zhang
Lifeng Sun
Qi Tian
Shiqiang Yang
Shuicheng Yan
Yuqiang Fang
Publication venue
Publication date: 03/03/2014
Field of study

Human beings process stereoscopic correspondence across multiple scales. However, this bio-inspiration is ignored by state-of-the-art cost aggregation methods for dense stereo correspondence. In this paper, a generic cross-scale cost aggregation framework is proposed to allow multi-scale interaction in cost aggregation. We firstly reformulate cost aggregation from a unified optimization perspective and show that different cost aggregation methods essentially differ in the choices of similarity kernels. Then, an inter-scale regularizer is introduced into optimization and solving this new optimization problem leads to the proposed framework. Since the regularization term is independent of the similarity kernel, various cost aggregation methods can be integrated into the proposed general framework. We show that the cross-scale framework is important as it effectively and efficiently expands state-of-the-art cost aggregation methods and leads to significant improvements, when evaluated on Middlebury, KITTI and New Tsukuba datasets.Comment: To Appear in 2013 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 2014 (poster, 29.88%

arXiv.org e-Print Archive

CiteSeerX

Crossref

Stereoscopic hand-detection system based on FPGA

Author: Santos Pedro Miguel Salgueiro dos
Publication venue
Publication date: 01/01/2009
Field of study

Tese de mestrado integrado. Engenharia Electrotécnica e de Computadores (Major de Telecomunicações). Faculdade de Engenharia. Universidade do Porto. 200

Repositório Aberto da Universidade do Porto

Unsupervised Monocular Depth Estimation with Left-Right Consistency

Author: Brostow Gabriel J.
Godard Clément
Mac Aodha Oisin
Publication venue
Publication date: 12/04/2017
Field of study

Learning based methods have shown very promising results for the task of depth estimation in single images. However, most existing approaches treat depth prediction as a supervised regression problem and as a result, require vast quantities of corresponding ground truth depth data for training. Just recording quality depth data in a range of environments is a challenging problem. In this paper, we innovate beyond existing approaches, replacing the use of explicit depth data during training with easier-to-obtain binocular stereo footage. We propose a novel training objective that enables our convolutional neural network to learn to perform single image depth estimation, despite the absence of ground truth depth data. Exploiting epipolar geometry constraints, we generate disparity images by training our network with an image reconstruction loss. We show that solving for image reconstruction alone results in poor quality depth images. To overcome this problem, we propose a novel training loss that enforces consistency between the disparities produced relative to both the left and right images, leading to improved performance and robustness compared to existing approaches. Our method produces state of the art results for monocular depth estimation on the KITTI driving dataset, even outperforming supervised methods that have been trained with ground truth depth.Comment: CVPR 2017 ora

arXiv.org e-Print Archive

Crossref

UCL Discovery

Light field reconstruction from multi-view images

Author: Han Kang
Publication venue
Publication date: 01/01/2022
Field of study

Kang Han studied recovering the 3D world from multi-view images. He proposed several algorithms to deal with occlusions in depth estimation and effective representations in view rendering. the proposed algorithms can be used for many innovative applications based on machine intelligence, such as autonomous driving and Metaverse

ResearchOnline at James Cook University