Search CORE

40,342 research outputs found

A critical evaluation of computational mechanisms of binocular disparity processing

Author: Bølling Hansen Rikke
Konge Lars
Lindorff-Larsen Karen Gilboe
Nayahangan Leizl Joy
Nielsen Bjørn Ulrik
Paltved Charlotte
Publication venue: 'Purdue University (bepress)'
Publication date: 14/05/2015
Field of study

The past decades of research in visual neuroscience have generated a large and disparate body of literature on the computation of binocular disparity in the primary visual cortex. Models have been proposed to describe specific phenomena, yet we lack a theoretical framework which is grounded in neurophysiology and also explains the effectiveness of disparity computation. Here, we examine neural circuits that are thought to play an important role in the computation of binocular disparity. Starting with the binocular energy model (Ohzawa et al. 1990), we consider plausible extensions which include suppressive mechanisms from units tuned to different phase disparities (Tanabe et al. 2011), which is formerly theorized to perform false disparity detection (Read & Cumming 2007) as well as coarse-to-fine (Menz & Freeman 2004a,b) and recurrent processing (Samonds et al. 2013). We rigorously cross-examine the consistency of these circuits with neurophysiology data including ocular dominance and binocular modulation (Ohzawa & Freeman 1990), spike-triggered analysis and temporal dynamics of disparity tuning (Tanabe et al. 2011) and attenuation to anti-correlated stimuli (Cumming & Parker 1997; Tanabe et al. 2011). We further evaluate the ability of the resulting computational models to recover depth, both theoretically and experimentally, using a dataset of natural and synthetic images. Overall, we find that a computational model which combines suppressive mechanisms by units with non-zero phase disparity, contrast normalization as well as lateral interaction between units tuned to specific combinations of phase and position disparities, seems consistent with all of the available V1 neurophysiology data and achieves the highest accuracy in real-world depth computation

Copenhagen University Research Information System

VBN

Purdue E-Pubs

University of Southern Denmark Research Output

Stereo matching based on absolute differences for multiple objects detection

Author: Anwar Nik Syahrim Nik
Hamzah Rostam Affendi
Wei Melvin Gan Yeou
Publication venue: 'Universitas Ahmad Dahlan'
Publication date: 01/02/2019
Field of study

This article presents a new algorithm for object detection using stereo camera system. The problem to get an accurate object detion using stereo camera is the imprecise of matching process between two scenes with the same viewpoint. Hence, this article aims to reduce the incorrect matching pixel with four stages. This new algorithm is the combination of continuous process of matching cost computation, aggregation, optimization and filtering. The first stage is matching cost computation to acquire preliminary result using an absolute differences method. Then the second stage known as aggregation step uses a guided filter with fixed window support size. After that, the optimization stage uses winner-takes-all (WTA) approach which selects the smallest matching differences value and normalized it to the disparity level. The last stage in the framework uses a bilateral filter. It is effectively further decrease the error on the disparity map which contains information of object detection and locations. The proposed work produces low errors (i.e., 12.11% and 14.01% nonocc and all errors) based on the KITTI dataset and capable to perform much better compared with before the proposed framework and competitive with some newly available methods

Journal of Education and Learning (EduLearn)

TELKOMNIKA (Telecommunication Computing Electronics and Control)

UAD Journal Management System

Stereo Matching Based On Absolute Differences For Multiple Objects Detection

Author: Gan Melvin Yeou Wei
Hamzah Rostam Affendi
Nik Anwar Nik Syahrim
Publication venue: 'Universitas Ahmad Dahlan'
Publication date: 01/02/2019
Field of study

This article presents a new algorithm for object detection using stereo camera system. The problem to get an accurate object detection using stereo camera is the imprecise of matching process between two scenes with the same viewpoint. Hence, this article aims to reduce the incorrect matching pixel with four stages. This new algorithm is the combination of continuous process of matching cost computation, aggregation, optimization and filtering. The first stage is matching cost computation to acquire preliminary result using an absolute differences method. Then the second stage known as aggregation step uses a guided filter with fixed window support size. After that, the optimization stage uses winner-takes-all (WTA) approach which selects the smallest matching differences value and normalized it to the disparity level. The last stage in the framework uses a bilateral filter. It is effectively further decrease the error on the disparity map which contains information of object detection and locations. The proposed work produces low errors (i.e., 12.11% and 14.01% nonocc and all errors) based on the KITTI dataset and capable to perform much better compared with before the proposed framework and competitive with some newly available methods

ZENODO

Universiti Teknikal Malaysia Melaka (UTeM) Repository

Stereo Computation for a Single Mixture Image

Author: A Levin
A Levin
AM Bronstein
B Li
C Liu
D Scharstein
J Xie
Jingyu Yang
O Ronneberger
R Garg
R Zhang
S Iizuka
Z Wang
Publication venue
Publication date: 27/08/2018
Field of study

This paper proposes an original problem of \emph{stereo computation from a single mixture image}-- a challenging problem that had not been researched before. The goal is to separate (\ie, unmix) a single mixture image into two constitute image layers, such that the two layers form a left-right stereo image pair, from which a valid disparity map can be recovered. This is a severely illposed problem, from one input image one effectively aims to recover three (\ie, left image, right image and a disparity map). In this work we give a novel deep-learning based solution, by jointly solving the two subtasks of image layer separation as well as stereo matching. Training our deep net is a simple task, as it does not need to have disparity maps. Extensive experiments demonstrate the efficacy of our method.Comment: Accepted by European Conference on Computer Vision (ECCV) 201

arXiv.org e-Print Archive

Crossref

The Australian National University

Multi-Scale 3D Scene Flow from Binocular Stereo Sequences

Author: Li Rui
Sclaroff Stan
Publication venue: Boston University Computer Science Department
Publication date: 01/01/2007
Field of study

Scene ﬂow methods estimate the three-dimensional motion ﬁeld for points in the world, using multi-camera video data. Such methods combine multi-view reconstruction with motion estimation. This paper describes an alternative formulation for dense scene ﬂow estimation that provides reliable results using only two cameras by fusing stereo and optical ﬂow estimation into a single coherent framework. Internally, the proposed algorithm generates probability distributions for optical ﬂow and disparity. Taking into account the uncertainty in the intermediate stages allows for more reliable estimation of the 3D scene ﬂow than previous methods allow. To handle the aperture problems inherent in the estimation of optical ﬂow and disparity, a multi-scale method along with a novel region-based technique is used within a regularized solution. This combined approach both preserves discontinuities and prevents over-regularization – two problems commonly associated with the basic multi-scale approaches. Experiments with synthetic and real test data demonstrate the strength of the proposed approach.National Science Foundation (CNS-0202067, IIS-0208876); Office of Naval Research (N00014-03-1-0108

CiteSeerX

Boston University Institutional Repository (OpenBU)

Cross-Scale Cost Aggregation for Stereo Matching

Author: Dongbo Min
Kang Zhang
Lifeng Sun
Qi Tian
Shiqiang Yang
Shuicheng Yan
Yuqiang Fang
Publication venue
Publication date: 03/03/2014
Field of study

Human beings process stereoscopic correspondence across multiple scales. However, this bio-inspiration is ignored by state-of-the-art cost aggregation methods for dense stereo correspondence. In this paper, a generic cross-scale cost aggregation framework is proposed to allow multi-scale interaction in cost aggregation. We firstly reformulate cost aggregation from a unified optimization perspective and show that different cost aggregation methods essentially differ in the choices of similarity kernels. Then, an inter-scale regularizer is introduced into optimization and solving this new optimization problem leads to the proposed framework. Since the regularization term is independent of the similarity kernel, various cost aggregation methods can be integrated into the proposed general framework. We show that the cross-scale framework is important as it effectively and efficiently expands state-of-the-art cost aggregation methods and leads to significant improvements, when evaluated on Middlebury, KITTI and New Tsukuba datasets.Comment: To Appear in 2013 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 2014 (poster, 29.88%

arXiv.org e-Print Archive

CiteSeerX

Crossref

Quick and energy-efficient Bayesian computing of binocular disparity using stochastic digital signals

Author: Bessière Pierre
Coninx Alexandre
Droulez Jacques
Publication venue: 'Elsevier BV'
Publication date: 15/05/2016
Field of study

Reconstruction of the tridimensional geometry of a visual scene using the binocular disparity information is an important issue in computer vision and mobile robotics, which can be formulated as a Bayesian inference problem. However, computation of the full disparity distribution with an advanced Bayesian model is usually an intractable problem, and proves computationally challenging even with a simple model. In this paper, we show how probabilistic hardware using distributed memory and alternate representation of data as stochastic bitstreams can solve that problem with high performance and energy efficiency. We put forward a way to express discrete probability distributions using stochastic data representations and perform Bayesian fusion using those representations, and show how that approach can be applied to diparity computation. We evaluate the system using a simulated stochastic implementation and discuss possible hardware implementations of such architectures and their potential for sensorimotor processing and robotics.Comment: Preprint of article submitted for publication in International Journal of Approximate Reasoning and accepted pending minor revision

arXiv.org e-Print Archive

Variational Disparity Estimation Framework for Plenoptic Image

Author: Simon Sven
Tran Trung-Hieu
Wang Zhe
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 18/04/2018
Field of study

This paper presents a computational framework for accurately estimating the disparity map of plenoptic images. The proposed framework is based on the variational principle and provides intrinsic sub-pixel precision. The light-field motion tensor introduced in the framework allows us to combine advanced robust data terms as well as provides explicit treatments for different color channels. A warping strategy is embedded in our framework for tackling the large displacement problem. We also show that by applying a simple regularization term and a guided median filtering, the accuracy of displacement field at occluded area could be greatly enhanced. We demonstrate the excellent performance of the proposed framework by intensive comparisons with the Lytro software and contemporary approaches on both synthetic and real-world datasets

arXiv.org e-Print Archive

Crossref