    Semi-Global Stereo Matching with Surface Orientation Priors

    Semi-Global Matching (SGM) is a widely used, efficient stereo matching technique. It works well for textured scenes but fails on untextured slanted surfaces due to its fronto-parallel smoothness assumption. To remedy this problem, we propose a simple extension, termed SGM-P, that utilizes precomputed surface orientation priors. Such priors favor different surface slants in different 2D image regions or 3D scene regions and can be derived in various ways. In this paper we evaluate plane orientation priors derived from stereo matching at a coarser resolution and show that such priors can yield significant performance gains for difficult weakly textured scenes. We also explore surface normal priors derived from Manhattan-world assumptions, and we analyze the potential performance gains using oracle priors derived from ground-truth data. SGM-P adds only a minor computational overhead to SGM and is an attractive alternative to more complex methods employing higher-order smoothness terms. Comment: extended draft of 3DV 2017 (spotlight) paper.
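The fronto-parallel assumption the abstract refers to can be seen in SGM's scanline cost recurrence, where disparity changes between neighbouring pixels are penalised. The sketch below is a minimal 1-D aggregation step, not the authors' SGM-P implementation; the `slant_prior` argument only illustrates the general orientation-prior idea (shifting the smoothness term so an expected slant is not penalised), and all names and the border handling via `np.roll` are simplifying assumptions.

```python
import numpy as np

def aggregate_path_1d(cost, p1=1.0, p2=8.0, slant_prior=None):
    """Aggregate matching costs along one scanline, SGM-style.

    cost: (W, D) array of per-pixel matching costs.
    slant_prior: optional (W,) array of expected disparity change per
    pixel step; when given, the previous costs are shifted by it so a
    slanted surface incurs no smoothness penalty (orientation-prior idea).
    """
    W, D = cost.shape
    L = np.empty_like(cost)
    L[0] = cost[0]
    for x in range(1, W):
        prev = L[x - 1]
        if slant_prior is not None:
            prev = np.roll(prev, int(round(slant_prior[x])))
        # Classic SGM recurrence: no penalty for the same disparity,
        # P1 for a one-pixel jump, P2 for any larger jump.
        same = prev
        plus = np.roll(prev, 1) + p1    # simplistic wrap-around borders
        minus = np.roll(prev, -1) + p1
        jump = prev.min() + p2
        L[x] = (cost[x]
                + np.minimum(np.minimum(same, plus), np.minimum(minus, jump))
                - prev.min())  # subtract minimum to keep values bounded
    return L
```

On a cost volume whose minimum stays at a fixed disparity, the aggregated costs keep the same winner while smoothing outliers along the path.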

    3D RECONSTRUCTION FROM STEREO/RANGE IMAGES

    3D reconstruction from stereo/range images is one of the most fundamental and extensively researched topics in computer vision. Stereo research has recently entered something of a new era as a result of publicly available performance benchmarks such as the Middlebury data set, which allow researchers to compare their algorithms against the state of the art. This thesis investigates general stereo problems in both the two-view and multi-view stereo settings. In the two-view setting, we formulate an algorithm for the stereo matching problem with careful handling of disparity, discontinuity and occlusion. The algorithm uses a global matching stereo model based on an energy minimization framework. The experimental results are evaluated on the Middlebury data set, showing that our algorithm is the top performer. A GPU implementation of the hierarchical BP algorithm is then proposed, which provides stereo quality similar to CPU hierarchical BP while running at real-time speed. A fast-converging BP is also proposed to address the slow convergence of general BP algorithms. Beyond two-view stereo, efficient multi-view stereo for large-scale urban reconstruction is carefully studied in this thesis. A novel approach is presented for computing depth maps from urban imagery in which large parts of the surfaces are often weakly textured. Finally, a new post-processing step is proposed to enhance range images in both spatial resolution and depth precision.
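The belief propagation mentioned in the abstract iterates min-sum message updates between neighbouring pixels over the disparity labels. The following is a generic single-message sketch with a truncated-linear smoothness prior, a common choice in BP stereo; it is not the thesis's hierarchical or fast-converging variant, and all names and parameter values are illustrative.

```python
import numpy as np

def bp_message(data_cost, incoming, lam=1.0, trunc=2.0):
    """One min-sum BP message update with truncated-linear smoothness.

    data_cost: (D,) unary matching cost at the sending pixel.
    incoming: list of (D,) messages from the sender's *other* neighbours.
    Returns the (D,) message sent to the receiving neighbour.
    """
    D = data_cost.shape[0]
    h = data_cost + np.sum(incoming, axis=0)  # aggregate local belief
    d = np.arange(D)
    # Pairwise term V(d, d') = lam * min(|d - d'|, trunc): small disparity
    # jumps cost linearly, large jumps are capped (preserves discontinuities).
    V = lam * np.minimum(np.abs(d[:, None] - d[None, :]), trunc)
    msg = (h[:, None] + V).min(axis=0)
    return msg - msg.min()  # normalise so messages do not drift upward
```

Hierarchical BP accelerates convergence by running these updates coarse-to-fine and initialising each level's messages from the coarser one.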

    Guiding Deep Learning with Expert Knowledge for Dense Stereo Matching

    Dense depth information can be reconstructed from stereo images using conventional hand-crafted as well as deep learning-based approaches. While deep learning methods often show superior results compared to hand-crafted ones, they commonly learn the geometric principles underlying the matching task from scratch, neglecting that these principles have already been studied intensively and were considered explicitly, with great success, in various models in the past. In consequence, a broad range of principles and associated features must be learned, limiting the ability to focus on the details needed to succeed in challenging image regions, such as those close to depth discontinuities, thin objects and weakly textured areas. To overcome this limitation, this work presents a hybrid technique, i.e., a combination of conventional hand-crafted and deep learning-based methods, addressing the task of dense stereo matching. More precisely, the input RGB stereo images are supplemented by a fourth image channel containing feature information obtained with a method based on expert knowledge. In addition, the assumption that edges in an image and discontinuities in the corresponding depth map coincide is modeled explicitly, allowing the per-pixel probability of lying next to a depth discontinuity to be predicted. This information is used to guide the matching process and helps to sharpen correct depth discontinuities and to avoid the false prediction of such discontinuities, especially in weakly textured areas. The performance of the proposed method is investigated on three different data sets, including studies on the influence of the two methodological components as well as on the generalization capability. The results demonstrate that the presented hybrid approach can help to mitigate common limitations of deep learning-based methods and improves the quality of the estimated depth maps.
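Mechanically, supplementing RGB input with a hand-crafted feature amounts to stacking a normalised feature map as an extra channel before the network sees the image. The sketch below illustrates that step only; the gradient-magnitude feature is a stand-in assumption, not the paper's actual expert-knowledge feature, and all names are illustrative.

```python
import numpy as np

def gradient_magnitude(gray):
    """A simple hand-crafted feature: image gradient magnitude."""
    gy, gx = np.gradient(gray)
    return np.hypot(gx, gy)

def add_feature_channel(rgb, feature):
    """Stack a hand-crafted feature map onto an RGB image as a 4th channel.

    rgb: (H, W, 3) float array; feature: (H, W) float array. The feature
    is min-max normalised so its range matches typical network inputs.
    """
    feature = (feature - feature.min()) / (np.ptp(feature) + 1e-8)
    return np.concatenate([rgb, feature[..., None]], axis=-1)
```

A network consuming this tensor only needs its first convolution widened from 3 to 4 input channels; the rest of the architecture is unchanged.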

    3D semantic representation of actions from efficient stereo-image-sequence segmentation on GPUs

    A novel real-time framework for model-free stereo-video segmentation and stereo-segment tracking is presented, combining real-time optical flow and stereo with image segmentation running separately on two GPUs. The stereo-segment tracking algorithm achieves a frame rate of 23 Hz for regular videos with a frame size of 256 x 320 pixels and nearly real-time performance for stereo videos. The computed stereo segments are used to construct 3D segment graphs, from which main graphs, representing a relevant change in the scene, are extracted. These allow us to represent a movie of, e.g., 396 original frames by only 12 graphs, each containing only a small number of nodes, providing a condensed description of the scene while preserving data-intrinsic semantics. Using this method, human activities, e.g., the handling of objects, can be encoded in an efficient way. The method has potential applications in manipulation action recognition and learning, and provides a vision front end for applications in cognitive robotics. Postprint (published version).
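The compression from hundreds of frames to a handful of graphs works because a new key graph is only needed when the segment topology changes. This is a minimal stand-in sketch for that idea, assuming segment graphs are given as lists of label-pair edges; the change criterion here (exact topology change) is a simplification of the paper's relevant-change extraction, and all names are illustrative.

```python
def graph_signature(edges):
    """Canonical, order-independent signature of a segment graph."""
    return frozenset(frozenset(e) for e in edges)

def extract_main_graphs(frame_graphs):
    """Keep one representative per run of topologically identical graphs.

    frame_graphs: list of edge lists, one per video frame. A new 'main
    graph' is emitted whenever the segment adjacency topology changes,
    so a long movie collapses to a short sequence of key graphs.
    """
    mains, last = [], None
    for graph in frame_graphs:
        sig = graph_signature(graph)
        if sig != last:
            mains.append(graph)
            last = sig
    return mains
```

For instance, five frames in which two segments touch, a third joins, and then leaves again collapse to three main graphs.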

    Fusion of Range and Stereo Data for High-Resolution Scene-Modeling

    This work has received funding from Agence Nationale de la Recherche under the MIXCAM project number ANR-13-BS02-0010-01. Georgios Evangelidis is the corresponding author

    Computational approach for depth from defocus

    Active depth from defocus (DFD) eliminates the main limitation faced by passive DFD, namely its inability to recover depth when dealing with scenes defined by weakly textured (or textureless) objects. This is achieved by projecting a dense illumination pattern onto the scene; depth is then recovered by measuring the local blurring of the projected pattern. Since the illumination pattern forces a strong dominant texture onto the imaged surfaces, the level of blurring can be determined by applying a single local operator (tuned to the frequency derived from the illumination pattern), as opposed to window-based passive DFD, where a large range of band-pass operators is required. The choice of the local operator is a key issue in achieving precise and dense depth estimation. Consequently, in this paper we introduce a new focus operator and propose refinements to compensate for the problems associated with a suboptimal local operator and a non-optimized illumination pattern. The developed range sensor has been tested on real images, and the results demonstrate that its performance compares well with that of other implementations in which precise and computationally expensive optimization techniques are employed.
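The core measurement the abstract describes is the local response of an operator tuned to the pattern's carrier frequency: defocus blur attenuates that frequency, so lower local amplitude means more blur. The 1-D quadrature-demodulation sketch below illustrates that principle under stated assumptions; it is not the paper's focus operator, and the sinusoidal pattern, box low-pass filter and all names are illustrative.

```python
import numpy as np

def local_pattern_amplitude(signal, freq):
    """Estimate the local amplitude of a projected pattern at a known
    carrier frequency (cycles per sample).

    Defocus blur attenuates the carrier, so a lower returned amplitude
    indicates stronger local blur (a surface farther from the focal plane).
    """
    n = np.arange(signal.size)
    # Demodulate: shift the carrier frequency down to DC ...
    analytic = signal * np.exp(-2j * np.pi * freq * n)
    # ... then low-pass with a one-period box filter to reject the
    # double-frequency term, leaving half the local amplitude.
    k = max(1, int(round(1.0 / freq)))
    kernel = np.ones(k) / k
    return 2.0 * np.abs(np.convolve(analytic, kernel, mode="same"))
```

Blurring the observed pattern (e.g. with a small box filter standing in for the defocus kernel) visibly lowers the recovered amplitude, which is the quantity a DFD sensor maps to depth.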