Search CORE

16 research outputs found

Disparity Map Algorithm Based on Edge Preserving Filter for Stereo Video Processing

Author: Abd Ghani S. F.
Hamid M. S.
Hamzah R. A.
Kadmin A. F.
Salam S.
Publication venue: Journal of Telecommunication, Electronic and Computer Engineering (JTEC)
Publication date: 12/02/2018
Field of study

This paper proposes a new local-based stereo matching algorithm for stereo video processing. Fundamentally, the Sum of Absolute Differences (SAD) algorithm produces an accurate results on the stereo video processing for the textured regions. However, this algorithm sensitives to low texture and radiometric distortions (i.e., contrast or brightness). To overcome these problems, the proposed algorithm utilizes edgepreserving filter which is known as Bilateral Filter (BF). The BF algorithm reduces noise and sharpen the images. Additionally, BF works fine on the low or plain texture areas. The proposed algorithm produces an accurate results and performs much better compared to some established algorithms on the standard benchmarking results of the Middlebury and KITTI dataset

Universiti Teknikal Malaysia Melaka: UTeM Open Journal System

ActiveStereoNet: End-to-End Self-Supervised Learning for Active Stereo Systems

Author: A Bhandari
A Foi
A Hosni
D Scharstein
F Besse
H Hirschmuller
H Zhao
J Kowalczuk
J Xie
J Zbontar
KJ Yoon
Mingsong Dou
PF Felzenszwalb
R Garg
R Szeliski
RA Hamzah
SR Fanello
SR Fanello
SR Fanello
Publication venue
Publication date: 01/01/2018
Field of study

In this paper we present ActiveStereoNet, the first deep learning solution for active stereo systems. Due to the lack of ground truth, our method is fully self-supervised, yet it produces precise depth with a subpixel precision of

1/30th

of a pixel; it does not suffer from the common over-smoothing issues; it preserves the edges; and it explicitly handles occlusions. We introduce a novel reconstruction loss that is more robust to noise and texture-less patches, and is invariant to illumination changes. The proposed loss is optimized using a window-based cost aggregation with an adaptive support weight scheme. This cost aggregation is edge-preserving and smooths the loss function, which is key to allow the network to reach compelling results. Finally we show how the task of predicting invalid regions, such as occlusions, can be trained end-to-end without ground-truth. This component is crucial to reduce blur and particularly improves predictions along depth discontinuities. Extensive quantitatively and qualitatively evaluations on real and synthetic data demonstrate state of the art results in many challenging scenes.Comment: Accepted by ECCV2018, Oral Presentation, Main paper + Supplementary Material

arXiv.org e-Print Archive

Princeton University Open Access Repository

Crossref

Depth from Monocular Images using a Semi-Parallel Deep Neural Network (SPDNN) Hybrid Architecture

Author: Bazrafkan S.
Corcoran P.
Javidnia H.
Lemley J.
Publication venue: 'SPIE-Intl Soc Optical Eng'
Publication date: 18/04/2018
Field of study

Deep neural networks are applied to a wide range of problems in recent years. In this work, Convolutional Neural Network (CNN) is applied to the problem of determining the depth from a single camera image (monocular depth). Eight different networks are designed to perform depth estimation, each of them suitable for a feature level. Networks with different pooling sizes determine different feature levels. After designing a set of networks, these models may be combined into a single network topology using graph optimization techniques. This "Semi Parallel Deep Neural Network (SPDNN)" eliminates duplicated common network layers, and can be further optimized by retraining to achieve an improved model compared to the individual topologies. In this study, four SPDNN models are trained and have been evaluated at 2 stages on the KITTI dataset. The ground truth images in the first part of the experiment are provided by the benchmark, and for the second part, the ground truth images are the depth map results from applying a state-of-the-art stereo matching method. The results of this evaluation demonstrate that using post-processing techniques to refine the target of the network increases the accuracy of depth estimation on individual mono images. The second evaluation shows that using segmentation data alongside the original data as the input can improve the depth estimation results to a point where performance is comparable with stereo depth estimation. The computational time is also discussed in this study.Comment: 44 pages, 25 figure

arXiv.org e-Print Archive

Irish Universities

Access to Research at National University of Ireland, Galway

STEREO MATCHING ALGORITHM BASED ON ILLUMINATION CONTROL TO IMPROVE THE ACCURACY

Author
Publication venue: 'Slovenian Society for Stereology and Quantitative Image Analysis'
Publication date
Field of study

Crossref

Real-time Stereo Matching on CUDA using an Iterative Refinement Method for Adaptive Support-Weight Correspondences

Author: Kowalczuk Jedrzej
Psota Eric T.
Pérez Lance C.
Publication venue: DigitalCommons@University of Nebraska - Lincoln
Publication date: 01/01/2012
Field of study

High-quality real-time stereo matching has the potential to enable various computer vision applications including semi-automated robotic surgery, tele-immersion, and three-dimensional video surveillance. A novel real-time stereo matching method is presented that uses a two-pass approximation of adaptive support-weight aggregation, and a low-complexity iterative disparity refinement technique. Through an evaluation of computationally efficient approaches to adaptive support weight cost aggregation, it is shown that the two-pass method produces an accurate approximation of the support weights while greatly reducing the complexity of aggregation. The refinement technique, constructed using a probabilistic framework, incorporates an additive term into matching cost minimization and facilitates iterative processing to improve the accuracy of the disparity map. This method has been implemented on massively parallel high-performance graphics hardware using the CUDA computing engine. Results show that the proposed method is the most accurate among all of the real-time stereo matching methods listed on the Middlebury stereo benchmark

DigitalCommons@University of Nebraska

Stereo Matching Algorithm Based On Illumination Control To Improve The Accuracy

Author: Abu Hassan Anwar Hasni
Hamzah Rostam Affendi
Ibrahim Haidi
Publication venue: 'Slovenian Society for Stereology and Quantitative Image Analysis'
Publication date: 01/01/2016
Field of study

Directory of Open Access Journals

Universiti Teknikal Malaysia Melaka (UTeM) Repository

Integration of Multisensorial Stimuli and Multimodal Interaction in a Hybrid 3DTV System

Author: Cisneros Perez Guillermo
Feijoo Gonzalez Claudio Antonio
Galloso Guitard Iris
Luque Oostrom Francisco Pedro
Martín Edo Carlos Alberto
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 01/01/2014
Field of study

This article proposes the integration of multisensorial stimuli and multimodal interaction components into a sports multimedia asset under two dimensions: immersion and interaction. The first dimension comprises a binaural audio system and a set of sensory effects synchronized with the audiovisual content, whereas the second explores interaction through the insertion of interactive 3D objects into the main screen and on-demand presentation of additional information in a second touchscreen. We present an end-to-end solution integrating these components into a hybrid (internet-broadcast) television system using current 3DTV standards. Results from an experimental study analyzing the perceived quality of these stimuli and their influence on the Quality of Experience are presented

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

Archivo Digital UPM