Search CORE

49,992 research outputs found

Video-based, real-time multi-view stereo

Author: Agarwal
Bhotika
Bradley
Broadhurst
Campbell
Carlos Hernández
Cornelis
Davison
Furukawa
Gargallo
George Vogiatzis
Goesele
Goesele
Habbecke
Hernández
Hernández
Hornung
Jaynes
Kleinand
Kolev
Kutulakos
Lhuillier
Li
Pollefeys
Richard
Seitz
Snavely
Strecha
Strecha
Vogiatzis
Zhang
Publication venue: 'Elsevier BV'
Publication date: 01/06/2011
Field of study

We investigate the problem of obtaining a dense reconstruction in real-time, from a live video stream. In recent years, multi-view stereo (MVS) has received considerable attention and a number of methods have been proposed. However, most methods operate under the assumption of a relatively sparse set of still images as input and unlimited computation time. Video based MVS has received less attention despite the fact that video sequences offer significant benefits in terms of usability of MVS systems. In this paper we propose a novel video based MVS algorithm that is suitable for real-time, interactive 3d modeling with a hand-held camera. The key idea is a per-pixel, probabilistic depth estimation scheme that updates posterior depth distributions with every new frame. The current implementation is capable of updating 15 million distributions/s. We evaluate the proposed method against the state-of-the-art real-time MVS method and show improvement in terms of accuracy

CiteSeerX

Real-Time Dense 3D Reconstruction from Monocular Video Data Captured by Low-Cost UAVS

Author: Hermann M.
Ruf B.
Weinmann M.
Publication venue: Copernicus Publications
Publication date: 12/07/2021
Field of study

Real-time 3D reconstruction enables fast dense mapping of the environment which benefits numerous applications, such as navigation or live evaluation of an emergency. In contrast to most real-time capable approaches, our method does not need an explicit depth sensor. Instead, we only rely on a video stream from a camera and its intrinsic calibration. By exploiting the self-motion of the unmanned aerial vehicle (UAV) flying with oblique view around buildings, we estimate both camera trajectory and depth for selected images with enough novel content. To create a 3D model of the scene, we rely on a three-stage processing chain. First, we estimate the rough camera trajectory using a simultaneous localization and mapping (SLAM) algorithm. Once a suitable constellation is found, we estimate depth for local bundles of images using a Multi-View Stereo (MVS) approach and then fuse this depth into a global surfel-based model. For our evaluation, we use 55 video sequences with diverse settings, consisting of both synthetic and real scenes. We evaluate not only the generated reconstruction but also the intermediate products and achieve competitive results both qualitatively and quantitatively. At the same time, our method can keep up with a 30 fps video for a resolution of 768 × 448 pixels

Multi Camera Stereo and Tracking Patient Motion for SPECT Scanning Systems

Author: Nadella Suman
Publication venue: Digital WPI
Publication date: 29/08/2005
Field of study

Patient motion, which causes artifacts in reconstructed images, can be a serious problem in Single Photon Emission Computed Tomography (SPECT) imaging. If patient motion can be detected and quantified, the reconstruction algorithm can compensate for the motion. A real-time multi-threaded Visual Tracking System (VTS) using optical cameras, which will be suitable for deployment in clinical trials, is under development. The VTS tracks patients using multiple video images and image processing techniques, calculating patient motion in three-dimensional space. This research aimed to develop and implement an algorithm for feature matching and stereo location computation using multiple cameras. Feature matching is done based on the epipolar geometry constraints for a pair of images and extended to the multiple view case with an iterative algorithm. Stereo locations of the matches are then computed using sum of squared distances from the projected 3D lines in SPECT coordinates as the error metric. This information from the VTS, when coupled with motion assessment from the emission data itself, can provide a robust compensation for patient motion as part of reconstruction

Towards accessible content creation of real world objects for virtual environments

Author: Rasmuson Sverker
Publication venue
Publication date: 01/01/2019
Field of study

3D reconstruction is the general problem of creating 3D models from real world objects. In today\u27s movie and games industry,there is an increasing demand for using real world content as assets in production. In general, however, 3D reconstruction is achallenging problem, and current techniques only allow for production-ready results given a combination of expensive equipment andspecific expertise.This thesis is a collection of three papers that address various aspects of this general problem of 3D reconstruction,with the aim of lowering the bar for making usable real world content.In Paper I, we address the problem of storing and streaming time varying geometry for e.g.\ free-viewpoint video, whichotherwise has too high bandwidth requirements to be streamed efficiently. We use a memory-efficient structure based on compressedvoxels to store the data, in which we can send only incremental updates to the geometry in each frame.In Paper II, we implement an end-to-end real-time pipeline for free-viewpoint video communication.The pipeline uses a set of ordinary webcams as input and do all processing on a single desktop computer. Even with theselimitations, we show that we can produce free-viewpoint video with agreeable quality in real-time.Paper III addresses the problem of accessible and accurate modeling of static real-world objects.Given a set of calibrated input images, we have developed an interactive tool that makes 3D reconstruction with multi-view stereo moreaccessible. This interactive reconstruction has several advantages over automatic 3D scanning, since we obtain correct topology by designas well as information about visibility and foreground segmentation

Chalmers Research

Multi-Scale 3D Scene Flow from Binocular Stereo Sequences

Author: Li Rui
Sclaroff Stan
Publication venue: Boston University Computer Science Department
Publication date: 01/01/2007
Field of study

Scene ﬂow methods estimate the three-dimensional motion ﬁeld for points in the world, using multi-camera video data. Such methods combine multi-view reconstruction with motion estimation. This paper describes an alternative formulation for dense scene ﬂow estimation that provides reliable results using only two cameras by fusing stereo and optical ﬂow estimation into a single coherent framework. Internally, the proposed algorithm generates probability distributions for optical ﬂow and disparity. Taking into account the uncertainty in the intermediate stages allows for more reliable estimation of the 3D scene ﬂow than previous methods allow. To handle the aperture problems inherent in the estimation of optical ﬂow and disparity, a multi-scale method along with a novel region-based technique is used within a regularized solution. This combined approach both preserves discontinuities and prevents over-regularization – two problems commonly associated with the basic multi-scale approaches. Experiments with synthetic and real test data demonstrate the strength of the proposed approach.National Science Foundation (CNS-0202067, IIS-0208876); Office of Naval Research (N00014-03-1-0108

CiteSeerX

Boston University Institutional Repository (OpenBU)

Online Mutual Foreground Segmentation for Multispectral Stereo Videos

Author: Bergevin Robert
Bilodeau Guillaume-Alexandre
St-Charles Pierre-Luc
Publication venue
Publication date: 21/12/2018
Field of study

The segmentation of video sequences into foreground and background regions is a low-level process commonly used in video content analysis and smart surveillance applications. Using a multispectral camera setup can improve this process by providing more diverse data to help identify objects despite adverse imaging conditions. The registration of several data sources is however not trivial if the appearance of objects produced by each sensor differs substantially. This problem is further complicated when parallax effects cannot be ignored when using close-range stereo pairs. In this work, we present a new method to simultaneously tackle multispectral segmentation and stereo registration. Using an iterative procedure, we estimate the labeling result for one problem using the provisional result of the other. Our approach is based on the alternating minimization of two energy functions that are linked through the use of dynamic priors. We rely on the integration of shape and appearance cues to find proper multispectral correspondences, and to properly segment objects in low contrast regions. We also formulate our model as a frame processing pipeline using higher order terms to improve the temporal coherence of our results. Our method is evaluated under different configurations on multiple multispectral datasets, and our implementation is available online.Comment: Preprint accepted for publication in IJCV (December 2018

arXiv.org e-Print Archive