Search CORE

632 research outputs found

Cascaded Scene Flow Prediction using Semantic Segmentation

Author: Kautz Jan
Ren Zhile
Sudderth Erik B.
Sun Deqing
Publication venue
Publication date: 05/10/2017
Field of study

Given two consecutive frames from a pair of stereo cameras, 3D scene flow methods simultaneously estimate the 3D geometry and motion of the observed scene. Many existing approaches use superpixels for regularization, but may predict inconsistent shapes and motions inside rigidly moving objects. We instead assume that scenes consist of foreground objects rigidly moving in front of a static background, and use semantic cues to produce pixel-accurate scene flow estimates. Our cascaded classification framework accurately models 3D scenes by iteratively refining semantic segmentation masks, stereo correspondences, 3D rigid motion estimates, and optical flow fields. We evaluate our method on the challenging KITTI autonomous driving benchmark, and show that accounting for the motion of segmented vehicles leads to state-of-the-art performance.Comment: International Conference on 3D Vision (3DV), 2017 (oral presentation

arXiv.org e-Print Archive

Crossref

eScholarship - University of California

3D Dynamic Scene Reconstruction from Multi-View Image Sequences

Author: Leung Carlos
Publication venue
Publication date: 01/03/2003
Field of study

A confirmation report outlining my PhD research plan is presented. The PhD research topic is 3D dynamic scene reconstruction from multiple view image sequences. Chapter 1 describes the motivation and research aims. An overview of the progress in the past year is included. Chapter 2 is a review of volumetric scene reconstruction techniques and Chapter 3 is an in-depth description of my proposed reconstruction method. The theory behind the proposed volumetric scene reconstruction method is also presented, including topics in projective geometry, camera calibration and energy minimization. Chapter 4 presents the research plan and outlines the future work planned for the next two years

University of Queensland eSpace

Multilinear Wavelets: A Statistical Shape Space for Human Faces

Author: A. Salazar
A. Savran
C. Creusot
C. Davatzikos
D. Liu
D. Nain
D. Vlasic
F. Lecron
F.B. Haar ter
G. Tam
I. Kakadiaris
I. Mpiperis
M. Bertram
M. Smet De
M.A.O. Vasilescu
W. Sweldens
Publication venue
Publication date: 01/01/2014
Field of study

We present a statistical model for

3

D human faces in varying expression, which decomposes the surface of the face using a wavelet transform, and learns many localized, decorrelated multilinear models on the resulting coefficients. Using this model we are able to reconstruct faces from noisy and occluded

3

D face scans, and facial motion sequences. Accurate reconstruction of face shape is important for applications such as tele-presence and gaming. The localized and multi-scale nature of our model allows for recovery of fine-scale detail while retaining robustness to severe noise and occlusion, and is computationally efficient and scalable. We validate these properties experimentally on challenging data in the form of static scans and motion sequences. We show that in comparison to a global multilinear model, our model better preserves fine detail and is computationally faster, while in comparison to a localized PCA model, our model better handles variation in expression, is faster, and allows us to fix identity parameters for a given subject.Comment: 10 pages, 7 figures; accepted to ECCV 201

arXiv.org e-Print Archive

Crossref

Fraunhofer-ePrints

From light rays to 3D models

Author: Donné Simon
Publication venue: Faculty of Engineering and Architecture Ghent University
Publication date: 01/01/2018
Field of study

Ghent University Academic Bibliography

Augmented reality based real-time subcutaneous vein imaging system

Author: Ai Danni
Fan Jingfan
Shao Ling
Shen Jianbing
Song Xianzheng
Wang Yongtian
Yang Jian
Zhao Yitian
Publication venue: 'The Optical Society'
Publication date: 13/06/2016
Field of study

A novel 3D reconstruction and fast imaging system for subcutaneous veins by augmented reality is presented. The study was performed to reduce the failure rate and time required in intravenous injection by providing augmented vein structures that back-project superimposed veins on the skin surface of the hand. Images of the subcutaneous vein are captured by two industrial cameras with extra reflective near-infrared lights. The veins are then segmented by a multiple-feature clustering method. Vein structures captured by the two cameras are matched and reconstructed based on the epipolar constraint and homographic property. The skin surface is reconstructed by active structured light with spatial encoding values and fusion displayed with the reconstructed vein. The vein and skin surface are both reconstructed in the 3D space. Results show that the structures can be precisely back-projected to the back of the hand for further augmented display and visualization. The overall system performance is evaluated in terms of vein segmentation, accuracy of vein matching, feature points distance error, duration times, accuracy of skin reconstruction, and augmented display. All experiments are validated with sets of real vein data. The imaging and augmented system produces good imaging and augmented reality results with high speed

Northumbria Research Link

PubMed Central

Realtime Dynamic 3D Facial Reconstruction for Monocular Video In-the-Wild

Author: Liu Shuang
Wang Z.
Yang Xiaosong
Zhang Jian J.
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 19/01/2018
Field of study

With the increasing amount of videos recorded using 2D mobile cameras, the technique for recovering the 3D dynamic facial models from these monocular videos has become a necessity for many image and video editing applications. While methods based parametric 3D facial models can reconstruct the 3D shape in dynamic environment, large structural changes are ignored. Structure-from-motion methods can reconstruct these changes but assume the object to be static. To address this problem we present a novel method for realtime dynamic 3D facial tracking and reconstruction from videos captured in uncontrolled environments. Our method can track the deforming facial geometry and reconstruct external objects that protrude from the face such as glasses and hair. It also allows users to move around, perform facial expressions freely without degrading the reconstruction quality

Bournemouth University Research Online

Visualization and Correction of Automated Segmentation, Tracking and Lineaging from 5-D Stem Cell Image Sequences

Author: Bjornsson Chris
Cohen Andrew
Goderie Susan
Kokovay Erzsebet
Temple Sally
Wait Eric
Wang Yue
Winter Mark
Publication venue
Publication date: 01/01/2014
Field of study

Results: We present an application that enables the quantitative analysis of multichannel 5-D (x, y, z, t, channel) and large montage confocal fluorescence microscopy images. The image sequences show stem cells together with blood vessels, enabling quantification of the dynamic behaviors of stem cells in relation to their vascular niche, with applications in developmental and cancer biology. Our application automatically segments, tracks, and lineages the image sequence data and then allows the user to view and edit the results of automated algorithms in a stereoscopic 3-D window while simultaneously viewing the stem cell lineage tree in a 2-D window. Using the GPU to store and render the image sequence data enables a hybrid computational approach. An inference-based approach utilizing user-provided edits to automatically correct related mistakes executes interactively on the system CPU while the GPU handles 3-D visualization tasks. Conclusions: By exploiting commodity computer gaming hardware, we have developed an application that can be run in the laboratory to facilitate rapid iteration through biological experiments. There is a pressing need for visualization and analysis tools for 5-D live cell image data. We combine accurate unsupervised processes with an intuitive visualization of the results. Our validation interface allows for each data set to be corrected to 100% accuracy, ensuring that downstream data analysis is accurate and verifiable. Our tool is the first to combine all of these aspects, leveraging the synergies obtained by utilizing validation information from stereo visualization to improve the low level image processing tasks.Comment: BioVis 2014 conferenc

arXiv.org e-Print Archive

Crossref

Springer - Publisher Connector

Točna 3D rekonstrukcija zasnovana na rotirajućoj platformi i telecentričnoj viziji

Author: Hugo Jiménez-Hernández
José-Guadalupe Rico-Espino
José-Joel Gonzalez-Barbosa
Miguel Razo
Ricardo Gonzalez-Barbosa
Roberto-Augusto Gómez-Loenzo
Publication venue: 'KOREMA'
Publication date: 01/01/2015
Field of study

This paper presents a camera+telecentric lens that is able to obtain 3D information. We designed and implemented a method which can register and integrate 3D information captured from different viewpoints to build a complete 3D object model. First, a geometric model of a camera+telecentric lens is established. Then a calibration process using a planar checkerboard is developed and implemented. The object is placed on a rotation stage in front of a stationary camera. Normally the rotation axis is considered to be aligned with camera frame. In the description presented in this paper, the rotation matrix and translation vector of the rotation axis are calibrated. At the same time, a three-dimensional reconstruction system based on contour extraction of objects with dimensions less than 50 mm in diameter is developed. Finally, an analysis of the uncertainty model parameters and performance reconstruction of 3D objects are discussed.Članak prestavlja sustav koji se sastoji od kamere i telecentrične leće koji omogućavaju dobivanje 3D informacije o objektu. Dizajnirana je i implementirana metoda koja može registrirati i integrirati 3D informacije iz različitih točaka gledišta, kako bi se izgradio potpuni 3D model. Na početku, uspostavlja se geometrijski model kamere i telecentrične leće. Nakon toga koristi se razvijena metoda kalibracije zasnovana na šahovskoj ploči te se objekt postavlja na rotirajuću platformu ispred stacionarne kamere. Također, pretpostavlja se da je os rotacije poravnta s koordinantim sustavom kamere. U ovome članku kalibriraju se rotacijska matrica i translacijski vektor rotacijske osi. Razvijen je i sustav 3D rekonstrukcija zasnovan na izlučivanju kontura objekta dimenzija manjih od 50 mm u promjeru. Na kraju, provedena je i analiza nesigurnosti parametara modela kao i točnost rekonstrukcije 3D modela

HRČAK - Portal of Croatian Scientific and Professional Journals

Hrčak - Portal of scientific journals of Croatia

Recommended from our members

Camera positioning for 3D panoramic image rendering

Author: Audu Abdulkadir Iyyaka
Publication venue: Brunel University London
Publication date: 01/01/2015
Field of study

This thesis was submitted for the degree of Doctor of Philosophy and awarded by Brunel University London.Virtual camera realisation and the proposition of trapezoidal camera architecture are the two broad contributions of this thesis. Firstly, multiple camera and their arrangement constitute a critical component which affect the integrity of visual content acquisition for multi-view video. Currently, linear, convergence, and divergence arrays are the prominent camera topologies adopted. However, the large number of cameras required and their synchronisation are two of prominent challenges usually encountered. The use of virtual cameras can significantly reduce the number of physical cameras used with respect to any of the known camera structures, hence adequately reducing some of the other implementation issues. This thesis explores to use image-based rendering with and without geometry in the implementations leading to the realisation of virtual cameras. The virtual camera implementation was carried out from the perspective of depth map (geometry) and use of multiple image samples (no geometry). Prior to the virtual camera realisation, the generation of depth map was investigated using region match measures widely known for solving image point correspondence problem. The constructed depth maps have been compare with the ones generated using the dynamic programming approach. In both the geometry and no geometry approaches, the virtual cameras lead to the rendering of views from a textured depth map, construction of 3D panoramic image of a scene by stitching multiple image samples and performing superposition on them, and computation of virtual scene from a stereo pair of panoramic images. The quality of these rendered images were assessed through the use of either objective or subjective analysis in Imatest software. Further more, metric reconstruction of a scene was performed by re-projection of the pixel points from multiple image samples with a single centre of projection. This was done using sparse bundle adjustment algorithm. The statistical summary obtained after the application of this algorithm provides a gauge for the efficiency of the optimisation step. The optimised data was then visualised in Meshlab software environment, hence providing the reconstructed scene. Secondly, with any of the well-established camera arrangements, all cameras are usually constrained to the same horizontal plane. Therefore, occlusion becomes an extremely challenging problem, and a robust camera set-up is required in order to resolve strongly the hidden part of any scene objects. To adequately meet the visibility condition for scene objects and given that occlusion of the same scene objects can occur, a multi-plane camera structure is highly desirable. Therefore, this thesis also explore trapezoidal camera structure for image acquisition. The approach here is to assess the feasibility and potential of several physical cameras of the same model being sparsely arranged on the edge of an efficient trapezoid graph. This is implemented both Matlab and Maya. The quality of the depth maps rendered in Matlab are better in Quality

Brunel University Research Archive