Surface Completion Using Laplacian Transform

Author: Kanongchaiyos Pizzanu
Vichitvejpaisal Pongsagon
Publication venue: 'Faculty of Engineering, Chulalongkorn University'
Publication date: 14/01/2014

Model acquisition processes usually produce incomplete surfaces due to technical constraints. This research presents an algorithm that performs surface completion using the context of the available surface. Previous work on surface completion does not handle surfaces with near-regular or irregular patterns well. The main goal of this research is to synthesize a surface for the hole whose context and geometric details are similar to those of the hole's surroundings. This research uses a multi-resolution approach to decompose the model into a low-frequency part and a high-frequency part. The low-frequency part is filled smoothly. The high-frequency part is transformed into Laplacian coordinates and filled using an example-based synthesis approach. The algorithm is tested on planar and curved surfaces with all kinds of relief patterns. The results indicate that the holes can be completed with geometric detail similar to that of the surrounding surface.

Engineering Journal (Faculty of Engineering, Chulalongkorn University, Bangkok)
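
The abstract above mentions transforming the high-frequency part of the model into Laplacian coordinates. As a rough illustration of that idea only, the sketch below computes uniform-weight Laplacian (differential) coordinates for a triangle mesh: each vertex minus the centroid of its one-ring neighbors. The uniform weighting, the function names, and the tiny test mesh are assumptions made for this example; the abstract does not say which weighting scheme or data structures the authors use.

```python
from typing import List, Sequence, Set, Tuple

Vec3 = Tuple[float, float, float]
Face = Tuple[int, int, int]


def build_adjacency(num_vertices: int, faces: Sequence[Face]) -> List[Set[int]]:
    """Collect the one-ring neighbors of every vertex from the triangle list."""
    neighbors: List[Set[int]] = [set() for _ in range(num_vertices)]
    for a, b, c in faces:
        neighbors[a].update((b, c))
        neighbors[b].update((a, c))
        neighbors[c].update((a, b))
    return neighbors


def laplacian_coordinates(vertices: Sequence[Vec3], faces: Sequence[Face]) -> List[Vec3]:
    """Uniform-weight Laplacian coordinates: vertex minus centroid of its neighbors.

    Assumption: uniform weights. Other schemes (e.g. cotangent weights) exist
    and may be what a real implementation uses.
    """
    neighbors = build_adjacency(len(vertices), faces)
    deltas: List[Vec3] = []
    for i, v in enumerate(vertices):
        ring = neighbors[i]
        if not ring:
            # Isolated vertex: no neighborhood, so no local detail to encode.
            deltas.append((0.0, 0.0, 0.0))
            continue
        cx = sum(vertices[j][0] for j in ring) / len(ring)
        cy = sum(vertices[j][1] for j in ring) / len(ring)
        cz = sum(vertices[j][2] for j in ring) / len(ring)
        deltas.append((v[0] - cx, v[1] - cy, v[2] - cz))
    return deltas


# Hypothetical test mesh: a unit square fan whose center vertex is raised by 0.25.
verts = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0), (1.0, 1.0, 0.0), (0.0, 1.0, 0.0),
         (0.5, 0.5, 0.25)]
tris = [(0, 1, 4), (1, 2, 4), (2, 3, 4), (3, 0, 4)]
deltas = laplacian_coordinates(verts, tris)
```

On this mesh the center vertex gets the Laplacian coordinate (0.0, 0.0, 0.25), which is the bump relative to its flat neighborhood. That is why such coordinates are a convenient carrier for local relief detail: they describe a vertex relative to its neighbors rather than by absolute position.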

High-Resolution Shape Completion Using Deep Neural Networks for Global Structure and Local Geometry Inference

Author: Han Xiaoguang
Huang Haibin
Kalogerakis Evangelos
Li Zhen
Yu Yizhou
Publication date: 01/01/2017

We propose a data-driven method for recovering missing parts of 3D shapes. Our method is based on a new deep learning architecture consisting of two sub-networks: a global structure inference network and a local geometry refinement network. The global structure inference network incorporates a long short-term memorized context fusion module (LSTM-CF) that infers the global structure of the shape based on multi-view depth information provided as part of the input. It also includes a 3D fully convolutional (3DFCN) module that further enriches the global structure representation according to volumetric information in the input. Under the guidance of the global structure network, the local geometry refinement network takes as input local 3D patches around missing regions, and progressively produces a high-resolution, complete surface through a volumetric encoder-decoder architecture. Our method jointly trains the global structure inference and local geometry refinement networks in an end-to-end manner. We perform qualitative and quantitative evaluations on six object categories, demonstrating that our method outperforms existing state-of-the-art work on shape completion. Comment: 8-page paper, 11 pages of supplementary material, ICCV spotlight paper.

arXiv.org e-Print Archive

HKU Scholars Hub

ScanComplete: Large-Scale Scene Completion and Semantic Segmentation for 3D Scans

Author: Bokeloh Martin
Dai Angela
Nießner Matthias
Reed Scott
Ritchie Daniel
Sturm Jürgen
Publication date: 27/03/2018

We introduce ScanComplete, a novel data-driven approach for taking an incomplete 3D scan of a scene as input and predicting a complete 3D model along with per-voxel semantic labels. The key contribution of our method is its ability to handle large scenes with varying spatial extent, managing the cubic growth in data size as scene size increases. To this end, we devise a fully-convolutional generative 3D CNN model whose filter kernels are invariant to the overall scene size. The model can be trained on scene subvolumes but deployed on arbitrarily large scenes at test time. In addition, we propose a coarse-to-fine inference strategy in order to produce high-resolution output while also leveraging large input context sizes. In an extensive series of experiments, we carefully evaluate different model design choices, considering both deterministic and probabilistic models for completion and semantic inference. Our results show that we outperform other methods not only in the size of the environments handled and processing efficiency, but also with regard to completion quality and semantic segmentation performance by a significant margin. Comment: Video: https://youtu.be/5s5s8iH0NF

arXiv.org e-Print Archive

Crossref

Occlusion-related lateral connections stabilize kinetic depth stimuli through perceptual coupling

Author: Holten Vivian
Klink P. Christiaan
Noest Andre J.
van den Berg Albert V.
van Wezel Richard J.A.
Publication venue: Association for Research in Vision and Ophthalmology
Publication date: 01/01/2009

Local sensory information is often ambiguous, forcing the brain to integrate spatiotemporally separated information for stable conscious perception. Lateral connections between clusters of similarly tuned neurons in the visual cortex are a potential neural substrate for the coupling of spatially separated visual information. Ecological optics suggests that perceptual coupling of visual information is particularly beneficial in occlusion situations. Here we present a novel neural network model and a series of human psychophysical experiments that can together explain the perceptual coupling of kinetic depth stimuli with activity-driven lateral information sharing in the far depth plane. Our most striking finding is the perceptual coupling of an ambiguous kinetic depth cylinder with a coaxially presented and disparity-defined cylinder backside, while a similar frontside fails to evoke coupling. Altogether, our findings are consistent with the idea that clusters of similarly tuned far depth neurons share spatially separated motion information in order to resolve local perceptual ambiguities. The classification of far depth in the facilitation mechanism results from a combination of absolute and relative depth that suggests a functional role of these lateral connections in the perception of partially occluded objects.

University of Twente Research Information

Shape Completion using 3D-Encoder-Predictor CNNs and Shape Synthesis

Author: Dai Angela
Nießner Matthias
Qi Charles Ruizhongtai
Publication date: 11/04/2017

We introduce a data-driven approach to complete partial 3D shapes through a combination of volumetric deep neural networks and 3D shape synthesis. From a partially-scanned input shape, our method first infers a low-resolution -- but complete -- output. To this end, we introduce a 3D-Encoder-Predictor Network (3D-EPN) which is composed of 3D convolutional layers. The network is trained to predict and fill in missing data, and operates on an implicit surface representation that encodes both known and unknown space. This allows us to predict global structure in unknown areas at high accuracy. We then correlate these intermediary results with 3D geometry from a shape database at test time. In a final pass, we propose a patch-based 3D shape synthesis method that imposes the 3D geometry from these retrieved shapes as constraints on the coarsely-completed mesh. This synthesis process enables us to reconstruct fine-scale detail and generate high-resolution output while respecting the global mesh structure obtained by the 3D-EPN. Although our 3D-EPN outperforms state-of-the-art completion methods, the main contribution of our work lies in the combination of a data-driven shape predictor and analytic 3D shape synthesis. In our results, we show extensive evaluations on a newly-introduced shape completion benchmark for both real-world and synthetic data.

arXiv.org e-Print Archive

Crossref

Deep Depth Completion of a Single RGB-D Image

Author: Funkhouser Thomas
Zhang Yinda
Publication date: 01/01/2018

The goal of our work is to complete the depth channel of an RGB-D image. Commodity-grade depth cameras often fail to sense depth for shiny, bright, transparent, and distant surfaces. To address this problem, we train a deep network that takes an RGB image as input and predicts dense surface normals and occlusion boundaries. Those predictions are then combined with raw depth observations provided by the RGB-D camera to solve for depths for all pixels, including those missing in the original observation. This method was chosen over others (e.g., inpainting depths directly) as the result of extensive experiments with a new depth completion benchmark dataset, where holes are filled in training data through the rendering of surface reconstructions created from multiview RGB-D scans. Experiments with different network inputs, depth representations, loss functions, optimization methods, inpainting methods, and deep depth estimation networks show that our proposed approach provides better depth completions than these alternatives. Comment: Accepted by CVPR2018 (Spotlight). Project webpage: http://deepcompletion.cs.princeton.edu/ This version includes supplementary materials which provide more implementation details, quantitative evaluation, and qualitative results. Due to file size limit, please check project website for high-res paper.

arXiv.org e-Print Archive

Princeton University Open Access Repository

Crossref

Im2Pano3D: Extrapolating 360 Structure and Semantics Beyond the Field of View

Author: Chang Angel X.
Funkhouser Thomas
Savarese Silvio
Savva Manolis
Song Shuran
Zeng Andy
Publication date: 12/12/2017

We present Im2Pano3D, a convolutional neural network that generates a dense prediction of 3D structure and a probability distribution of semantic labels for a full 360 panoramic view of an indoor scene when given only a partial observation (<= 50%) in the form of an RGB-D image. To make this possible, Im2Pano3D leverages strong contextual priors learned from large-scale synthetic and real-world indoor scenes. To ease the prediction of 3D structure, we propose to parameterize 3D surfaces with their plane equations and train the model to predict these parameters directly. To provide meaningful training supervision, we use multiple loss functions that consider both pixel level accuracy and global context consistency. Experiments demonstrate that Im2Pano3D is able to predict the semantics and 3D structure of the unobserved scene with more than 56% pixel accuracy and less than 0.52m average distance error, which is significantly better than alternative approaches. Comment: Video summary: https://youtu.be/Au3GmktK-S

arXiv.org e-Print Archive

Princeton University Open Access Repository

Crossref
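
The Im2Pano3D abstract above describes parameterizing 3D surfaces by their plane equations instead of predicting depth directly. As a hedged illustration of why that is workable, the sketch below recovers the 3D point seen along a viewing ray from a per-pixel plane. It assumes the plane is written as n · p = d with the camera at the origin; the exact parameterization, the function name, and the argument layout are assumptions of this example and are not taken from the paper.

```python
from typing import Optional, Tuple

Vec3 = Tuple[float, float, float]


def dot(a: Vec3, b: Vec3) -> float:
    """Euclidean dot product of two 3-vectors."""
    return a[0] * b[0] + a[1] * b[1] + a[2] * b[2]


def point_on_plane_along_ray(ray: Vec3, normal: Vec3, offset: float,
                             eps: float = 1e-9) -> Optional[Vec3]:
    """Intersect a camera ray with the plane normal . p = offset.

    The camera is assumed to sit at the origin, so points on the ray are
    p = t * ray with t > 0. Returns None when the ray is (nearly) parallel
    to the plane or when the intersection lies behind the camera.
    """
    denom = dot(normal, ray)
    if abs(denom) < eps:
        return None  # ray never meets the plane
    t = offset / denom
    if t <= 0.0:
        return None  # intersection is behind the camera
    return (t * ray[0], t * ray[1], t * ray[2])


# Hypothetical example: a wall at z = 2 seen along two different pixel rays.
wall_normal = (0.0, 0.0, 1.0)
wall_offset = 2.0
straight = point_on_plane_along_ray((0.0, 0.0, 1.0), wall_normal, wall_offset)
oblique = point_on_plane_along_ray((1.0, 0.0, 1.0), wall_normal, wall_offset)
```

Both rays land on the same wall, at (0.0, 0.0, 2.0) and (2.0, 0.0, 2.0), even though their depths along the ray differ. Pixels on one planar surface therefore share a single set of plane parameters, which is a plausible reason such a target could be easier to predict than raw per-pixel depth.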