    What do we perceive in a glance of a real-world scene?

    What do we see when we glance at a natural scene, and how does it change as the glance becomes longer? We asked naive subjects to report in a free-form format what they saw when looking at briefly presented real-life photographs. Our subjects received no specific information as to the content of each stimulus. Thus, our paradigm differs from previous studies in which subjects were cued before a picture was presented and/or were probed with multiple-choice questions. In the first stage, 90 novel grayscale photographs were foveally shown to a group of 22 native-English-speaking subjects. The presentation time was chosen at random from a set of seven possible times (from 27 to 500 ms). A perceptual mask followed each photograph immediately. After each presentation, subjects reported what they had just seen as completely and truthfully as possible. In the second stage, another group of naive individuals was instructed to score each of the descriptions produced by the subjects in the first stage. Individual scores were assigned to more than a hundred different attributes. We show that within a single glance, much object- and scene-level information is perceived by human subjects. The richness of our perception, though, seems asymmetrical. Subjects show a propensity toward perceiving natural scenes as outdoor rather than indoor. The reporting of sensory- or feature-level information of a scene (such as shading and shape) consistently precedes the reporting of semantic-level information. But once subjects recognize more semantic-level components of a scene, there is little evidence of any bias toward either scene-level or object-level recognition.
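    A toy illustration of the second-stage analysis described above, in which scored attributes are aggregated by presentation time to compare sensory- and semantic-level reporting; the records, attribute levels, and scores below are invented placeholders, not the study's data.

```python
# Minimal sketch (not the authors' analysis code): aggregate attribute scores
# by presentation time to compare sensory- vs. semantic-level reporting.
# All records below are invented placeholders.
from collections import defaultdict
from statistics import mean

# Each record: (presentation_time_ms, attribute_level, score)
records = [
    (27, "sensory", 0.6), (27, "semantic", 0.1),
    (107, "sensory", 0.8), (107, "semantic", 0.5),
    (500, "sensory", 0.9), (500, "semantic", 0.85),
]

by_time_level = defaultdict(list)
for t, level, score in records:
    by_time_level[(t, level)].append(score)

for (t, level), scores in sorted(by_time_level.items()):
    print(f"{t:>4} ms  {level:<8} mean score = {mean(scores):.2f}")
```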

    Observed methods of cuneiform tablet reconstruction in virtual and real world environments

    The reconstruction of fragmented artefacts is a tedious process that consumes many valuable hours of scholars' time. We believe that such work can be made more efficient via new techniques in interactive virtual environments. The purpose of this research is to explore approaches to the reconstruction of cuneiform tablets in real and virtual environments, and to address the potential barriers to virtual reconstruction of fragments. In this paper we present the results of an experiment exploring the reconstruction strategies employed by individual users working with tablet fragments in real and virtual environments. Our findings identify physical factors that users consider important to the reconstruction process and explore the subjective usefulness of stereoscopic 3D in that process. Our results, presented as dynamic graphs of interaction, compare the precise order of movement and rotation interactions, and the frequency of interaction achieved by successful and unsuccessful participants, with some surprising insights. We present evidence that certain interaction styles and behaviours characterise success in the reconstruction process.
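    To make the kind of data behind those dynamic interaction graphs concrete, here is a rough sketch of tallying movement and rotation events per participant from an event log; the participant IDs, event names, and log format are assumptions, not the study's actual instrumentation.

```python
# Illustrative sketch only: count movement and rotation interactions per
# participant from a hypothetical event log of the virtual environment.
from collections import Counter, defaultdict

# (participant_id, event_type) pairs streamed from a reconstruction session
event_log = [
    ("P01", "move"), ("P01", "rotate"), ("P01", "move"),
    ("P02", "rotate"), ("P02", "rotate"), ("P02", "move"),
]

counts = defaultdict(Counter)
for participant, event in event_log:
    counts[participant][event] += 1

for participant, c in sorted(counts.items()):
    total = sum(c.values())
    print(participant, dict(c), f"rotation share = {c['rotate'] / total:.2f}")
```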

    Shape Completion using 3D-Encoder-Predictor CNNs and Shape Synthesis

    We introduce a data-driven approach to complete partial 3D shapes through a combination of volumetric deep neural networks and 3D shape synthesis. From a partially-scanned input shape, our method first infers a low-resolution -- but complete -- output. To this end, we introduce a 3D-Encoder-Predictor Network (3D-EPN) which is composed of 3D convolutional layers. The network is trained to predict and fill in missing data, and operates on an implicit surface representation that encodes both known and unknown space. This allows us to predict global structure in unknown areas with high accuracy. We then correlate these intermediary results with 3D geometry from a shape database at test time. In a final pass, we propose a patch-based 3D shape synthesis method that imposes the 3D geometry from these retrieved shapes as constraints on the coarsely-completed mesh. This synthesis process enables us to reconstruct fine-scale detail and generate high-resolution output while respecting the global mesh structure obtained by the 3D-EPN. Although our 3D-EPN outperforms state-of-the-art completion methods, the main contribution of our work lies in the combination of a data-driven shape predictor and analytic 3D shape synthesis. In our results, we show extensive evaluations on a newly-introduced shape completion benchmark for both real-world and synthetic data.
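    As a rough illustration of the encoder-predictor idea (not the authors' released architecture), a small PyTorch network over a voxel grid with a value channel and a known/unknown mask channel might look like this; the layer sizes and the 32^3 resolution are placeholders.

```python
# Minimal sketch of a 3D encoder-predictor network over a voxel grid, in the
# spirit of the 3D-EPN described above. Layer sizes are illustrative only.
# Input: 2 channels (implicit surface value + known/unknown mask).
import torch
import torch.nn as nn

class TinyEPN(nn.Module):
    def __init__(self):
        super().__init__()
        self.encoder = nn.Sequential(
            nn.Conv3d(2, 16, kernel_size=4, stride=2, padding=1),   # 32^3 -> 16^3
            nn.ReLU(inplace=True),
            nn.Conv3d(16, 32, kernel_size=4, stride=2, padding=1),  # 16^3 -> 8^3
            nn.ReLU(inplace=True),
        )
        self.predictor = nn.Sequential(
            nn.ConvTranspose3d(32, 16, kernel_size=4, stride=2, padding=1),  # 8^3 -> 16^3
            nn.ReLU(inplace=True),
            nn.ConvTranspose3d(16, 1, kernel_size=4, stride=2, padding=1),   # 16^3 -> 32^3
        )

    def forward(self, x):
        return self.predictor(self.encoder(x))

net = TinyEPN()
partial_scan = torch.randn(1, 2, 32, 32, 32)  # one 32^3 partial volume
completed = net(partial_scan)                 # (1, 1, 32, 32, 32) coarse completion
```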

    Cumulative object categorization in clutter

    In this paper we present an approach based on scene- or part-graphs for geometrically categorizing touching and occluded objects. We use additive RGBD feature descriptors and hashing of graph configuration parameters to describe the spatial arrangement of constituent parts. The presented experiments show quantitatively that this method outperforms our earlier part-voting and sliding-window classification. We evaluated our approach on cluttered scenes, using a 3D dataset containing over 15,000 Kinect scans of more than 100 objects grouped into general geometric categories. Additionally, color, geometric, and combined features were compared for categorization tasks.
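    A rough sketch of the hashing idea, in which quantized part-pair configurations serve as keys so that category votes can be accumulated additively; the features, bin sizes, and votes below are placeholders rather than the paper's actual descriptors.

```python
# Rough sketch (not the authors' code): hash quantized part-pair configurations
# so spatial arrangements of parts can be looked up and votes accumulated.
from collections import defaultdict

def config_key(distance, angle, dist_bin=0.05, angle_bin=15.0):
    """Quantize a pairwise part configuration into a hashable key."""
    return (round(distance / dist_bin), round(angle / angle_bin))

votes = defaultdict(lambda: defaultdict(float))  # key -> category -> accumulated vote

# Training: accumulate votes for configurations observed in labelled scans
votes[config_key(0.12, 30.0)]["mug"] += 1.0
votes[config_key(0.40, 85.0)]["chair"] += 1.0

# Test time: sum votes over all part pairs in the scene, pick the best category
def categorize(pairs):
    tally = defaultdict(float)
    for d, a in pairs:
        for category, v in votes[config_key(d, a)].items():
            tally[category] += v
    return max(tally, key=tally.get) if tally else None

print(categorize([(0.11, 29.0), (0.13, 31.0)]))  # -> "mug"
```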

    Web App for Tools Inventory Management with Predictive Categorization

    This project, ‘Web App for Tools Inventory Management with Predictive Categorization’, was proposed by Mr. Muhamad Hamzah bin Razali. Its main purpose is to develop a web application, named ‘Drillclinic’, that digitizes the inventory management process for tools by providing predictive tool categorization and assigning a Data Matrix code to each real-world tool.
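    One plausible (but assumed) way to implement the predictive categorization component is a small text classifier over tool names; the tool names, categories, and scikit-learn model below are illustrative only, not the project's actual design.

```python
# Illustrative sketch only: predict a tool category from its name with a
# simple scikit-learn text classifier. Data and model choice are assumptions.
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.naive_bayes import MultinomialNB
from sklearn.pipeline import make_pipeline

tool_names = ["6mm drill bit", "10mm drill bit", "torque wrench", "socket wrench"]
categories = ["drilling", "drilling", "fastening", "fastening"]

model = make_pipeline(TfidfVectorizer(), MultinomialNB())
model.fit(tool_names, categories)

print(model.predict(["8mm drill bit"])[0])  # expected: "drilling"
```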

    Matterport3D: Learning from RGB-D Data in Indoor Environments

    Access to large, diverse RGB-D datasets is critical for training RGB-D scene understanding algorithms. However, existing datasets still cover only a limited number of views or a restricted scale of spaces. In this paper, we introduce Matterport3D, a large-scale RGB-D dataset containing 10,800 panoramic views from 194,400 RGB-D images of 90 building-scale scenes. Annotations are provided with surface reconstructions, camera poses, and 2D and 3D semantic segmentations. The precise global alignment and comprehensive, diverse panoramic set of views over entire buildings enable a variety of supervised and self-supervised computer vision tasks, including keypoint matching, view overlap prediction, normal prediction from color, semantic segmentation, and region classification.
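    A minimal sketch of iterating over per-frame color, depth, and pose data for a dataset organized like Matterport3D; the directory layout and file naming below are hypothetical placeholders rather than the actual release format.

```python
# Minimal sketch: iterate over RGB-D frames with camera poses. The directory
# layout and file names here are hypothetical, not the Matterport3D format.
from pathlib import Path
import numpy as np
from PIL import Image

def load_frames(scene_dir):
    scene = Path(scene_dir)
    for color_path in sorted(scene.glob("color/*.jpg")):
        frame_id = color_path.stem
        color = np.asarray(Image.open(color_path))                           # H x W x 3 uint8
        depth = np.asarray(Image.open(scene / "depth" / f"{frame_id}.png"))  # H x W uint16 depth
        pose = np.loadtxt(scene / "pose" / f"{frame_id}.txt")                # 4 x 4 camera-to-world
        yield frame_id, color, depth, pose

for frame_id, color, depth, pose in load_frames("scans/scene_0001"):
    print(frame_id, color.shape, depth.shape, pose.shape)
```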