3D Shape Estimation from 2D Landmarks: A Convex Relaxation Approach

Author: Daniilidis Kostas
Hu Xiaoyan
Leonardos Spyridon
Zhou Xiaowei
Publication date: 01/06/2015

We investigate the problem of estimating the 3D shape of an object given a set of 2D landmarks in a single image. To alleviate the reconstruction ambiguity, a widely used approach is to confine the unknown 3D shape within a shape space built upon existing shapes. While this approach has proven successful in various applications, a challenging issue remains: the joint estimation of shape parameters and camera-pose parameters requires solving a nonconvex optimization problem. Existing methods often adopt an alternating minimization scheme to locally update the parameters, and consequently the solution is sensitive to initialization. In this paper, we propose a convex formulation to address this problem and develop an efficient algorithm to solve the proposed convex program. We demonstrate the exact recovery property of the proposed method, its merits compared to alternative methods, and its applicability to human pose and car shape estimation. Comment: In Proceedings of CVPR 201

arXiv.org e-Print Archive

Crossref

AMRA: Augmented Reality Assistance for Train Maintenance Tasks

Author: Bourgeois Steve
Didier Jean-Yves
Hocquard Arnaud
Leroux Christophe
Mallem Malik
Mégard Christine
Naudet Sylvie
Otmane Samir
Pham Quoc-Cuong
Roussel David
Publication venue: HAL CCSD
Publication date: 05/10/2005

The AMRA project, carried out by a consortium of industrial and research partners, aims to implement an Augmented Reality (AR) system for mobile use in industrial applications such as train maintenance and repair at industrial sites. The adopted solution is a video see-through system in which a tablet PC is used as an augmented window. The overall architecture of a prototype is presented, and its key points are detailed. For instance, a visual registration system has been developed to accurately overlay information on a video stream. Robust, real-time registration is performed using a single camera attached to the tablet PC. In addition, a hierarchical description of maintenance procedures is set up and enriched with new media such as photos, video and/or 3D models. These 3D models have been specially tailored to meet maintenance task requirements. The resulting multimedia content allows easy access to technical documentation through a man-machine interface managing a multimedia engine. All these features have been combined in the AMRA prototype, which has been evaluated by a maintenance operator.

HAL Evry

HAL-CEA

Real-Time Seamless Single Shot 6D Object Pose Prediction

Author: Fua Pascal
Sinha Sudipta N.
Tekin Bugra
Publication date: 14/03/2018

We propose a single-shot approach for simultaneously detecting an object in an RGB image and predicting its 6D pose without requiring multiple stages or having to examine multiple hypotheses. Unlike a recently proposed single-shot technique for this task (Kehl et al., ICCV'17), which only predicts an approximate 6D pose that must then be refined, ours is accurate enough not to require additional post-processing. As a result, it is much faster (50 fps on a Titan X (Pascal) GPU) and more suitable for real-time processing. The key component of our method is a new CNN architecture inspired by the YOLO network design that directly predicts the 2D image locations of the projected vertices of the object's 3D bounding box. The object's 6D pose is then estimated using a PnP algorithm. For single-object and multiple-object pose estimation on the LINEMOD and OCCLUSION datasets, our approach substantially outperforms other recent CNN-based approaches when they are all used without post-processing. During post-processing, a pose refinement step can be used to boost the accuracy of the existing methods, but at 10 fps or less, they are much slower than our method. Comment: CVPR 201

arXiv.org e-Print Archive

Infoscience - École polytechnique fédérale de Lausanne

Crossref
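
The abstract above says the network predicts the 2D image locations of the projected vertices of the object's 3D bounding box, and that a PnP algorithm then recovers the 6D pose. As a rough illustration of the geometry involved (not the authors' implementation), the sketch below computes the forward direction: given a pose, where the eight box corners land in the image. A PnP solver inverts this mapping. The function names `box_corners` and `project`, the axis-aligned box, and the pinhole intrinsics `fx, fy, cx, cy` are all assumptions made for this example.

```python
from itertools import product


def box_corners(min_pt, max_pt):
    """Return the 8 corners of an axis-aligned 3D box in object coordinates."""
    xs, ys, zs = zip(min_pt, max_pt)  # (min, max) pair per axis
    return [(x, y, z) for x, y, z in product(xs, ys, zs)]


def project(points, R, t, fx, fy, cx, cy):
    """Project 3D object points to pixels with a pinhole camera.

    R is a 3x3 rotation (list of rows) and t a translation; together they
    are the 6D pose mapping object coordinates to camera coordinates.
    """
    pixels = []
    for p in points:
        # Camera-frame coordinates: X_c = R * p + t
        xc, yc, zc = (
            sum(R[i][j] * p[j] for j in range(3)) + t[i] for i in range(3)
        )
        # Perspective division followed by the intrinsic scaling and offset
        pixels.append((fx * xc / zc + cx, fy * yc / zc + cy))
    return pixels


if __name__ == "__main__":
    identity = [[1, 0, 0], [0, 1, 0], [0, 0, 1]]
    corners = box_corners((-1, -1, -1), (1, 1, 1))
    for uv in project(corners, identity, (0, 0, 5), 500, 500, 320, 240):
        print(uv)
```

In practice the inverse step (2D corner locations to pose) would be handled by an existing PnP solver rather than written by hand; this sketch only shows which 2D-3D correspondences such a solver consumes.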

Augmented reality meeting table: a novel multi-user interface for architectural design

Author: A. A. Argyros
A. Colquhoun
A. Penn
A. Turner
B. Hillier
B. Lawson
B. Wanstall
C. Jones
D.A. Schoen
G. Stiny
H. Rittel
H. Simon
H.T. Regenbrecht
J.F.H. Fuchs
L. Archer
N. Navab
N.J. Habraken
P. Checkland
R. Venturi
R.T. Azuma
S. Feiner
S. Weghorst
Publication venue: Springer Science and Business Media LLC
Publication date: 01/01/2004

Immersive virtual environments have received widespread attention as possible replacements for the media and systems that designers traditionally use and, more generally, as support for collaborative work. Relatively little attention has been given to date, however, to the problem of how to merge immersive virtual environments into real-world work settings, and so to add to the media at the disposal of the designer and the design team rather than to replace them. In this paper we report on a research project in which optical see-through augmented reality displays have been developed together with prototype decision support software for architectural and urban design. We suggest that a critical characteristic of multi-user augmented reality is its ability to generate visualisations from a first-person perspective in which the scale of rendition of the design model follows many of the conventions that designers are used to. Different scales of model appear to allow designers to focus on different aspects of the design under consideration. Augmenting the scene with simulations of pedestrian movement appears to assist both in scale recognition and in moving from a first-person to a third-person understanding of the design. This research project is funded by the European Commission IST program (IST-2000-28559).