Search CORE

12,165 research outputs found

An Immersive Telepresence System using RGB-D Sensors and Head Mounted Display

Author: Lu Xinzhong
Perugini Saverio
Shen Ju
Yang Jianjun
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 20/11/2015

We present a tele-immersive system that enables people to interact with each other in a virtual world using body gestures in addition to verbal communication. Beyond the obvious applications, including general online conversations and gaming, we hypothesize that our proposed system would be particularly beneficial to education by offering rich visual content and interactivity. One distinct feature is the integration of egocentric pose recognition, which allows participants to use their gestures to demonstrate and manipulate virtual objects simultaneously. This functionality enables the instructor to effectively and efficiently explain and illustrate complex concepts or sophisticated problems in an intuitive manner. The highly interactive and flexible environment can capture and sustain more student attention than the traditional classroom setting and thus delivers a compelling experience to the students. Our main focus here is to investigate possible solutions for the system design and implementation and to devise strategies for fast, efficient computation suitable for visual data processing and network transmission. We describe the technique and experiments in detail and provide quantitative performance results, demonstrating that our system can be run comfortably and reliably for different application scenarios. Our preliminary results are promising and demonstrate the potential for more compelling directions in cyberlearning. Comment: IEEE International Symposium on Multimedia 201

arXiv.org e-Print Archive

University of Dayton

Optical techniques for 3D surface reconstruction in computer-assisted laparoscopic surgery

Author: A. Bartoli
A. Groch
A. Kolb
D. Elson
D. Stoyanov
H. Elhawary
J. Sorger
L. Maier-Hein
M. Rodrigues
P. Mountney
S. Speidel
Publication venue: 'Elsevier BV'
Publication date: 03/05/2013

One of the main challenges for computer-assisted surgery (CAS) is to determine the intra-operative morphology and motion of soft tissues. This information is a prerequisite to the registration of multi-modal patient-specific data for enhancing the surgeon’s navigation capabilities by observing beyond exposed tissue surfaces and for providing intelligent control of robotic-assisted instruments. In minimally invasive surgery (MIS), optical techniques are an increasingly attractive approach for in vivo 3D reconstruction of the soft-tissue surface geometry. This paper reviews the state-of-the-art methods for optical intra-operative 3D reconstruction in laparoscopic surgery and discusses the technical challenges and future perspectives towards clinical translation. With the recent paradigm shift of surgical practice towards MIS and new developments in 3D optical imaging, this is a timely discussion about technologies that could facilitate complex CAS procedures in dynamic and deformable anatomical regions.

Crossref

Sheffield Hallam University Research Archive

UCL Discovery

Spiral - Imperial College Digital Repository

Binocular Eye Movements Are Adapted to the Natural Environment.

Author: Banks Martin S
Gibaldi Agostino
Publication venue: eScholarship, University of California
Publication date: 01/04/2019

Humans and many animals make frequent saccades requiring coordinated movements of the eyes. When landing on the new fixation point, the eyes must converge accurately or double images will be perceived. We asked whether the visual system uses statistical regularities in the natural environment to aid eye alignment at the end of saccades. We measured the distribution of naturally occurring disparities in different parts of the visual field. The central tendency of the distributions was crossed (nearer than fixation) in the lower field and uncrossed (farther) in the upper field in male and female participants. It was uncrossed in the left and right fields. We also measured horizontal vergence after completion of vertical, horizontal, and oblique saccades. When the eyes first landed near the eccentric target, vergence was quite consistent with the natural-disparity distribution. For example, when making an upward saccade, the eyes diverged to be aligned with the most probable uncrossed disparity in that part of the visual field. Likewise, when making a downward saccade, the eyes converged to enable alignment with crossed disparity in that part of the field. Our results show that rapid binocular eye movements are adapted to the statistics of the 3D environment, minimizing the need for large corrective vergence movements at the end of saccades. The results are relevant to the debate about whether eye movements are derived from separate saccadic and vergence neural commands that control both eyes or from separate monocular commands that control the eyes independently.

SIGNIFICANCE STATEMENT: We show that the human visual system incorporates statistical regularities in the visual environment to enable efficient binocular eye movements. We define the oculomotor horopter: the surface of 3D positions to which the eyes initially move when stimulated by eccentric targets. The observed movements maximize the probability of accurate fixation as the eyes move from one position to another. This is the first study to show quantitatively that binocular eye movements conform to 3D scene statistics, thereby enabling efficient processing. The results provide greater insight into the neural mechanisms underlying the planning and execution of saccadic eye movements.

eScholarship - University of California

Sketching space

Author: Chapman D
Penn A
Turner A
Publication date: 01/01/2000

In this paper, we present a sketch modelling system which we call Stilton. The program resembles a desktop VRML browser, allowing a user to navigate a three-dimensional model in a perspective projection, or panoramic photographs, which the program maps onto the scene as a 'floor' and 'walls'. We place an imaginary two-dimensional drawing plane in front of the user, and any geometric information that the user sketches onto this plane may be reconstructed to form solid objects through an optimization process. We show how the system can be used to reconstruct geometry from panoramic images, or to add new objects to an existing model. While panoramic imaging can greatly assist with some aspects of site familiarization and qualitative assessment of a site, without the addition of some foreground geometry it offers only limited utility in a design context. Therefore, we suggest that the system may be of use in 'just-in-time' CAD recovery of complex environments, such as shop floors or construction sites, by recovering objects through sketched overlays, where other methods such as automatic line-retrieval may be impossible. The result of using the system in this manner is the 'sketching of space' - sketching out a volume around the user - and once the geometry has been recovered, the designer is free to quickly sketch design ideas into the newly constructed context, or analyze the space around them. Although end-user trials have not yet been undertaken, we believe that this implementation may afford a user interface that is both accessible and robust, and that the rapid growth of pen-computing devices will further stimulate activity in this area.

CiteSeerX

UCL Discovery

RGBD Datasets: Past, Present and Future

Author: Firman Michael
Publication date: 13/04/2016

Since the launch of the Microsoft Kinect, scores of RGBD datasets have been released. These have propelled advances in areas from reconstruction to gesture recognition. In this paper we explore the field, reviewing datasets across eight categories: semantics, object pose estimation, camera tracking, scene reconstruction, object tracking, human actions, faces and identification. By extracting relevant information in each category we help researchers to find appropriate data for their needs, and we consider which datasets have succeeded in driving computer vision forward and why. Finally, we examine the future of RGBD datasets. We identify key areas which are currently underexplored, and suggest that future directions may include synthetic data and dense reconstructions of static and dynamic scenes. Comment: 8 pages excluding references (CVPR style)

arXiv.org e-Print Archive

Crossref

A mixed reality telepresence system for collaborative space operation

Author: Campion SP
Fairchild AJ
Fernando TP
Garcia Jimenez AS
Roberts DJ
Wolf R
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/04/2017

This paper presents a Mixed Reality system that results from the integration of a telepresence system and an application to improve collaborative space exploration. The system combines free viewpoint video with immersive projection technology to support non-verbal communication, including eye gaze, inter-personal distance and facial expression. Importantly, these can be interpreted together as people move around the simulation, maintaining natural social distance. The application is a simulation of Mars, within which the collaborators must come to agreement over, for example, where the Rover should land and go. The first contribution is the creation of a Mixed Reality system supporting contextualization of non-verbal communication. Two technological contributions are prototyping a technique to subtract a person from a background that may contain physical objects and/or moving images, and a lightweight texturing method for multi-view rendering which provides balance in terms of visual and temporal quality. A practical contribution is the demonstration of pragmatic approaches to sharing space between display systems of distinct levels of immersion. A research tool contribution is a system that allows comparison of conventional authored and video-based reconstructed avatars, within an environment that encourages exploration and social interaction. Aspects of system quality, including the communication of facial expression and end-to-end latency, are reported.

University of Salford Institutional Repository

Crossref