Search CORE

6,181 research outputs found

Video Registration in Egocentric Vision under Day and Night Illumination Changes

Author: Alletto Stefano
Cucchiara Rita
Serra Giuseppe
Publication venue
Publication date: 28/07/2016
Field of study

With the spread of wearable devices and head mounted cameras, a wide range of application requiring precise user localization is now possible. In this paper we propose to treat the problem of obtaining the user position with respect to a known environment as a video registration problem. Video registration, i.e. the task of aligning an input video sequence to a pre-built 3D model, relies on a matching process of local keypoints extracted on the query sequence to a 3D point cloud. The overall registration performance is strictly tied to the actual quality of this 2D-3D matching, and can degrade if environmental conditions such as steep changes in lighting like the ones between day and night occur. To effectively register an egocentric video sequence under these conditions, we propose to tackle the source of the problem: the matching process. To overcome the shortcomings of standard matching techniques, we introduce a novel embedding space that allows us to obtain robust matches by jointly taking into account local descriptors, their spatial arrangement and their temporal robustness. The proposal is evaluated using unconstrained egocentric video sequences both in terms of matching quality and resulting registration performance using different 3D models of historical landmarks. The results show that the proposed method can outperform state of the art registration algorithms, in particular when dealing with the challenges of night and day sequences

arXiv.org e-Print Archive

Crossref

Archivio istituzionale della ricerca - Università degli Studi di Udine

Archivio istituzionale della ricerca - Università di Modena e Reggio Emilia

A surgical system for automatic registration, stiffness mapping and dynamic image overlay

Author: Choset Howie
Li Lu
Patath Kartik
Qian Jianing
Salman Hadi
Saxena Saumya
Srivatsan Rangaprasad Arun
Xu Mengyun
Zevallos Nicolas
Publication venue
Publication date: 23/11/2017
Field of study

In this paper we develop a surgical system using the da Vinci research kit (dVRK) that is capable of autonomously searching for tumors and dynamically displaying the tumor location using augmented reality. Such a system has the potential to quickly reveal the location and shape of tumors and visually overlay that information to reduce the cognitive overload of the surgeon. We believe that our approach is one of the first to incorporate state-of-the-art methods in registration, force sensing and tumor localization into a unified surgical system. First, the preoperative model is registered to the intra-operative scene using a Bingham distribution-based filtering approach. An active level set estimation is then used to find the location and the shape of the tumors. We use a recently developed miniature force sensor to perform the palpation. The estimated stiffness map is then dynamically overlaid onto the registered preoperative model of the organ. We demonstrate the efficacy of our system by performing experiments on phantom prostate models with embedded stiff inclusions.Comment: International Symposium on Medical Robotics (ISMR 2018

arXiv.org e-Print Archive

Crossref

3D Visual Perception for Self-Driving Cars using a Multi-Camera System: Calibration, Mapping, Localization, and Obstacle Detection

Author: Fraundorfer Friedrich
Furgale Paul
Heng Lionel
Häne Christian
Lee Gim Hee
Pollefeys Marc
Sattler Torsten
Publication venue
Publication date: 31/08/2017
Field of study

Cameras are a crucial exteroceptive sensor for self-driving cars as they are low-cost and small, provide appearance information about the environment, and work in various weather conditions. They can be used for multiple purposes such as visual navigation and obstacle detection. We can use a surround multi-camera system to cover the full 360-degree field-of-view around the car. In this way, we avoid blind spots which can otherwise lead to accidents. To minimize the number of cameras needed for surround perception, we utilize fisheye cameras. Consequently, standard vision pipelines for 3D mapping, visual localization, obstacle detection, etc. need to be adapted to take full advantage of the availability of multiple cameras rather than treat each camera individually. In addition, processing of fisheye images has to be supported. In this paper, we describe the camera calibration and subsequent processing pipeline for multi-fisheye-camera systems developed as part of the V-Charge project. This project seeks to enable automated valet parking for self-driving cars. Our pipeline is able to precisely calibrate multi-camera systems, build sparse 3D maps for visual navigation, visually localize the car with respect to these maps, generate accurate dense maps, as well as detect obstacles based on real-time depth map extraction

arXiv.org e-Print Archive

Institute of Transport Research:Publications

Crossref

FLAT2D: Fast localization from approximate transformation into 2D

Author: Goeddel Robert
Kershaw Carl
Olson Edwin
SERAFIN JACOPO
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2016
Field of study

Many autonomous vehicles require precise localization into a prior map in order to support planning and to leverage semantic information within those maps (e.g. that the right lane is a turn-only lane.) A popular approach in automotive systems is to use infrared intensity maps of the ground surface to localize, making them susceptible to failures when the surface is obscured by snow or when the road is repainted. An emerging alternative is to localize based on the 3D structure around the vehicle; these methods are robust to these types of changes, but the maps are costly both in terms of storage and the computational cost of matching. In this paper, we propose a fast method for localizing based on 3D structure around the vehicle using a 2D representation. This representation retains many of the advantages of "full" matching in 3D, but comes with dramatically lower space and computational requirements. We also introduce a variation of Graph-SLAM tailored to support localization, allowing us to make use of graph-based error-recovery techniques in our localization estimate. Finally, we present real-world localization results for both an indoor mobile robotic platform and an autonomous golf cart, demonstrating that autonomous vehicles do not need full 3D matching to accurately localize in the environment

Crossref

Archivio della ricerca- Università di Roma La Sapienza

Keyframe-based monocular SLAM: design, survey, and future directions

Author: Asmar Daniel
Shammas Elie
Younes Georges
Zelek John
Publication venue: 'Elsevier BV'
Publication date: 01/12/2017
Field of study

Extensive research in the field of monocular SLAM for the past fifteen years has yielded workable systems that found their way into various applications in robotics and augmented reality. Although filter-based monocular SLAM systems were common at some time, the more efficient keyframe-based solutions are becoming the de facto methodology for building a monocular SLAM system. The objective of this paper is threefold: first, the paper serves as a guideline for people seeking to design their own monocular SLAM according to specific environmental constraints. Second, it presents a survey that covers the various keyframe-based monocular SLAM systems in the literature, detailing the components of their implementation, and critically assessing the specific strategies made in each proposed solution. Third, the paper provides insight into the direction of future research in this field, to address the major limitations still facing monocular SLAM; namely, in the issues of illumination changes, initialization, highly dynamic motion, poorly textured scenes, repetitive textures, map maintenance, and failure recovery

arXiv.org e-Print Archive

University of Waterloo's Institutional Repository

Optical techniques for 3D surface reconstruction in computer-assisted laparoscopic surgery

Author: A. Bartoli
A. Groch
A. Kolb
Ali
Audette
Bachta
Bailey
Barnard
Baumhauer
Benincasa
Besl
Blake
Bogatyrenko
Bronstein
Brown
Burschka
Böhme
Cash
Cash
Chen
Chen
Chen
Chen
Clancy
Clancy
Clatz
Cleary
Clements
Criminisi
Cryer
D. Elson
D. Stoyanov
Dumpuri
Durrant-Whyte
Elhawary
Falk
Faugeras
Fayad
Feuerstein
Fichtinger
Foix
Fuchs
Galvez-Lopez
Giannarou
Ginhoux
Glocker
Gorthi
Gudmundsson
H. Elhawary
Haneishi
Hartley
Hayashibe
Horn
Hu
Huhle
Huhle
Ieiri
Iftimia
J. Sorger
Jannin
Jannin
Jerabkova
Jin
Kolmogorov
Konishi
Kowalczuk
L. Maier-Hein
Lindner
Lindner
Lipman
M. Rodrigues
Maier-Hein
Marchesseau
Marescaux
Markelj
Marr
Marr
Marvik
Megali
Mersmann
Mezger
Miller
Mirota
Mountney
Mutter
Nalpantidis
Nicolau
Nozaki
Okatani
Ortmaier
P. Mountney
Pavlidis
Perriollat
Pilet
Pizarro
Placht
Pluim
Pratt
Rauth
Richa
Robinson
Röhl
S. Speidel
Salvi
Salzmann
Sauvee
Schaller
Scharstein
Schmalz
Shekhar
Simpfendorfer
Simpson
Soper
Stoyanov
Su
Szpala
Taffinder
Thrun
Thrun
Totz
Ukimura
Ullman
van Kaick
Vigneron
Warren
Wentz
Wittek
Wittek
Wolf
Wu
Wu
Wu
Wöhler
Yip
Yoon
Zhang
Zhang
Zhu
Publication venue: 'Elsevier BV'
Publication date: 03/05/2013
Field of study

One of the main challenges for computer-assisted surgery (CAS) is to determine the intra-opera- tive morphology and motion of soft-tissues. This information is prerequisite to the registration of multi-modal patient-specific data for enhancing the surgeon’s navigation capabilites by observ- ing beyond exposed tissue surfaces and for providing intelligent control of robotic-assisted in- struments. In minimally invasive surgery (MIS), optical techniques are an increasingly attractive approach for in vivo 3D reconstruction of the soft-tissue surface geometry. This paper reviews the state-of-the-art methods for optical intra-operative 3D reconstruction in laparoscopic surgery and discusses the technical challenges and future perspectives towards clinical translation. With the recent paradigm shift of surgical practice towards MIS and new developments in 3D opti- cal imaging, this is a timely discussion about technologies that could facilitate complex CAS procedures in dynamic and deformable anatomical regions

Crossref

Sheffield Hallam University Research Archive

UCL Discovery

Spiral - Imperial College Digital Repository