    Multi Spectral Imaging System

    An optical imaging system provides automatic co-registration of a plurality of multispectral images of an object which are generated by a plurality of video cameras or other optical detectors. The imaging system includes a modular assembly of beam splitters, lens tubes, camera lenses and wavelength-selective filters which facilitates easy reconfiguration and adjustment of the system for various applications. A primary lens assembly generates a real image of the object to be imaged on a reticle which is positioned at a fixed length from a beam splitter assembly. The beam splitter assembly separates a collimated image beam received from the reticle into multiple image beams, each of which is projected onto a corresponding one of a plurality of video cameras. The lens tubes which connect the beam splitter assembly to the cameras are adjustable in length to provide automatic co-registration of the images generated by each camera.
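
    The patent achieves co-registration optically, by adjusting the lens tube lengths. As a purely illustrative software analogue (not the patented method), the sketch below refines the alignment of one spectral channel against another using OpenCV's ECC algorithm; the translation-only motion model and the function name are assumptions.

        # Hypothetical software refinement of channel-to-channel registration;
        # the patented system does this optically, not in software.
        import cv2
        import numpy as np

        def coregister_channel(reference, moving, iterations=200, eps=1e-6):
            """Align `moving` to `reference` with ECC (translation-only model)."""
            ref = reference.astype(np.float32)
            mov = moving.astype(np.float32)
            warp = np.eye(2, 3, dtype=np.float32)     # identity initialization
            criteria = (cv2.TERM_CRITERIA_EPS | cv2.TERM_CRITERIA_COUNT,
                        iterations, eps)
            _, warp = cv2.findTransformECC(ref, mov, warp,
                                           cv2.MOTION_TRANSLATION, criteria)
            h, w = reference.shape
            return cv2.warpAffine(moving, warp, (w, h), flags=cv2.INTER_LINEAR)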

    Fine-To-Coarse Global Registration of RGB-D Scans

    RGB-D scanning of indoor environments is important for many applications, including real estate, interior design, and virtual reality. However, it is still challenging to register RGB-D images from a hand-held camera over a long video sequence into a globally consistent 3D model. Current methods can often lose tracking or drift, and thus fail to reconstruct salient structures in large environments (e.g., parallel walls in different rooms). To address this problem, we propose a "fine-to-coarse" global registration algorithm that leverages robust registrations at finer scales to seed detection and enforcement of new correspondence and structural constraints at coarser scales. To test global registration algorithms, we provide a benchmark with 10,401 manually-clicked point correspondences in 25 scenes from the SUN3D dataset. During experiments with this benchmark, we find that our fine-to-coarse algorithm registers long RGB-D sequences better than previous methods.
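
    A minimal sketch of the kind of evaluation such a benchmark enables, assuming each clicked correspondence comes with the estimated frame-to-world pose of its frame (array shapes and names are illustrative, not the paper's code):

        import numpy as np

        def correspondence_rmse(pts_a, pts_b, poses_a, poses_b):
            """pts_*: (N, 3) clicked points in their frames' local coordinates;
            poses_*: (N, 4, 4) estimated frame-to-world transforms."""
            def to_world(pts, poses):
                homo = np.concatenate([pts, np.ones((len(pts), 1))], axis=1)
                return np.einsum('nij,nj->ni', poses, homo)[:, :3]
            err = np.linalg.norm(to_world(pts_a, poses_a) -
                                 to_world(pts_b, poses_b), axis=1)
            return np.sqrt((err ** 2).mean())    # lower means better registration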

    Patient-specific bronchoscope simulation with pq-space-based 2D/3D registration

    Objective: The use of patient-specific models for surgical simulation requires photorealistic rendering of 3D structure and surface properties. For bronchoscope simulation, this requires augmenting virtual bronchoscope views generated from 3D tomographic data with patient-specific bronchoscope videos. To facilitate matching of video images to the geometry extracted from 3D tomographic data, this paper presents a new pq-space-based 2D/3D registration method for camera pose estimation in bronchoscope tracking. Methods: The proposed technique involves the extraction of surface normals for each pixel of the video images by using a linear local shape-from-shading algorithm derived from the unique camera/lighting constraints of the endoscopes. The resultant pq-vectors are then matched to those of the 3D model by differentiation of the z-buffer. A similarity measure based on angular deviations of the pq-vectors is used to provide a robust 2D/3D registration framework. Localization of tissue deformation is considered by assessing the temporal variation of the pq-vectors between subsequent frames. Results: The accuracy of the proposed method was assessed by using an electromagnetic tracker and a specially constructed airway phantom. Preliminary in vivo validation of the proposed method was performed on a matched patient bronchoscope video sequence and 3D CT data. Comparison to existing intensity-based techniques was also made. Conclusion: The proposed method does not involve explicit feature extraction and is relatively immune to illumination changes. The temporal variation of the pq distribution also permits the identification of localized deformation, which offers an effective way of excluding such areas from the registration process.
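
    A simplified sketch of the pq-space idea under illustrative assumptions: the p and q gradients are recovered from the rendered z-buffer by finite differences, and the similarity score is the mean cosine of the angle between per-pixel video-derived and model-derived surface normals.

        import numpy as np

        def pq_from_zbuffer(z):
            """Finite-difference gradients p = dz/dx, q = dz/dy of a depth map."""
            q, p = np.gradient(z)        # np.gradient returns (d/drow, d/dcol)
            return p, q

        def pq_similarity(p_video, q_video, p_model, q_model):
            """Mean cosine of the angle between per-pixel normals (-p, -q, 1)."""
            n_v = np.dstack([-p_video, -q_video, np.ones_like(p_video)])
            n_m = np.dstack([-p_model, -q_model, np.ones_like(p_model)])
            cos = (n_v * n_m).sum(axis=2) / (np.linalg.norm(n_v, axis=2) *
                                             np.linalg.norm(n_m, axis=2))
            return cos.mean()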

    Enhanced video indirect ophthalmoscopy (VIO) via robust mosaicing

    Indirect ophthalmoscopy (IO) is the standard of care for evaluation of the neonatal retina. When recorded on video from a head-mounted camera, IO images have low quality and a narrow field of view (FOV). We present an image fusion methodology for converting a video IO recording into a single, high-quality, wide-FOV mosaic that seamlessly blends the best frames in the video. To this end, we have developed fast and robust algorithms for automatic evaluation of video quality, artifact detection and removal, vessel mapping, registration, and multi-frame image fusion. Our experiments show the effectiveness of the proposed methods.
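
    The abstract does not specify the quality measure; one common proxy, used here purely as an assumption, is to rank frames by sharpness via the variance of the Laplacian and keep the best ones:

        import cv2

        def best_frames(video_path, keep=50):
            """Return the `keep` sharpest frames of a video, sharpest first."""
            cap = cv2.VideoCapture(video_path)
            scored = []
            while True:
                ok, frame = cap.read()
                if not ok:
                    break
                gray = cv2.cvtColor(frame, cv2.COLOR_BGR2GRAY)
                scored.append((cv2.Laplacian(gray, cv2.CV_64F).var(), frame))
            cap.release()
            scored.sort(key=lambda s: s[0], reverse=True)
            return [frame for _, frame in scored[:keep]]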

    A system for synthetic vision and augmented reality in future flight decks

    Rockwell Science Center is investigating novel human-computer interaction techniques for enhancing situational awareness in future flight decks. One aspect is to provide intuitive displays that present the vital information and spatial awareness by augmenting the real world with an overlay of relevant information registered to the real world. Such Augmented Reality (AR) techniques can be employed during bad-weather scenarios to permit flying under Visual Flight Rules (VFR) in conditions which would normally require Instrument Flight Rules (IFR). These systems could easily be implemented on head-up displays (HUDs). The advantage of AR systems over purely synthetic vision (SV) systems is that the pilot can relate the information overlay to real objects in the world, whereas SV systems provide a constant virtual view in which inconsistencies can hardly be detected. The development of components for such a system led to a demonstrator implemented on a PC. A camera grabs video images, which are overlaid with registered information. Orientation of the camera is obtained from an inclinometer and a magnetometer; position is acquired from GPS. In a possible implementation in an airplane, the on-board attitude information can be used for obtaining correct registration. If visibility is sufficient, computer vision modules can be used to fine-tune the registration by matching visual cues with database features. This technology would be especially useful for landing approaches. The current demonstrator provides a frame rate of 15 fps, using a live video feed as background with an overlay of avionics symbology in the foreground. In addition, terrain rendering from a 1 arc sec digital elevation model database can be overlaid to provide synthetic vision in case of limited visibility. For true outdoor testing (at ground level), the system has been implemented on a wearable computer.
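
    A hedged sketch of the registration step described above: projecting a geo-referenced point into the camera image given position from GPS (assumed already converted to a local Cartesian frame), heading from the magnetometer, and pitch/roll from the inclinometer. The frame conventions and the intrinsics matrix K are illustrative assumptions.

        import numpy as np

        def rotation(yaw, pitch, roll):
            """Camera-to-world rotation from yaw/pitch/roll (radians)."""
            cy, sy = np.cos(yaw), np.sin(yaw)
            cp, sp = np.cos(pitch), np.sin(pitch)
            cr, sr = np.cos(roll), np.sin(roll)
            Rz = np.array([[cy, -sy, 0], [sy, cy, 0], [0, 0, 1]])
            Ry = np.array([[cp, 0, sp], [0, 1, 0], [-sp, 0, cp]])
            Rx = np.array([[1, 0, 0], [0, cr, -sr], [0, sr, cr]])
            return Rz @ Ry @ Rx

        def project(point_world, cam_pos, yaw, pitch, roll, K):
            """Pixel coordinates of a world point; the camera looks along +z
            at zero attitude (a convention chosen for this sketch)."""
            p_cam = rotation(yaw, pitch, roll).T @ (point_world - cam_pos)
            uvw = K @ p_cam
            return uvw[:2] / uvw[2]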

    Web-based Simulation and Training Environment for Laparoscopic Camera Calibration

    Endoscopic cameras are increasingly employed in image-guided procedures, where the video images must be registered to data from other modalities. However, such cameras are susceptible to distortions, requiring calibration before images can be used for registration, tracking, and 3D reconstruction. Camera calibration is learned in a laboratory setting, where configuring and adjusting the physical setup is tedious and not necessarily conducive to learning. A centralized resource that utilizes 3D interactive components needs to be available for training on camera calibration. In this project, a web-based training environment for camera calibration, called SimCAM, is implemented. SimCAM was developed using the Web Graphics Library (WebGL), the Open Computer Vision (OpenCV) library, and custom software components. WebGL and OpenCV were used to simulate camera distortions and the calibration task. The main contributions include the implementation and validation of SimCAM. SimCAM was validated with a content validity study, where it was found to be useful as an introduction to camera calibration. Future work involves improving the supporting material and implementing more features, such as uncertainty propagation.
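
    For reference, the physical task SimCAM simulates boils down to a standard OpenCV chessboard calibration; the board size and file names below are placeholders.

        import glob
        import cv2
        import numpy as np

        pattern = (9, 6)                 # inner corners of the chessboard
        objp = np.zeros((pattern[0] * pattern[1], 3), np.float32)
        objp[:, :2] = np.mgrid[0:pattern[0], 0:pattern[1]].T.reshape(-1, 2)

        obj_points, img_points = [], []
        for name in glob.glob('calib_*.png'):      # placeholder image set
            gray = cv2.imread(name, cv2.IMREAD_GRAYSCALE)
            found, corners = cv2.findChessboardCorners(gray, pattern)
            if found:
                obj_points.append(objp)
                img_points.append(corners)

        rms, K, dist, _, _ = cv2.calibrateCamera(
            obj_points, img_points, gray.shape[::-1], None, None)
        print('reprojection RMS (pixels):', rms)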

    Aerial video geo-registration using terrain models from dense and coherent stereo matching

    In the context of aerial imagery, one of the first steps toward a coherent processing of the information contained in multiple images is geo-registration, which consists in assigning geographic 3D coordinates to the pixels of the image. This enables accurate alignment and geo-positioning of multiple images, detection of moving objects, and fusion of data acquired from multiple sensors. To solve this problem, existing approaches require, in addition to a precise characterization of the camera sensor, high-resolution referenced images or terrain elevation models, which are usually not publicly available or are out of date. Building upon the idea of developing technology that does not need a reference terrain elevation model, we propose a geo-registration technique that applies variational methods to obtain a dense and coherent surface elevation model that is used to replace the reference model. The surface elevation model is built by interpolation of scattered 3D points, which are obtained in a two-step process following a classical stereo pipeline: first, coherent disparity maps between image pairs of a video sequence are estimated, and then image point correspondences are back-projected. The proposed variational method enforces continuity of the disparity map not only along epipolar lines (as done by previous geo-registration techniques) but also across them, in the full 2D image domain. In the experiments, aerial images from synthetic video sequences have been used to validate the proposed technique.
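
    A sketch of the final interpolation step, assuming the back-projected correspondences are available as an (N, 3) array of geo-referenced points; SciPy's griddata stands in for whatever interpolator the authors actually use.

        import numpy as np
        from scipy.interpolate import griddata

        def elevation_model(points_3d, grid_step=1.0):
            """Dense elevation grid from scattered (x, y, z) points."""
            x, y, z = points_3d.T
            xi = np.arange(x.min(), x.max(), grid_step)
            yi = np.arange(y.min(), y.max(), grid_step)
            grid_x, grid_y = np.meshgrid(xi, yi)
            return griddata((x, y), z, (grid_x, grid_y), method='linear')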

    Enhancing Video Deblurring using Efficient Fourier Aggregation

    Video deblurring is the process of removing blur from all the video frames to achieve the required level of smoothness. Numerous recent approaches attempt to remove image blur due to camera shake, either with one or multiple input images, by explicitly solving an inverse and inherently ill-posed deconvolution problem. An efficient video deblurring system is proposed to handle blur due to a shaky camera as well as complex motion blur due to moving objects. The proposed algorithm is strikingly simple: it performs a weighted average in the Fourier domain, with weights depending on the Fourier spectrum magnitude. The method can be seen as a generalization of the align-and-average procedure, with a weighted average, motivated by hand-shake physiology and theoretically supported, taking place in the Fourier domain. The method's rationale is that camera shake has a random nature, and therefore each image in the burst is generally blurred differently. The proposed system effectively deblurs the video, and results show that the reconstructed video is sharper and less noisy than the original. The proposed Fourier Burst Accumulation algorithm produces similar or better results than state-of-the-art multi-image deconvolution while being significantly faster and with a lower memory footprint. The method is robust to moving objects thanks to its consistent registration scheme.
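
    The weighted Fourier averaging the abstract describes can be written in a few lines of NumPy. This is a minimal sketch: the frames are assumed to be pre-aligned grayscale arrays, the exponent p (how strongly sharp frames are favored; values around 11 appear in the Fourier Burst Accumulation literature) is a tuning knob, and the published method additionally stabilizes the weights, which is omitted here.

        import numpy as np

        def fourier_burst_accumulation(frames, p=11):
            """Fuse a burst of aligned grayscale frames in the Fourier domain."""
            ffts = [np.fft.fft2(f) for f in frames]
            weights = np.stack([np.abs(F) ** p for F in ffts])
            weights /= weights.sum(axis=0, keepdims=True) + 1e-12
            fused = sum(w * F for w, F in zip(weights, ffts))
            return np.real(np.fft.ifft2(fused))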

    Video Registration in Egocentric Vision under Day and Night Illumination Changes

    With the spread of wearable devices and head-mounted cameras, a wide range of applications requiring precise user localization is now possible. In this paper we propose to treat the problem of obtaining the user position with respect to a known environment as a video registration problem. Video registration, i.e. the task of aligning an input video sequence to a pre-built 3D model, relies on a matching process of local keypoints extracted from the query sequence to a 3D point cloud. The overall registration performance is strictly tied to the actual quality of this 2D-3D matching, and can degrade if environmental conditions such as steep changes in lighting, like those between day and night, occur. To effectively register an egocentric video sequence under these conditions, we propose to tackle the source of the problem: the matching process. To overcome the shortcomings of standard matching techniques, we introduce a novel embedding space that allows us to obtain robust matches by jointly taking into account local descriptors, their spatial arrangement, and their temporal robustness. The proposal is evaluated using unconstrained egocentric video sequences, both in terms of matching quality and resulting registration performance, using different 3D models of historical landmarks. The results show that the proposed method can outperform state-of-the-art registration algorithms, in particular when dealing with the challenges of night and day sequences.
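
    The proposed embedding itself is not reproduced here; the sketch below shows the baseline 2D-3D matching-plus-pose pipeline that such a method builds on, assuming each 3D point of the model carries a local descriptor. All names and parameters are illustrative.

        import cv2
        import numpy as np

        def register_frame(kp_desc, kp_xy, model_desc, model_xyz, K, ratio=0.8):
            """Match query descriptors to the point cloud, then solve for pose."""
            matcher = cv2.BFMatcher(cv2.NORM_L2)
            pts2d, pts3d = [], []
            for m, n in matcher.knnMatch(kp_desc, model_desc, k=2):
                if m.distance < ratio * n.distance:    # Lowe's ratio test
                    pts2d.append(kp_xy[m.queryIdx])
                    pts3d.append(model_xyz[m.trainIdx])
            ok, rvec, tvec, inliers = cv2.solvePnPRansac(
                np.float32(pts3d), np.float32(pts2d), K, None)
            return (rvec, tvec) if ok else None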

    Deep-sea image processing

    High-resolution seafloor mapping often requires optical methods of sensing to confirm interpretations made from sonar data. Optical digital imagery of seafloor sites can now provide very high resolution and also provides additional cues, such as color information for sediments, biota, and diverse rock types. During cruise AT11-7 of the Woods Hole Oceanographic Institution (WHOI) vessel R/V Atlantis (February 2004, East Pacific Rise), visual imagery was acquired from three sources: (1) a digital still down-looking camera mounted on the submersible Alvin, (2) observer-operated 1- and 3-chip video cameras with tilt and pan capabilities mounted on the front of Alvin, and (3) a digital still camera on the WHOI TowCam (Fornari, 2003). Imagery from the first source collected on a previous cruise (AT7-13) to the Galapagos Rift at 86°W was successfully processed and mosaicked post-cruise, resulting in a single image covering an area of about 2000 sq. m at a resolution of 3 mm per pixel (Rzhanov et al., 2003). This paper addresses the issues of optimal acquisition of visual imagery in deep-sea conditions and the requirements for on-board processing. Shipboard processing of digital imagery allows for reviewing collected imagery immediately after the dive, evaluating its importance, optimizing acquisition parameters, and augmenting acquisition of data over specific sites on subsequent dives. Images from the DeepSea Power and Light (DSPL) digital camera offer the best resolution (3.3 megapixels) and are taken at an interval of 10 seconds (determined by the strobe's recharge rate). This makes the images suitable for mosaicking only when Alvin moves slowly (<1/4 kt), which is not always possible for time-critical missions. Video cameras provided a source of imagery more suitable for mosaicking, despite their inferior resolution. We discuss the required pre-processing and image enhancement techniques and their influence on the interpretation of mosaic content. An algorithm for determining camera tilt parameters from acquired imagery is proposed, and robustness conditions are discussed.
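
    As a simplified illustration of the pairwise registration underlying mosaicking (the actual pipeline also involves enhancement and tilt estimation), the sketch below estimates a homography between consecutive frames from ORB feature matches; all parameters are assumptions.

        import cv2
        import numpy as np

        def pairwise_homography(img_a, img_b, max_features=2000):
            """Homography mapping img_a onto img_b, estimated with RANSAC."""
            orb = cv2.ORB_create(max_features)
            kp_a, des_a = orb.detectAndCompute(img_a, None)
            kp_b, des_b = orb.detectAndCompute(img_b, None)
            matches = cv2.BFMatcher(cv2.NORM_HAMMING,
                                    crossCheck=True).match(des_a, des_b)
            src = np.float32([kp_a[m.queryIdx].pt for m in matches]).reshape(-1, 1, 2)
            dst = np.float32([kp_b[m.trainIdx].pt for m in matches]).reshape(-1, 1, 2)
            H, _ = cv2.findHomography(src, dst, cv2.RANSAC, 3.0)
            return H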