Cognitive visual tracking and camera control
Cognitive visual tracking is the process of observing and understanding the behaviour of a moving person. This paper presents an efficient solution to extract, in real time, high-level information from an observed scene, and to generate the most appropriate commands for a set of pan-tilt-zoom (PTZ) cameras in a surveillance scenario. Such a high-level feedback control loop, which is the main novelty of our work, serves to reduce uncertainties in the observed scene and to maximize the amount of information extracted from it. It is implemented with a distributed camera system using SQL tables as virtual communication channels, and Situation Graph Trees for knowledge representation, inference and high-level camera control. A set of experiments in a surveillance scenario shows the effectiveness of our approach and its potential for real applications of cognitive vision.
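The idea of using SQL tables as virtual communication channels can be illustrated with a minimal sketch. It assumes a single SQLite table that a reasoning process writes PTZ commands into and that a camera process polls. The table name `ptz_commands`, its columns and the helper functions are hypothetical and are not taken from the paper.

```python
import sqlite3

def open_channel(path=":memory:"):
    """Open (or create) the hypothetical command table used as a channel."""
    conn = sqlite3.connect(path)
    conn.execute(
        "CREATE TABLE IF NOT EXISTS ptz_commands ("
        "id INTEGER PRIMARY KEY AUTOINCREMENT, "
        "camera TEXT NOT NULL, pan REAL, tilt REAL, zoom REAL, "
        "consumed INTEGER NOT NULL DEFAULT 0)"
    )
    return conn

def send_command(conn, camera, pan, tilt, zoom):
    """Writer side: the high-level reasoning process posts a command."""
    conn.execute(
        "INSERT INTO ptz_commands (camera, pan, tilt, zoom) VALUES (?, ?, ?, ?)",
        (camera, pan, tilt, zoom),
    )
    conn.commit()

def receive_commands(conn, camera):
    """Reader side: a camera process fetches and marks its pending commands."""
    rows = conn.execute(
        "SELECT id, pan, tilt, zoom FROM ptz_commands "
        "WHERE camera = ? AND consumed = 0 ORDER BY id",
        (camera,),
    ).fetchall()
    conn.executemany(
        "UPDATE ptz_commands SET consumed = 1 WHERE id = ?",
        [(row[0],) for row in rows],
    )
    conn.commit()
    return [row[1:] for row in rows]
```

Marking rows as consumed instead of deleting them keeps a log of issued commands, which is one possible design choice among several.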
Navigation without localisation: reliable teach and repeat based on the convergence theorem
We present a novel concept for teach-and-repeat visual navigation. The
proposed concept is based on a mathematical model, which indicates that in
teach-and-repeat navigation scenarios, mobile robots do not need to perform
explicit localisation. Rather than that, a mobile robot which repeats a
previously taught path can simply `replay' the learned velocities, while using
its camera information only to correct its heading relative to the intended
path. To support our claim, we establish a position error model of a robot,
which traverses a taught path by only correcting its heading. Then, we outline
a mathematical proof which shows that this position error does not diverge over
time. Based on the insights from the model, we present a simple monocular
teach-and-repeat navigation method. The method is computationally efficient, it
does not require camera calibration, and it can learn and autonomously traverse
arbitrarily-shaped paths. In a series of experiments, we demonstrate that the
method can reliably guide mobile robots in realistic indoor and outdoor
conditions, and can cope with imperfect odometry, landmark deficiency,
illumination variations and naturally-occurring environment changes.
Furthermore, we provide the navigation system and the datasets gathered at
http://www.github.com/gestom/stroll_bearnav.
Comment: The paper will be presented at IROS 2018 in Madrid
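The claim that heading correction alone keeps the position error bounded can be illustrated with a toy model. The sketch below is not the paper's error model: it assumes that, on each straight segment, heading correction multiplies the lateral error by a constant factor `shrink` (a hypothetical parameter), while the along-path error is left unchanged because velocities are simply replayed. Odometry noise is ignored.

```python
import math

def repeat_lap(err_x, err_y, segment_headings, shrink=0.5):
    """Propagate a position error through the segments of a taught path.

    Assumption (toy model): heading correction scales the lateral error by
    `shrink` on every segment; the along-path error is not corrected.
    """
    for heading in segment_headings:
        c, s = math.cos(heading), math.sin(heading)
        # Express the error in the segment frame.
        along = c * err_x + s * err_y
        lateral = -s * err_x + c * err_y
        # Only the lateral component is reduced by steering.
        lateral *= shrink
        # Rotate back to the world frame.
        err_x = c * along - s * lateral
        err_y = s * along + c * lateral
    return err_x, err_y

if __name__ == "__main__":
    square = [0.0, math.pi / 2, math.pi, 3 * math.pi / 2]
    print(repeat_lap(1.0, 1.0, square))     # both components shrink
    print(repeat_lap(1.0, 1.0, [0.0] * 4))  # along-path error stays
```

Under these assumptions, one lap of a square path reduces an initial error of (1, 1) to about (0.25, 0.25), because each error component is lateral on some segment. On a perfectly straight path the along-path component stays at 1, which suggests why changes of direction matter in this toy model.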
Coronary Artery Segmentation and Motion Modelling
Conventional coronary artery bypass surgery requires invasive sternotomy and the
use of a cardiopulmonary bypass, which leads to a long recovery period and a high
risk of infection. Totally endoscopic coronary artery bypass (TECAB) surgery
based on image-guided robotic surgical approaches has been developed to allow
clinicians to conduct the bypass surgery off-pump with only three pinhole incisions
in the chest cavity, through which two robotic arms and one stereo endoscopic camera
are inserted. However, the restricted field of view of the stereo endoscopic images leads
to possible vessel misidentification and coronary artery mis-localization. This results
in 20-30% conversion rates from TECAB surgery to the conventional approach.
We have constructed patient-specific 3D + time coronary artery and left ventricle
motion models from preoperative 4D Computed Tomography Angiography (CTA)
scans. Through temporally and spatially aligning this model with the intraoperative
endoscopic views of the patient's beating heart, this work helps the surgeon identify
and locate the correct coronaries during TECAB procedures. Thus this work has
the prospect of reducing the conversion rate from TECAB to conventional coronary
bypass procedures.
This thesis mainly focuses on designing segmentation and motion tracking methods
of the coronary arteries in order to build pre-operative patient-specific motion models.
Various vessel centreline extraction and lumen segmentation algorithms are presented,
including intensity-based approaches, a geometric model matching method and a
morphology-based method. A probabilistic atlas of the coronary arteries is formed
from a group of subjects to facilitate the vascular segmentation and registration procedures.
Non-rigid registration frameworks based on a free-form deformation model
and multi-level multi-channel large deformation diffeomorphic metric mapping are
proposed to track the coronary motion. The methods are applied to 4D CTA images
acquired from various groups of patients and quantitatively evaluated.
Optical flow estimation via steered-L1 norm
Global variational methods for estimating optical flow are among the best-performing methods because of the subpixel accuracy and the ‘fill-in’ effect they provide. The fill-in effect allows optical flow displacements to be estimated even in low-textured and untextured areas of the image. The estimation of such displacements is induced by the smoothness term. The L1 norm provides a robust regularisation term for the optical flow energy function and preserves edges very well. However, this norm suffers from several issues, among them its isotropic nature, which reduces the fill-in effect and eventually the accuracy of estimation in areas near motion boundaries. In this paper we propose an enhancement to the L1 norm that improves the fill-in effect for this smoothness term. To do this, we analyse the structure tensor matrix and use its eigenvectors to steer the smoothness term into components that are ‘orthogonal to’ and ‘aligned with’ image structures. This is done in a primal-dual formulation. Results show a reduced end-point error and improved accuracy compared to the conventional L1 norm.
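The eigenvector analysis of the structure tensor can be sketched as follows. This covers only the decomposition step, under the assumption of a 3x3 box filter for smoothing (the abstract does not state which filter is used). The steered smoothness term and the primal-dual solver are not reproduced. The function names are illustrative.

```python
import numpy as np

def box_blur(a):
    """3x3 box filter with edge padding (an assumed smoothing choice)."""
    p = np.pad(a, 1, mode="edge")
    h, w = a.shape
    return sum(p[i:i + h, j:j + w] for i in range(3) for j in range(3)) / 9.0

def structure_tensor_directions(image):
    """Return per-pixel unit vectors (x, y) across and along image structures."""
    iy, ix = np.gradient(image.astype(float))  # derivatives along rows, columns
    jxx, jxy, jyy = box_blur(ix * ix), box_blur(ix * iy), box_blur(iy * iy)
    # Assemble a (H, W, 2, 2) field of symmetric tensors.
    tensor = np.stack(
        [np.stack([jxx, jxy], axis=-1), np.stack([jxy, jyy], axis=-1)], axis=-2
    )
    # eigh returns eigenvalues in ascending order, eigenvectors as columns.
    _, vecs = np.linalg.eigh(tensor)
    across = vecs[..., :, 1]  # largest eigenvalue: orthogonal to the structure
    along = vecs[..., :, 0]   # smallest eigenvalue: aligned with the structure
    return across, along
```

For an image with a vertical edge, `across` at an edge pixel points along the x axis and `along` points along the y axis, up to sign. In flat regions the tensor is zero and the directions are arbitrary.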
On morphological hierarchical representations for image processing and spatial data clustering
Hierarchical data representations in the context of classification and data
clustering were put forward during the fifties. Recently, hierarchical image
representations have gained renewed interest for segmentation purposes. In this
paper, we briefly survey fundamental results on hierarchical clustering and
then detail recent paradigms developed for the hierarchical representation of
images in the framework of mathematical morphology: constrained connectivity
and ultrametric watersheds. Constrained connectivity can be viewed as a way to
constrain an initial hierarchy in such a way that a set of desired constraints
are satisfied. The framework of ultrametric watersheds provides a generic scheme
for computing any hierarchical connected clustering, in particular when such a
hierarchy is constrained. The suitability of this framework for solving
practical problems is illustrated with applications in remote sensing.
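As a small illustration of what a hierarchy of clusterings means, the sketch below uses plain single-linkage clustering on scalar values. It is a stand-in for the classical hierarchical clustering that the abstract surveys, not an implementation of constrained connectivity or ultrametric watersheds. The function name and the sample data are hypothetical.

```python
from itertools import combinations

def single_linkage_partition(values, threshold):
    """Group values whose chain of pairwise gaps never exceeds `threshold`."""
    parent = list(range(len(values)))

    def find(i):
        # Union-find root lookup with path halving.
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    # Merge every pair that is close enough; transitivity gives the clusters.
    for i, j in combinations(range(len(values)), 2):
        if abs(values[i] - values[j]) <= threshold:
            parent[find(i)] = find(j)

    groups = {}
    for i, value in enumerate(values):
        groups.setdefault(find(i), []).append(value)
    return sorted(sorted(group) for group in groups.values())

if __name__ == "__main__":
    data = [0, 1, 2, 10, 11, 30]
    for level in (0, 1, 8, 19):
        print(level, single_linkage_partition(data, level))
```

Raising the threshold only ever merges clusters, so the partitions at increasing levels are nested. That nesting is the property that makes the result a hierarchy.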