Cognitive visual tracking and camera control
Cognitive visual tracking is the process of observing and understanding the behaviour of a moving person. This paper presents an efficient solution to extract, in real time, high-level information from an observed scene and generate the most appropriate commands for a set of pan-tilt-zoom (PTZ) cameras in a surveillance scenario. Such a high-level feedback control loop, which is the main novelty of our work, serves to reduce uncertainties in the observed scene and to maximize the amount of information extracted from it. It is implemented with a distributed camera system using SQL tables as virtual communication channels, and Situation Graph Trees for knowledge representation, inference and high-level camera control. A set of experiments in a surveillance scenario shows the effectiveness of our approach and its potential for real applications of cognitive vision.
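The high-level feedback loop lends itself to a compact illustration. The following is a minimal Python sketch, assuming an in-memory SQLite database as the virtual communication channel between a tracker and the PTZ controller; the table names (observations, ptz_commands), the centring rule, and the zoom heuristic are illustrative assumptions, not the paper's actual implementation (which additionally uses Situation Graph Trees for inference).

```python
# Minimal sketch of the high-level feedback loop, assuming SQLite as the
# virtual communication channel. Schema and control rules are illustrative.
import sqlite3

db = sqlite3.connect(":memory:")
db.executescript("""
    CREATE TABLE observations (cam_id INTEGER, target_x REAL, target_y REAL,
                               target_size REAL);
    CREATE TABLE ptz_commands (cam_id INTEGER, pan REAL, tilt REAL, zoom REAL);
""")

def control_step(cam_id, frame_w=640, frame_h=480):
    """Read the newest observation for a camera and emit a PTZ command that
    centres the target; zoom in when the target is small (high uncertainty)."""
    row = db.execute(
        "SELECT target_x, target_y, target_size FROM observations "
        "WHERE cam_id = ? ORDER BY rowid DESC LIMIT 1", (cam_id,)).fetchone()
    if row is None:
        return
    x, y, size = row
    pan = (x - frame_w / 2) / frame_w      # normalised pan offset
    tilt = (y - frame_h / 2) / frame_h     # normalised tilt offset
    zoom = 1.5 if size < 0.05 * frame_w else 1.0
    db.execute("INSERT INTO ptz_commands VALUES (?, ?, ?, ?)",
               (cam_id, pan, tilt, zoom))

# A tracker process would insert observations; here we simulate one.
db.execute("INSERT INTO observations VALUES (1, 500, 260, 20)")
control_step(1)
print(db.execute("SELECT * FROM ptz_commands").fetchall())
```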
Unsupervised Adversarial Depth Estimation using Cycled Generative Networks
While recent deep monocular depth estimation approaches based on supervised regression have achieved remarkable performance, costly ground-truth annotations are required during training. To cope with this issue, in this paper we present a novel unsupervised deep learning approach for predicting depth maps, and show that the depth estimation task can be effectively tackled within an adversarial learning framework. Specifically, we propose a deep generative network that learns to predict the correspondence field, i.e. the disparity map, between two image views in a calibrated stereo camera setting. The proposed architecture consists of two generative sub-networks, jointly trained with adversarial learning to reconstruct the disparity map and organized in a cycle so as to provide mutual constraints and supervision to each other. Extensive experiments on the publicly available KITTI and Cityscapes datasets demonstrate the effectiveness of the proposed model, with results competitive with state-of-the-art methods. The code and trained model are available at https://github.com/andrea-pilzer/unsup-stereo-depthGAN.
Comment: To appear in 3DV 2018. Code is available on GitHub.
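To make the cycled structure concrete, here is a compact PyTorch sketch of the core idea: one sub-network predicts a disparity map from the left view, the warped image feeds a second sub-network that closes the cycle back to the left view, and a discriminator provides the adversarial signal. The network sizes, warping convention, and loss weights are illustrative assumptions and do not reproduce the architecture of the paper.

```python
# Sketch of cycle-constrained adversarial disparity learning on rectified
# stereo pairs. Architectures and loss weights are illustrative only.
import torch
import torch.nn as nn

def conv_net(in_ch, out_ch):
    return nn.Sequential(
        nn.Conv2d(in_ch, 32, 3, padding=1), nn.ReLU(),
        nn.Conv2d(32, out_ch, 3, padding=1))

G_l2r = conv_net(3, 1)   # predicts disparity from the left view
G_r2l = conv_net(3, 1)   # predicts disparity from the synthesised right view
D = nn.Sequential(conv_net(3, 1), nn.AdaptiveAvgPool2d(1),
                  nn.Flatten(), nn.Sigmoid())  # real/fake discriminator

def warp(img, disp):
    """Horizontally warp img by a per-pixel disparity via bilinear sampling."""
    b, _, h, w = img.shape
    ys, xs = torch.meshgrid(torch.linspace(-1, 1, h),
                            torch.linspace(-1, 1, w), indexing="ij")
    grid = torch.stack((xs.expand(b, -1, -1) + 2 * disp.squeeze(1) / w,
                        ys.expand(b, -1, -1)), dim=-1)
    return nn.functional.grid_sample(img, grid, align_corners=True)

left, right = torch.rand(2, 1, 3, 64, 64)
disp_r = G_l2r(left)                 # half-cycle: synthesise the right view
fake_right = warp(left, disp_r)
disp_l = G_r2l(fake_right)           # second half-cycle back to the left view
recon_left = warp(fake_right, -disp_l)

# The two half-cycles supervise each other through the reconstruction terms,
# while the adversarial term pushes synthesised views toward realism.
rec_loss = (recon_left - left).abs().mean() + (fake_right - right).abs().mean()
adv_loss = -torch.log(D(fake_right) + 1e-8).mean()
loss = rec_loss + 0.1 * adv_loss
loss.backward()
```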
SurfelMeshing: Online Surfel-Based Mesh Reconstruction
We address the problem of mesh reconstruction from live RGB-D video, assuming a calibrated camera and poses provided externally (e.g., by a SLAM system). In contrast to most existing approaches, we fuse depth measurements not into a volume but into a dense surfel cloud. We asynchronously (re)triangulate the smoothed surfels to reconstruct a surface mesh. This novel approach makes it possible to maintain a dense surface representation of the scene during SLAM which can quickly adapt to loop closures. This is achieved by deforming the surfel cloud and asynchronously remeshing the surface where necessary. The surfel-based representation also naturally supports strongly varying scan resolution; in particular, it reconstructs colors at the input camera's resolution. Moreover, in contrast to many volumetric approaches, ours can reconstruct thin objects, since objects do not need to enclose a volume. We demonstrate our approach in a number of experiments, showing that it produces reconstructions competitive with the state of the art, and we discuss its advantages and limitations. The algorithm (excluding loop closure functionality) is available as open source at https://github.com/puzzlepaint/surfelmeshing.
Comment: Version accepted to IEEE Transactions on Pattern Analysis and Machine Intelligence.
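The key data structure is easy to sketch. Below is a minimal Python illustration of fusing depth measurements into surfels rather than a voxel volume, with a dirty flag standing in for the asynchronous remeshing trigger; the field choices follow the abstract loosely, and the confidence-weighted averaging rule is an assumption rather than the paper's exact update.

```python
# Minimal surfel-cloud sketch: per-point surfels instead of a voxel volume.
# Fields follow the abstract loosely; the fusion rule is an assumption.
from dataclasses import dataclass
import numpy as np

@dataclass
class Surfel:
    position: np.ndarray    # 3D point, smoothed over observations
    normal: np.ndarray      # estimated surface normal
    radius: float           # spatial extent of the surfel
    color: np.ndarray       # RGB at the input camera's resolution
    confidence: float = 1.0
    dirty: bool = True      # marks the surfel for asynchronous remeshing

def fuse(surfel: Surfel, pos, normal, color) -> None:
    """Fuse a new depth measurement into an existing surfel by a
    confidence-weighted running average, then flag it for remeshing."""
    w = surfel.confidence
    surfel.position = (w * surfel.position + pos) / (w + 1)
    surfel.normal = w * surfel.normal + normal
    surfel.normal /= np.linalg.norm(surfel.normal)
    surfel.color = (w * surfel.color + color) / (w + 1)
    surfel.confidence += 1.0
    surfel.dirty = True     # a remeshing thread re-triangulates dirty regions

s = Surfel(np.zeros(3), np.array([0., 0., 1.]), 0.01,
           np.array([128., 128., 128.]))
fuse(s, np.array([0.01, 0., 0.]), np.array([0., 0.1, 1.]),
     np.array([130., 120., 125.]))
print(s.position, s.confidence)
```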
Swept source optical coherence tomography Gabor fusion splicing technique for microscopy of thick samples using a deformable mirror
We present a swept source optical coherence tomography (OCT) system at 1060 nm equipped with a wavefront sensor at 830 nm and a deformable mirror in a closed-loop adaptive optics (AO) system. Due to the AO correction, the confocal profile of the interface optics becomes narrower than the OCT axial range, restricting the part of the B-scan (cross section) with good contrast. By actuating on the deformable mirror, the depth of the focus is changed, and the system is used to demonstrate Gabor filtering in order to produce B-scan OCT images with enhanced sensitivity throughout the axial range from a Drosophila larva. The focus adjustment is achieved by manipulating the curvature of the deformable mirror between two user-defined limits. Particularities of controlling the focus for Gabor filtering using the deformable mirror are presented. © 2015 Society of Photo-Optical Instrumentation Engineers.
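As a rough illustration of the fusion step (not the paper's implementation), the following Python sketch splices B-scans acquired at different focus settings by weighting each with a Gaussian window centred at its focus depth; the window width, the number of focus positions, and the normalisation are assumed parameters.

```python
# Illustrative Gabor-fusion splicing: each B-scan contributes mainly the
# axial band around its own focus depth. All parameters are assumptions.
import numpy as np

def gabor_fuse(bscans, focus_depths, sigma=40.0):
    """bscans: list of (depth, lateral) arrays, one per focus setting.
    Weight each B-scan by a Gaussian window centred at its focus depth,
    then normalise the weighted sum."""
    n_depth = bscans[0].shape[0]
    z = np.arange(n_depth)[:, None]
    fused = np.zeros_like(bscans[0], dtype=float)
    weight_sum = np.zeros((n_depth, 1))
    for bscan, zf in zip(bscans, focus_depths):
        w = np.exp(-((z - zf) ** 2) / (2 * sigma ** 2))
        fused += w * bscan
        weight_sum += w
    return fused / weight_sum

# Two focus settings (e.g., the two user-defined mirror limits), placed at
# 25% and 75% of the axial range of a 512-pixel-deep B-scan.
scans = [np.random.rand(512, 256), np.random.rand(512, 256)]
fused = gabor_fuse(scans, focus_depths=[128, 384])
```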
UG^2: a Video Benchmark for Assessing the Impact of Image Restoration and Enhancement on Automatic Visual Recognition
Advances in image restoration and enhancement techniques have led to discussion about how such algorithms can be applied as a pre-processing step to improve automatic visual recognition. In principle, techniques like deblurring and super-resolution should yield improvements by de-emphasizing noise and increasing signal in an input image. But the historically divergent goals of the computational photography and visual recognition communities have created a significant need for more work in this direction. To facilitate new research, we introduce a new benchmark dataset called UG^2, which contains three difficult real-world scenarios: uncontrolled videos taken by UAVs and manned gliders, as well as controlled videos taken on the ground. Over 160,000 annotated frames for hundreds of ImageNet classes are available, and are used for baseline experiments that assess the impact of known and unknown image artifacts and other conditions on common deep learning-based object classification approaches. Further, current image restoration and enhancement techniques are evaluated by determining whether or not they improve baseline classification performance. Results show that there is plenty of room for algorithmic innovation, making this dataset a useful tool going forward.
Comment: Supplemental material: https://goo.gl/vVM1xe, Dataset: https://goo.gl/AjA6En, CVPR 2018 Prize Challenge: ug2challenge.org
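The evaluation protocol is simple to sketch: classify frames with and without a restoration step and compare accuracy. The Python sketch below uses an untrained torchvision ResNet-18 and a sharpening filter purely as placeholders; the actual UG^2 baselines, restoration methods, and metrics differ.

```python
# Sketch of the benchmark protocol: compare classification accuracy with
# and without a restoration pre-processing step. Model and restoration
# function are placeholders, not the UG^2 baselines.
import torch
import torchvision.models as models
import torchvision.transforms.functional as TF

classifier = models.resnet18(weights=None).eval()

def restore(frame):
    # Placeholder enhancement; a real study would plug in deblurring,
    # super-resolution, etc.
    return TF.adjust_sharpness(frame, sharpness_factor=2.0)

def topk_accuracy(frames, labels, k=5, preprocess=None):
    correct = 0
    with torch.no_grad():
        for frame, label in zip(frames, labels):
            if preprocess is not None:
                frame = preprocess(frame)
            logits = classifier(frame.unsqueeze(0))
            correct += int(label in logits.topk(k).indices)
    return correct / len(frames)

frames = [torch.rand(3, 224, 224) for _ in range(4)]  # stand-in video frames
labels = [1, 2, 3, 4]
baseline = topk_accuracy(frames, labels)
enhanced = topk_accuracy(frames, labels, preprocess=restore)
print(f"top-5: baseline {baseline:.2f} vs restored {enhanced:.2f}")
```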