Search CORE

541 research outputs found

Shape Representations Using Nested Descriptors

Author: Byrne Jeffrey
Publication venue: ScholarlyCommons
Publication date: 01/01/2014
Field of study

The problem of shape representation is a core problem in computer vision. It can be argued that shape representation is the most central representational problem for computer vision, since unlike texture or color, shape alone can be used for perceptual tasks such as image matching, object detection and object categorization. This dissertation introduces a new shape representation called the nested descriptor. A nested descriptor represents shape both globally and locally by pooling salient scaled and oriented complex gradients in a large nested support set. We show that this nesting property introduces a nested correlation structure that enables a new local distance function called the nesting distance, which provides a provably robust similarity function for image matching. Furthermore, the nesting property suggests an elegant flower like normalization strategy called a log-spiral difference. We show that this normalization enables a compact binary representation and is equivalent to a form a bottom up saliency. This suggests that the nested descriptor representational power is due to representing salient edges, which makes a fundamental connection between the saliency and local feature descriptor literature. In this dissertation, we introduce three examples of shape representation using nested descriptors: nested shape descriptors for imagery, nested motion descriptors for video and nested pooling for activities. We show evaluation results for these representations that demonstrate state-of-the-art performance for image matching, wide baseline stereo and activity recognition tasks

ScholarlyCommons@Penn

Evaluation of SIFT on Android-based mobile devices and personal computers

Author: Cornelius Ian
Publication venue
Publication date: 01/01/2014
Field of study

Coventry University Pure Portal

A Spatio-Temporal Multi-Scale Binary Descriptor

Author: Cavallaro A
Lanz O
Xompero A
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2020
Field of study

Binary descriptors are widely used for multi-view matching and robotic navigation. However, their matching performance decreases considerably under severe scale and viewpoint changes in non-planar scenes. To overcome this problem, we propose to encode the varying appearance of selected 3D scene points tracked by a moving camera with compact spatio-temporal descriptors. To this end, we first track interest points and capture their temporal variations at multiple scales. Then, we validate feature tracks through 3D reconstruction and compress the temporal sequence of descriptors by encoding the most frequent and stable binary values. Finally, we determine multi-scale correspondences across views with a matching strategy that handles severe scale differences. The proposed spatio-temporal multi-scale approach is generic and can be used with a variety of binary descriptors. We show the effectiveness of the joint multi-scale extraction and temporal reduction through comparisons of different temporal reduction strategies and the application to several binary descriptors

Archivio della ricerca - Fondazione Bruno Kessler

Queen Mary Research Online

Semantic Segmentation of Terrain and Road Terrain for Advanced Driver Assistance Systems

Author: Gheorghe Ionut Valentin
Publication venue
Publication date: 01/01/2015
Field of study

Coventry University Pure Portal

A survey on compact features for visual content analysis

Author: Baroffio Luca
Redondi Alessandro E. C
Tagliasacchi Marco
Tubaro Stefano
Publication venue: 'Cambridge University Press (CUP)'
Publication date: 01/01/2016
Field of study

Archivio istituzionale della ricerca - Politecnico di Milano

A Unified framework for local visual descriptors evaluation

Author: Gosselin Philippe-Henri
Kihl Olivier
Picard David
Publication venue: 'Elsevier BV'
Publication date: 01/04/2015
Field of study

International audienceLocal descriptors are the ground layer of recognition feature based systems for still images and video. We propose a new framework to explain local descriptors. This framework is based on the descriptors decomposition in three levels: primitive extraction, primitive coding and code aggregation. With this framework, we are able to explain most of the popular descriptors in the literature such as HOG, HOF, SURF. We propose two new projection methods based on approximation with oscillating functions basis (sinus and Legendre polynomials). Using our framework, we are able to extend usual descriptors by changing the code aggregation or adding new primitive coding method. The experiments are carried out on images (VOC 2007) and videos datasets (KTH, Hollywood2 and UCF11), and achieve equal or better performances than the literature

INRIA a CCSD electronic archive server

HAL-Rennes 1

Towards Plug-n-Play robot guidance: Advanced 3D estimation and pose estimation in Robotic applications

Author: Sølund Thomas
Publication venue: Technical University of Denmark
Publication date: 01/01/2017
Field of study

Online Research Database In Technology