Search CORE

18,594 research outputs found

A Framework for Symmetric Part Detection in Cluttered Scenes

Author: Dickinson Sven
Fidler Sanja
Lee Tom
Levinshtein Alex
Sminchisescu Cristian
Publication venue
Publication date: 05/02/2015
Field of study

The role of symmetry in computer vision has waxed and waned in importance during the evolution of the field from its earliest days. At first figuring prominently in support of bottom-up indexing, it fell out of favor as shape gave way to appearance and recognition gave way to detection. With a strong prior in the form of a target object, the role of the weaker priors offered by perceptual grouping was greatly diminished. However, as the field returns to the problem of recognition from a large database, the bottom-up recovery of the parts that make up the objects in a cluttered scene is critical for their recognition. The medial axis community has long exploited the ubiquitous regularity of symmetry as a basis for the decomposition of a closed contour into medial parts. However, today's recognition systems are faced with cluttered scenes, and the assumption that a closed contour exists, i.e. that figure-ground segmentation has been solved, renders much of the medial axis community's work inapplicable. In this article, we review a computational framework, previously reported in Lee et al. (2013), Levinshtein et al. (2009, 2013), that bridges the representation power of the medial axis and the need to recover and group an object's parts in a cluttered scene. Our framework is rooted in the idea that a maximally inscribed disc, the building block of a medial axis, can be modeled as a compact superpixel in the image. We evaluate the method on images of cluttered scenes.Comment: 10 pages, 8 figure

arXiv.org e-Print Archive

Multidisciplinary Digital Publishing Institute

CiteSeerX

Directory of Open Access Journals

Rotation-invariant features for multi-oriented text detection in natural images.

Author: Bai Xiang
Liu Wenyu
Ma Yi
Tu Zhuowen
Yao Cong
Zhang Xin
Publication venue: eScholarship, University of California
Publication date: 01/01/2013
Field of study

Texts in natural scenes carry rich semantic information, which can be used to assist a wide range of applications, such as object recognition, image/video retrieval, mapping/navigation, and human computer interaction. However, most existing systems are designed to detect and recognize horizontal (or near-horizontal) texts. Due to the increasing popularity of mobile-computing devices and applications, detecting texts of varying orientations from natural images under less controlled conditions has become an important but challenging task. In this paper, we propose a new algorithm to detect texts of varying orientations. Our algorithm is based on a two-level classification scheme and two sets of features specially designed for capturing the intrinsic characteristics of texts. To better evaluate the proposed method and compare it with the competing algorithms, we generate a comprehensive dataset with various types of texts in diverse real-world scenes. We also propose a new evaluation protocol, which is more suitable for benchmarking algorithms for detecting texts in varying orientations. Experiments on benchmark datasets demonstrate that our system compares favorably with the state-of-the-art algorithms when handling horizontal texts and achieves significantly enhanced performance on variant texts in complex natural scenes

Directory of Open Access Journals

PubMed Central

eScholarship - University of California

Object tracking using log-polar transformation

Author: Thunuguntla Saikiran Sri
Publication venue: LSU Digital Commons
Publication date: 01/01/2005
Field of study

In this thesis, we use log-polar transform to solve object tracking. Object tracking in video sequences is a fundamental problem in computer vision. Even though object tracking is being studied extensively, still some challenges need to be addressed, such as appearance variations, large scale and rotation variations, and occlusion. We implemented a novel tracking algorithm which works robustly in the presence of large scale changes, rotation, occlusion, illumination changes, perspective transformations and some appearance changes. Log-polar transformation is used to achieve robustness to scale and rotation. Our object tracking approach is based on template matching technique. Template matching is based on extracting an example image, template, of an object in first frame, and then finding the region which best suites this template in the subsequent frames. In template matching, we implemented a fixed template algorithm and a template update algorithm. In the fixed template algorithm we use same template for the entire image sequence, where as in the template update algorithm the template is updated according to the changes in object image. The fixed template algorithm is faster; the template update algorithm is more robust to appearance changes in the object being tracked. The proposed object tracking is highly robust to scale, rotation, illumination changes and occlusion with good implementation speed

Louisiana State University

A multi-scale, multi-wavelength source extraction method: getsources

Author: A. Men’shchikov
Alves
André
Arzoumanian
Bertin
Bontemps
di Francesco
Elmegreen
F. Motte
Falgarone
Gong
Goodman
Griffin
Hennemann
Johnstone
Kainulainen
Kennicutt
Könyves
Lagache
Larson
M. Hennemann
Maury
Men’shchikov
Miville-Deschênes
Moffat
Molinari
Motte
Motte
Motte
Motte
Motte
Myers
N. Schneider
P. Didelon
Ph. André
Pilbratt
Poglitsch
Rosolowsky
Roy
Schneider
Schneider
Stutzki
Williams
Publication venue: 'EDP Sciences'
Publication date: 19/04/2012
Field of study

We present a multi-scale, multi-wavelength source extraction algorithm called getsources. Although it has been designed primarily for use in the far-infrared surveys of Galactic star-forming regions with Herschel, the method can be applied to many other astronomical images. Instead of the traditional approach of extracting sources in the observed images, the new method analyzes fine spatial decompositions of original images across a wide range of scales and across all wavebands. It cleans those single-scale images of noise and background, and constructs wavelength-independent single-scale detection images that preserve information in both spatial and wavelength dimensions. Sources are detected in the combined detection images by following the evolution of their segmentation masks across all spatial scales. Measurements of the source properties are done in the original background-subtracted images at each wavelength; the background is estimated by interpolation under the source footprints and overlapping sources are deblended in an iterative procedure. In addition to the main catalog of sources, various catalogs and images are produced that aid scientific exploitation of the extraction results. We illustrate the performance of getsources on Herschel images by extracting sources in sub-fields of the Aquila and Rosette star-forming regions. The source extraction code and validation images with a reference extraction catalog are freely available.Comment: 31 pages, 27 figures, to be published in Astronomy & Astrophysic

arXiv.org e-Print Archive

Crossref

EDP Sciences OAI-PMH repository (1.2.0)

Grounding semantics in robots for Visual Question Answering

Author: Wahle Björn
Publication venue: Universitat Politècnica de Catalunya
Publication date: 01/01/2019
Field of study

In this thesis I describe an operational implementation of an object detection and description system that incorporates in an end-to-end Visual Question Answering system and evaluated it on two visual question answering datasets for compositional language and elementary visual reasoning

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

UPCommons. Portal del coneixement obert de la UPC

New Types of Keypoints for Detecting Known Objects in Visual Search Tasks

Author: Andrzej Sluzek
Saiful Islam
Publication venue: 'IntechOpen'
Publication date: 01/06/2007
Field of study

IntechOpen

Crossref