9 research outputs found

Optic Flow Statistics and Intrinsic Dimensionality

Author: Dirk Calow
Florentin Wörgötter
Markus Lappe
Michael Felsberg
Norbert Krüger
Sinan Kalkan
Publication date: 01/01/2004

Different kinds of visual sub-structures can be distinguished by the intrinsic dimensionality of the local signals. The concept of intrinsic dimensionality has mostly been applied using discrete formulations. Recent work (Krüger and Felsberg, 2003; Felsberg and Krüger, 2003) introduced a continuous definition and showed that the inherent structure of the intrinsic dimensionality has essentially the form of a triangle. The current work analyzes the distribution of signals according to the continuous interpretation of intrinsic dimensionality and its relation to the orientation and optic flow features of image patches. Among other things, we give a quantitative interpretation of the distribution of signals according to their intrinsic dimensionality that reveals specific patterns associated with established sub-structures in computer vision. Furthermore, we link quantitative and qualitative properties of the distribution of optic-flow error estimates to these patterns.

CiteSeerX

VBN

The Southampton-York Natural Scenes (SYNS) dataset: statistics of surface attitude

Author: Adams Wendy J.
Elder James H.
Graf Erich W.
Leyland Julian
Lugtigheid Arthur J.
Muryy Alexander
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 26/10/2016

Recovering 3D scenes from 2D images is an under-constrained task; optimal estimation depends upon knowledge of the underlying scene statistics. Here we introduce the Southampton-York Natural Scenes dataset (SYNS: https://syns.soton.ac.uk), which provides comprehensive scene statistics useful for understanding biological vision and for improving machine vision systems. In order to capture the diversity of environments that humans encounter, scenes were surveyed at random locations within 25 indoor and outdoor categories. Each survey includes (i) spherical LiDAR range data, (ii) high-dynamic range spherical imagery, and (iii) a panorama of stereo image pairs. We envisage many uses for the dataset and present one example: an analysis of surface attitude statistics, conditioned on scene category and viewing elevation. Surface normals were estimated using a novel adaptive scale selection algorithm. Across categories, surface attitude below the horizon is dominated by the ground plane (0° tilt). Near the horizon, probability density is elevated at 90°/270° tilt due to vertical surfaces (trees, walls). Above the horizon, probability density is elevated near 0° slant due to overhead structure such as ceilings and leaf canopies. These structural regularities represent potentially useful prior assumptions for human and machine observers, and may predict human biases in perceived surface attitude.

Southampton (e-Prints Soton)

Crossref

PubMed Central

Disambiguating Multi–Modal Scene Representations Using Perceptual Grouping Constraints

Author: A Baumberg
A Sha'ashua
A Verri
C Harris
C Schmid
D Crevier
D Field
D Kraft
D Lowe
D Scharstein
E Baseski
E Brunswik
F Schaffalitzky
Florentin Wörgötter
HH Nagel
J Elder
J Koenderink
J Mayhew
J Rodrigues
J Shi
K Koffka
K Köhler
K Mikolajczyk
L van Gool
L Wolff
M Brown
M Felsberg
M Oram
M Popović
N Kim
N Krüger
N Pugeault
Nicolas Pugeault
Norbert Krüger
O Faugeras
P Kovesi
P König
P Parent
P Perona
R Chung
R Hartley
R Horaud
R Mohan
S Geman
S Sarkar
S Se
SH Lee
Teresa Serrano-Gotarredona
W Freeman
W Geisler
Y Aloimonos
Y Ohta
Publication venue: Public Library of Science
Publication date: 01/01/2010

In its early stages, the visual system suffers from considerable ambiguity and noise that severely limit the performance of early vision algorithms. This article presents feedback mechanisms between early visual processes, such as perceptual grouping, stereopsis and depth reconstruction, that allow the system to reduce this ambiguity and improve the early representation of visual information. In the first part, the article proposes a local perceptual grouping algorithm that, in addition to commonly used geometric information, makes use of a novel multi-modal measure between local edge/line features. The grouping information is then used to: 1) disambiguate stereopsis by enforcing that stereo matches preserve groups; and 2) correct the reconstruction error due to the image pixel sampling using a linear interpolation over the groups. The integration of mutual feedback between early vision processes is shown to considerably reduce ambiguity and noise without the need for global constraints.

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Open Research Exeter

GoeScholar The Publication Server of the Georg-August-Universität Göttingen

Enlighten

University of Southern Denmark Research Output

Surrey Research Insight

From receptive profiles to a metric model of V1

Author: Citti G.
Montobbio N.
Sarti A.
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2019

In this work we show how to construct connectivity kernels induced by the receptive profiles of simple cells of the primary visual cortex (V1). These kernels are directly defined by the shape of such profiles: this provides a metric model for the functional architecture of V1, whose global geometry is determined by the reciprocal interactions between local elements. Our construction adapts to any bank of filters chosen to represent a set of receptive profiles, since it does not require any structure on the parameterization of the family. The connectivity kernel that we define carries a geometrical structure consistent with the well-known properties of long-range horizontal connections in V1, and it is compatible with the perceptual rules synthesized by the concept of the association field. These characteristics are still present when the kernel is constructed from a bank of filters arising from an unsupervised learning algorithm.

Archivio istituzionale della ricerca - Alma Mater Studiorum Università di Bologna

Invariant models of vision between phenomenology, image statistics and neurosciences

Author: Sanguinetti Gonzalo

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas