10,668 research outputs found
Idealized computational models for auditory receptive fields
This paper presents a theory by which idealized models of auditory receptive
fields can be derived in a principled axiomatic manner, from a set of
structural properties to enable invariance of receptive field responses under
natural sound transformations and ensure internal consistency between
spectro-temporal receptive fields at different temporal and spectral scales.
For defining a time-frequency transformation of a purely temporal sound
signal, it is shown that the framework allows for a new way of deriving the
Gabor and Gammatone filters as well as a novel family of generalized Gammatone
filters, with additional degrees of freedom to obtain different trade-offs
between the spectral selectivity and the temporal delay of time-causal temporal
window functions.
When applied to the definition of a second-layer of receptive fields from a
spectrogram, it is shown that the framework leads to two canonical families of
spectro-temporal receptive fields, in terms of spectro-temporal derivatives of
either spectro-temporal Gaussian kernels for non-causal time or the combination
of a time-causal generalized Gammatone filter over the temporal domain and a
Gaussian filter over the logspectral domain. For each filter family, the
spectro-temporal receptive fields can be either separable over the
time-frequency domain or be adapted to local glissando transformations that
represent variations in logarithmic frequencies over time. Within each domain
of either non-causal or time-causal time, these receptive field families are
derived by uniqueness from the assumptions.
It is demonstrated how the presented framework allows for computation of
basic auditory features for audio processing and that it leads to predictions
about auditory receptive fields with good qualitative similarity to biological
receptive fields measured in the inferior colliculus (ICC) and primary auditory
cortex (A1) of mammals.Comment: 55 pages, 22 figures, 3 table
Scale Invariant Interest Points with Shearlets
Shearlets are a relatively new directional multi-scale framework for signal
analysis, which have been shown effective to enhance signal discontinuities
such as edges and corners at multiple scales. In this work we address the
problem of detecting and describing blob-like features in the shearlets
framework. We derive a measure which is very effective for blob detection and
closely related to the Laplacian of Gaussian. We demonstrate the measure
satisfies the perfect scale invariance property in the continuous case. In the
discrete setting, we derive algorithms for blob detection and keypoint
description. Finally, we provide qualitative justifications of our findings as
well as a quantitative evaluation on benchmark data. We also report an
experimental evidence that our method is very suitable to deal with compressed
and noisy images, thanks to the sparsity property of shearlets
Hyperbolic planforms in relation to visual edges and textures perception
We propose to use bifurcation theory and pattern formation as theoretical
probes for various hypotheses about the neural organization of the brain. This
allows us to make predictions about the kinds of patterns that should be
observed in the activity of real brains through, e.g. optical imaging, and
opens the door to the design of experiments to test these hypotheses. We study
the specific problem of visual edges and textures perception and suggest that
these features may be represented at the population level in the visual cortex
as a specific second-order tensor, the structure tensor, perhaps within a
hypercolumn. We then extend the classical ring model to this case and show that
its natural framework is the non-Euclidean hyperbolic geometry. This brings in
the beautiful structure of its group of isometries and certain of its subgroups
which have a direct interpretation in terms of the organization of the neural
populations that are assumed to encode the structure tensor. By studying the
bifurcations of the solutions of the structure tensor equations, the analog of
the classical Wilson and Cowan equations, under the assumption of invariance
with respect to the action of these subgroups, we predict the appearance of
characteristic patterns. These patterns can be described by what we call
hyperbolic or H-planforms that are reminiscent of Euclidean planar waves and of
the planforms that were used in [1, 2] to account for some visual
hallucinations. If these patterns could be observed through brain imaging
techniques they would reveal the built-in or acquired invariance of the neural
organization to the action of the corresponding subgroups.Comment: 34 pages, 11 figures, 2 table
- …