Search CORE

2,071 research outputs found

Sparse Codes for Speech Predict Spectrotemporal Receptive Fields in the Inferior Colliculus

Author: Carlson Nicole L.
DeWeese Michael R.
Ming Vivienne L.
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 12/07/2012
Field of study

We have developed a sparse mathematical representation of speech that minimizes the number of active model neurons needed to represent typical speech sounds. The model learns several well-known acoustic features of speech such as harmonic stacks, formants, onsets and terminations, but we also find more exotic structures in the spectrogram representation of sound such as localized checkerboard patterns and frequency-modulated excitatory subregions flanked by suppressive sidebands. Moreover, several of these novel features resemble neuronal receptive fields reported in the Inferior Colliculus (IC), as well as auditory thalamus and cortex, and our model neurons exhibit the same tradeoff in spectrotemporal resolution as has been observed in IC. To our knowledge, this is the first demonstration that receptive fields of neurons in the ascending mammalian auditory pathway beyond the auditory nerve can be predicted based on coding principles and the statistical properties of recorded sounds.Comment: For Supporting Information, see PLoS website: http://www.ploscompbiol.org/article/info%3Adoi%2F10.1371%2Fjournal.pcbi.100259

arXiv.org e-Print Archive

Directory of Open Access Journals

PubMed Central

FigShare

Analysis, Visualization, and Transformation of Audio Signals Using Dictionary-based Methods

Author: Aaron McLeran
Bob L. Sturm
Boyd S.
Curtis Roads
Gabor D.
Gersho A.
John J. Shynk
Mallat S.
Meyer C.
Roads C.
Roads C.
Sturm B. L.
Xenakis I.
Publication venue: 'Informa UK Limited'
Publication date: 17/12/2009
Field of study

date-added: 2014-01-07 09:15:58 +0000 date-modified: 2014-01-07 09:15:58 +0000date-added: 2014-01-07 09:15:58 +0000 date-modified: 2014-01-07 09:15:58 +000

Crossref

Queen Mary Research Online

Learning sparse representations of depth

Author: Culpepper Benjamin J.
Olshausen Bruno A.
Tosic Ivana
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 12/04/2011
Field of study

This paper introduces a new method for learning and inferring sparse representations of depth (disparity) maps. The proposed algorithm relaxes the usual assumption of the stationary noise model in sparse coding. This enables learning from data corrupted with spatially varying noise or uncertainty, typically obtained by laser range scanners or structured light depth cameras. Sparse representations are learned from the Middlebury database disparity maps and then exploited in a two-layer graphical model for inferring depth from stereo, by including a sparsity prior on the learned features. Since they capture higher-order dependencies in the depth structure, these priors can complement smoothness priors commonly used in depth inference based on Markov Random Field (MRF) models. Inference on the proposed graph is achieved using an alternating iterative optimization technique, where the first layer is solved using an existing MRF-based stereo matching algorithm, then held fixed as the second layer is solved using the proposed non-stationary sparse coding algorithm. This leads to a general method for improving solutions of state of the art MRF-based depth estimation algorithms. Our experimental results first show that depth inference using learned representations leads to state of the art denoising of depth maps obtained from laser range scanners and a time of flight camera. Furthermore, we show that adding sparse priors improves the results of two depth estimation methods: the classical graph cut algorithm by Boykov et al. and the more recent algorithm of Woodford et al.Comment: 12 page

arXiv.org e-Print Archive

Crossref