Search CORE

424 research outputs found

Role of homeostasis in learning sparse representations

Author: Baudot P.
Brüderle D.
Hebb D. O.
Laughlin S. B.
Laurent U. Perrinet
Lee H.
Mallat S.
Olshausen B. A.
Olshausen B. A.
Perrinet L.
Perrinet L.
Perrinet L.
Ranzato M. A.
Saito N.
Publication venue: 'MIT Press - Journals'
Publication date: 01/07/2010
Field of study

Neurons in the input layer of primary visual cortex in primates develop edge-like receptive fields. One approach to understanding the emergence of this response is to state that neural activity has to efficiently represent sensory data with respect to the statistics of natural scenes. Furthermore, it is believed that such an efficient coding is achieved using a competition across neurons so as to generate a sparse representation, that is, where a relatively small number of neurons are simultaneously active. Indeed, different models of sparse coding, coupled with Hebbian learning and homeostasis, have been proposed that successfully match the observed emergent response. However, the specific role of homeostasis in learning such sparse representations is still largely unknown. By quantitatively assessing the efficiency of the neural representation during learning, we derive a cooperative homeostasis mechanism that optimally tunes the competition between neurons within the sparse coding algorithm. We apply this homeostasis while learning small patches taken from natural images and compare its efficiency with state-of-the-art algorithms. Results show that while different sparse coding algorithms give similar coding results, the homeostasis provides an optimal balance for the representation of natural images within the population of neurons. Competition in sparse coding is optimized when it is fair. By contributing to optimizing statistical competition across neurons, homeostasis is crucial in providing a more efficient solution to the emergence of independent components

arXiv.org e-Print Archive

Model of visual attention for video sequences

Author: B Olshausen
M Milanova
Mariofanna Milanova
U Rutishauser
Publication venue: BioMed Central
Publication date: 01/01/2008
Field of study

Crossref

Springer - Publisher Connector

PubMed Central

Pattern recognition, attention, and information bottlenecks in the primate visual system

Author: Anderson C. H.
Gallant J. L.
Olshausen B.
Van Essen D. C.
Publication venue: Society of Photo-optical Instrumentation Engineers (SPIE)
Publication date: 09/07/1991
Field of study

In its evolution, the primate visual system has developed impressive capabilities for recognizing complex patterns in natural images. This process involves many stages of analysis and a variety of information processing strategies. This paper concentrates on the importance of 'information bottlenecks,' which restrict the amount of information that can be handled at different stages of analysis. These steps are crucial for reducing the overwhelming computational complexity associated with recognizing countless objects from arbitrary viewing angles, distances, and perspectives. The process of directed visual attention is an especially important information bottleneck because of its flexibility in determining how information is routed to high-level pattern recognition centers

Crossref

Caltech Authors

Riemannian Sparse Coding for Positive Definite Matrices

Author: B. Olshausen
E.G. Birgin
J. Wright
M.T. Harandi
O. Tuzel
R. Luis-García
R. Sivalingam
S. Sra
T. Guha
V. Arsigny
X. Pennec
Y. Pang
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2014
Field of study

International audienceInspired by the great success of sparse coding for vector valued data, our goal is to represent symmetric positive definite (SPD) data matrices as sparse linear combinations of atoms from a dictionary, where each atom itself is an SPD matrix. Since SPD matrices follow a non-Euclidean (in fact a Riemannian) geometry, existing sparse coding techniques for Euclidean data cannot be directly extended. Prior works have approached this problem by defining a sparse coding loss function using either extrinsic similarity measures (such as the log-Euclidean distance) or kernelized variants of statistical measures (such as the Stein divergence, Jeffrey's divergence, etc.). In contrast, we propose to use the intrinsic Riemannian distance on the manifold of SPD matrices. Our main contribution is a novel mathematical model for sparse coding of SPD matrices; we also present a computationally simple algorithm for optimizing our model. Experiments on several computer vision datasets showcase superior classification and retrieval performance compared with state-of-the-art approaches

CiteSeerX

Crossref

Hal - Université Grenoble Alpes

INRIA a CCSD electronic archive server

MPG.PuRe

Pattern recognition, attention, and information bottlenecks in the primate visual system

Author: Anderson C. H.
Gallant J. L.
Olshausen B.
Van Essen D. C.
Publication venue: Society of Photo-optical Instrumentation Engineers (SPIE)
Publication date: 09/07/1991
Field of study

A Generative Method for Textured Motion: Analysis and Synthesis

Author: A. D. Cliff
B. A. Olshausen
M. J. Black
S. Mallat
Y. Meyer
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2002
Field of study

Abstract. Natural scenes contain rich stochastic motion patterns which are characterized by the movement of a large number of small elements, such as falling snow, raining, ÿying birds, þrework and waterfall. In this paper, we call these motion patterns textured motion and present a gen-erative method that combines statistical models and algorithms from both texture and motion analysis. The generative method includes the following three aspects. 1). Photometrically, an image is represented as a superposition of linear bases in atomic decomposition using an over-complete dictionary, such as Gabor or Laplacian. Such base representa-tion is known to be generic for natural images, and it is low dimensional as the number of bases is often 100 times smaller than the number of pixels. 2). Geometrically, each moving element (called moveton), such as the individual snowÿake and bird, is represented by a deformable template which is a group of several spatially adjacent bases. Such tem-plates are learned through clustering. 3). Dynamically, the movetons ar

CiteSeerX

Crossref

eScholarship - University of California

Efficient Sparse Coding in Early Sensory Processing: Lessons from Signal Recovery

Author: A Lörincz
A Lörincz
A Lörincz
AJ Bell
András Lörincz
B Liu
B Natarajan
B Szatmáry
B Widrow
BA Olshausen
BA Olshausen
BA Olshausen
BT Vincent
C Cadieu
C Chennubhotla
D Cai
D Needell
D Needell
DCV Essen
DJ Graham
DL Donoho
DL Ringach
DW Dong
E Doi
E Doi
EJ Candès
EJ Candès
EJ Candès
EP Simoncelli
GC DeAngelis
GH Golub
Gábor Szirtes
H A
H Muehlenbein
HB Barlow
I Szita
IT Jolliffe
J Lücke
JA Cardin
JA Tropp
JJ Atick
JP Jones
Lyle J. Graham
M Rehn
M Riesenhuber
P Berkes
P Földiák
P Lennie
PT de Boer
RW Rodieck
RY Rubinstein
S Mallat
SB Laughlin
W Dai
YC Pati
Z Zhou
Zsolt Palotai
Publication venue: Public Library of Science
Publication date: 01/01/2011
Field of study

Sensory representations are not only sparse, but often overcomplete: coding units significantly outnumber the input units. For models of neural coding this overcompleteness poses a computational challenge for shaping the signal processing channels as well as for using the large and sparse representations in an efficient way. We argue that higher level overcompleteness becomes computationally tractable by imposing sparsity on synaptic activity and we also show that such structural sparsity can be facilitated by statistics based decomposition of the stimuli into typical and atypical parts prior to sparse coding. Typical parts represent large-scale correlations, thus they can be significantly compressed. Atypical parts, on the other hand, represent local features and are the subjects of actual sparse coding. When applied on natural images, our decomposition based sparse coding model can efficiently form overcomplete codes and both center-surround and oriented filters are obtained similar to those observed in the retina and the primary visual cortex, respectively. Therefore we hypothesize that the proposed computational architecture can be seen as a coherent functional model of the first stages of sensory coding in early vision

CiteSeerX

Crossref

Directory of Open Access Journals

PubMed Central

Publikationsserver der Universität Tübingen

ELTE Digital Institutional Repository (EDIT)

Consequences of converting graded to action potentials upon neural information coding and energy efficiency

Author: A Borst
A Destexhe
A Hasenstaub
A Manwani
A Manwani
A Treves
AA Lazar
AA Lazar
AA Lazar
AL Hodgkin
AS French
B Aguera y Arcas
B Sengupta
B Sengupta
B Sengupta
B Sengupta
B Sengupta
BA Olshausen
BC Carter
Biswa Sengupta
C Koch
C Koch
C Shannon
CC Chow
D Attwell
D Desmaisons
D Lee
DM MacKay
DT Gillespie
E Marder
E Salinas
E Schneidman
E Skaugen
EM Izhikevich
F Theunissen
FA Dodge Jr
G Laurent
G Marsaglia
GG de Polavieja
H Alle
H Alle
J Haag
JA White
JA White
JC Rekling
JC Skou
JD Victor
JD Victor
JE Niven
JE Niven
JE Niven
JE Niven
JE Niven
Jeremy Edward Niven
K Koch
M Juusola
M Matsumoto
M Pinsker
M Stemmler
MB Kennel
ME Larkum
MH Kole
MN Shadlen
MS Grubb
MV Srinivasan
NJ Lenn
O Bernander
Olaf Sporns
PG Lillywhite
PN Steinmetz
R Guttman
R Sarpeshkar
RA DiCaprio
RRdR van Steveninck
S Curti
S Laughlin
SB Laughlin
Simon Barry Laughlin
SP Strong
SR Williams
TJ Gawne
V Prelov
W Singer
Y Shu
ZF Mainen
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/01/2014
Field of study

Information is encoded in neural circuits using both graded and action potentials, converting between them within single neurons and successive processing layers. This conversion is accompanied by information loss and a drop in energy efficiency. We investigate the biophysical causes of this loss of information and efficiency by comparing spiking neuron models, containing stochastic voltage-gated Na+ and K+ channels, with generator potential and graded potential models lacking voltage-gated Na+ channels. We identify three causes of information loss in the generator potential that are the by-product of action potential generation: (1) the voltage-gated Na+ channels necessary for action potential generation increase intrinsic noise and (2) introduce non-linearities, and (3) the finite duration of the action potential creates a ‘footprint’ in the generator potential that obscures incoming signals. These three processes reduce information rates by ~50% in generator potentials, to ~3 times that of spike trains. Both generator potentials and graded potentials consume almost an order of magnitude less energy per second than spike trains. Because of the lower information rates of generator potentials they are substantially less energy efficient than graded potentials. However, both are an order of magnitude more efficient than spike trains due to the higher energy costs and low information content of spikes, emphasizing that there is a two-fold cost of converting analogue to digital; information loss and cost inflation

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Open Access Repository of IISc Research Publications

Sussex Research Online

FigShare

Parametric study of EEG sensitivity to phase noise during face processing

Author: A Delorme
A Delorme
AB Sekuler
AB Sekuler
Allison B Sekuler
AV Oppenheim
B Jemel
B Rossion
BA Olshausen
BS Tjan
C Jacques
C Jacques
C Joyce
C Pernet
CA Olman
Cyril R Pernet
DA Jeffreys
DA Jeffreys
DM Tucker
EP Simoncelli
FA Kingdom
FA Wichmann
G Felsen
G Rainer
G Rainer
GA Rousselet
GA Rousselet
GA Rousselet
GA Rousselet
GA Rousselet
GA Rousselet
Guillaume A Rousselet
J Bullier
J Bullier
J Drewes
J Gold
J Portilla
JJ DiCarlo
JS Husk
K Bötzel
K Grill-Spector
K Tanaka
KL Hoffman
LC Loschky
LT DeCarlo
MC Morrone
MG Philiastides
MG Philiastides
MG Thomson
MG Thomson
ML Smith
MM Murray
NC Rust
NK Logothetis
O Hauk
P Sehatpour
Patrick J Bennett
PG Schyns
PG Schyns
R VanRullen
RJ Itier
RJ Itier
RJ Itier
RJ Itier
RR Wilcox
S Bentin
S Bentin
SA Hillyard
SC Dakin
SJ Thorpe
T Tanskanen
T Tanskanen
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2008
Field of study

Background: The present paper examines the visual processing speed of complex objects, here faces, by mapping the relationship between object physical properties and single-trial brain responses. Measuring visual processing speed is challenging because uncontrolled physical differences that co-vary with object categories might affect brain measurements, thus biasing our speed estimates. Recently, we demonstrated that early event-related potential (ERP) differences between faces and objects are preserved even when images differ only in phase information, and amplitude spectra are equated across image categories. Here, we use a parametric design to study how early ERP to faces are shaped by phase information. Subjects performed a two-alternative force choice discrimination between two faces (Experiment 1) or textures (two control experiments). All stimuli had the same amplitude spectrum and were presented at 11 phase noise levels, varying from 0% to 100% in 10% increments, using a linear phase interpolation technique. Single-trial ERP data from each subject were analysed using a multiple linear regression model. Results: Our results show that sensitivity to phase noise in faces emerges progressively in a short time window between the P1 and the N170 ERP visual components. The sensitivity to phase noise starts at about 120–130 ms after stimulus onset and continues for another 25–40 ms. This result was robust both within and across subjects. A control experiment using pink noise textures, which had the same second-order statistics as the faces used in Experiment 1, demonstrated that the sensitivity to phase noise observed for faces cannot be explained by the presence of global image structure alone. A second control experiment used wavelet textures that were matched to the face stimuli in terms of second- and higher-order image statistics. Results from this experiment suggest that higher-order statistics of faces are necessary but not sufficient to obtain the sensitivity to phase noise function observed in response to faces. Conclusion: Our results constitute the first quantitative assessment of the time course of phase information processing by the human visual brain. We interpret our results in a framework that focuses on image statistics and single-trial analyses

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Edinburgh Research Explorer

Enlighten

Catalyzing next-generation Artificial Intelligence through NeuroAI

Author: Bengio Y
Boahen K
Botvinick M
Chklovskii D
Churchland A
Clopath C
DiCarlo J
Escola S
Ganguli S
Hawkins J
Koulakov A
Körding K
LeCun Y
Lillicrap T
Marblestone A
Olshausen B
Pouget A
Richards B
Savin C
Sejnowski T
Simoncelli E
Solla S
Sussillo D
Tolias AS
Tsao D
Zador A
Ölveczky B
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 03/03/2023
Field of study

Neuroscience has long been an essential driver of progress in artificial intelligence (AI). We propose that to accelerate progress in AI, we must invest in fundamental research in NeuroAI. A core component of this is the embodied Turing test, which challenges AI animal models to interact with the sensorimotor world at skill levels akin to their living counterparts. The embodied Turing test shifts the focus from those capabilities like game playing and language that are especially well-developed or uniquely human to those capabilities - inherited from over 500 million years of evolution - that are shared with all animals. Building models that can pass the embodied Turing test will provide a roadmap for the next generation of AI

Spiral - Imperial College Digital Repository