Search CORE

18 research outputs found

Deep Epitomic Convolutional Neural Networks

Author: Papandreou George
Publication venue
Publication date: 10/06/2014
Field of study

Deep convolutional neural networks have recently proven extremely competitive in challenging image recognition tasks. This paper proposes the epitomic convolution as a new building block for deep neural networks. An epitomic convolution layer replaces a pair of consecutive convolution and max-pooling layers found in standard deep convolutional neural networks. The main version of the proposed model uses mini-epitomes in place of filters and computes responses invariant to small translations by epitomic search instead of max-pooling over image positions. The topographic version of the proposed model uses large epitomes to learn filter maps organized in translational topographies. We show that error back-propagation can successfully learn multiple epitomic layers in a supervised fashion. The effectiveness of the proposed method is assessed in image classification tasks on standard benchmarks. Our experiments on Imagenet indicate improved recognition performance compared to standard convolutional neural networks of similar architecture. Our models pre-trained on Imagenet perform excellently on Caltech-101. We also obtain competitive image classification results on the small-image MNIST and CIFAR-10 datasets.Comment: 9 page

arXiv.org e-Print Archive

CiteSeerX

In All Likelihood, Deep Belief Is Not Enough

Author: Bethge Matthias
Gerwinn Sebastian
Sinz Fabian
Theis Lucas
Publication venue
Publication date: 28/11/2010
Field of study

Statistical models of natural stimuli provide an important tool for researchers in the fields of machine learning and computational neuroscience. A canonical way to quantitatively assess and compare the performance of statistical models is given by the likelihood. One class of statistical models which has recently gained increasing popularity and has been applied to a variety of complex data are deep belief networks. Analyses of these models, however, have been typically limited to qualitative analyses based on samples due to the computationally intractable nature of the model likelihood. Motivated by these circumstances, the present article provides a consistent estimator for the likelihood that is both computationally tractable and simple to apply in practice. Using this estimator, a deep belief network which has been suggested for the modeling of natural image patches is quantitatively investigated and compared to other models of natural image patches. Contrary to earlier claims based on qualitative results, the results presented in this article provide evidence that the model under investigation is not a particularly good model for natural image

arXiv.org e-Print Archive

MPG.PuRe

Are v1 simple cells optimized for visual occlusions? : A comparative study

Author: Bornschein Jörg
Henniges Marc
Lücke Jörg
Publication venue
Publication date: 06/06/2013
Field of study

Abstract: Simple cells in primary visual cortex were famously found to respond to low-level image components such as edges. Sparse coding and independent component analysis (ICA) emerged as the standard computational models for simple cell coding because they linked their receptive fields to the statistics of visual stimuli. However, a salient feature of image statistics, occlusions of image components, is not considered by these models. Here we ask if occlusions have an effect on the predicted shapes of simple cell receptive fields. We use a comparative approach to answer this question and investigate two models for simple cells: a standard linear model and an occlusive model. For both models we simultaneously estimate optimal receptive fields, sparsity and stimulus noise. The two models are identical except for their component superposition assumption. We find the image encoding and receptive fields predicted by the models to differ significantly. While both models predict many Gabor-like fields, the occlusive model predicts a much sparser encoding and high percentages of ‘globular’ receptive fields. This relatively new center-surround type of simple cell response is observed since reverse correlation is used in experimental studies. While high percentages of ‘globular’ fields can be obtained using specific choices of sparsity and overcompleteness in linear sparse coding, no or only low proportions are reported in the vast majority of studies on linear models (including all ICA models). Likewise, for the here investigated linear model and optimal sparsity, only low proportions of ‘globular’ fields are observed. In comparison, the occlusive model robustly infers high proportions and can match the experimentally observed high proportions of ‘globular’ fields well. Our computational study, therefore, suggests that ‘globular’ fields may be evidence for an optimal encoding of visual occlusions in primary visual cortex. Author Summary: The statistics of our visual world is dominated by occlusions. Almost every image processed by our brain consists of mutually occluding objects, animals and plants. Our visual cortex is optimized through evolution and throughout our lifespan for such stimuli. Yet, the standard computational models of primary visual processing do not consider occlusions. In this study, we ask what effects visual occlusions may have on predicted response properties of simple cells which are the first cortical processing units for images. Our results suggest that recently observed differences between experiments and predictions of the standard simple cell models can be attributed to occlusions. The most significant consequence of occlusions is the prediction of many cells sensitive to center-surround stimuli. Experimentally, large quantities of such cells are observed since new techniques (reverse correlation) are used. Without occlusions, they are only obtained for specific settings and none of the seminal studies (sparse coding, ICA) predicted such fields. In contrast, the new type of response naturally emerges as soon as occlusions are considered. In comparison with recent in vivo experiments we find that occlusive models are consistent with the high percentages of center-surround simple cells observed in macaque monkeys, ferrets and mice

Crossref

Directory of Open Access Journals

PubMed Central

Hochschulschriftenserver - Universität Frankfurt am Main

FigShare

Learning invariant features through topographic filter maps

Author
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date
Field of study

Crossref

Learning invariant features through topographic filter maps

Author: K. Kavukcuoglu
M.A. Ranzato
null Yann Le-Cun
R. Fergus
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2009
Field of study

Crossref

Cortical Surround Interactions and Perceptual Salience via Natural Scene Statistics

Author: A Angelucci
A Ayaz
A Hyvärinen
A Shmuel
AJ Bell
AM Sillito
AR Koene
BA Olshausen
C Zetzsche
CD Gilbert
CD Gilbert
CI Moore
CM Bishop
CY Li
D Fitzpatrick
D Gao
DD Stettler
DJ Heeger
DK Hammond
DL Ringach
DY Ts'o
E Goddard
E Salinas
EP Simoncelli
EP Simoncelli
F Attneave
F Sengpiel
FS Chance
G Chen
GA Walker
H Ozeki
HB Barlow
HC Nothdurft
HE Jones
HW Heuer
I Nauhaus
J Allman
J Lücke
J Malo
J Portilla
J Portilla
JA Guerrero-Colon
JB Levitt
JJ Kivinen
JJ Knierim
JL Gallant
JL Gauthier
JM Ichida
JR Cavanaugh
JR Cavanaugh
KP Körding
L Itti
L Itti
L Kuhlmann
L Parra
L Schwabe
L Zhaoping
L Zhaoping
L Zhaoping
L Zhaoping
L Zhaoping
LY Zhang
M Carandini
M Kouh
M Sigman
MJ Wainwright
MK Kapadia
MP Sceniak
MW Pettet
MW Spratling
ND Bruce
O Schwartz
O Schwartz
O Schwartz
O Schwartz
Odelia Schwartz
Olaf Sporns
P Dayan
P Series
Peter Dayan
PO Hoyer
Q Li
R Coen-Cagli
RP Rao
Ruben Coen-Cagli
S Osindero
SC Dakin
SC Yen
T Kasamatsu
TN Mundhenk
U Polat
VA Lamme
W Li
W Li
WE Vinje
WS Geisler
WS Geisler
WS Geisler
Y Karklin
Y Karklin
Z Li
Z Li
Z Li
ZM Shen
Publication venue: Public Library of Science
Publication date: 01/01/2012
Field of study

Spatial context in images induces perceptual phenomena associated with salience and modulates the responses of neurons in primary visual cortex (V1). However, the computational and ecological principles underlying contextual effects are incompletely understood. We introduce a model of natural images that includes grouping and segmentation of neighboring features based on their joint statistics, and we interpret the firing rates of V1 neurons as performing optimal recognition in this model. We show that this leads to a substantial generalization of divisive normalization, a computation that is ubiquitous in many neural areas and systems. A main novelty in our model is that the influence of the context on a target stimulus is determined by their degree of statistical dependence. We optimized the parameters of the model on natural image patches, and then simulated neural and perceptual responses on stimuli used in classical experiments. The model reproduces some rich and complex response patterns observed in V1, such as the contrast dependence, orientation tuning and spatial asymmetry of surround suppression, while also allowing for surround facilitation under conditions of weak stimulation. It also mimics the perceptual salience produced by simple displays, and leads to readily testable predictions. Our results provide a principled account of orientation-based contextual modulation in early vision and its sensitivity to the homogeneity and spatial arrangement of inputs, and lends statistical support to the theory that V1 computes visual salience

CiteSeerX

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

UCL Discovery

University of Miami: Scholarship Miami

MPG.PuRe

FigShare

Hierarchical temporal prediction captures motion processing along the visual pathway

Author: Harper Nicol S
King Andrew J
Singer Yosef
Taylor Luke
Willmore Benjamin DB
Publication venue: eLife Sciences Publications
Publication date: 16/10/2023
Field of study

Visual neurons respond selectively to features that become increasingly complex from the eyes to the cortex. Retinal neurons prefer flashing spots of light, primary visual cortical (V1) neurons prefer moving bars, and those in higher cortical areas favor complex features like moving textures. Previously, we showed that V1 simple cell tuning can be accounted for by a basic model implementing temporal prediction – representing features that predict future sensory input from past input (Singer et al., 2018). Here, we show that hierarchical application of temporal prediction can capture how tuning properties change across at least two levels of the visual system. This suggests that the brain does not efficiently represent all incoming information; instead, it selectively represents sensory inputs that help in predicting the future. When applied hierarchically, temporal prediction extracts time-varying features that depend on increasingly high-level statistics of the sensory input

Oxford University Research Archive