Search CORE

4,984 research outputs found

On Invariance and Selectivity in Representation Learning

Author: Anselmi Fabio
Poggio Tomaso
Rosasco Lorenzo
Publication venue
Publication date: 19/03/2015
Field of study

We discuss data representation which can be learned automatically from data, are invariant to transformations, and at the same time selective, in the sense that two points have the same representation only if they are one the transformation of the other. The mathematical results here sharpen some of the key claims of i-theory -- a recent theory of feedforward processing in sensory cortex

arXiv.org e-Print Archive

Archivio istituzionale della ricerca - Università di Trieste

DSpace@MIT

Crossref

Archivio istituzionale della ricerca - Università di Genova

On invariance and selectivity in representation learning

Author: Anselmi Fabio
Poggio Tomaso A
Rosasco Lorenzo
Publication venue: 'Oxford University Press (OUP)'
Publication date: 01/01/2016
Field of study

We study the problem of learning from data representations that are invariant to transformations, and at the same time selective, in the sense that two points have the same representation if one is the transformation of the other. The mathematical results here sharpen some of the key claims of i-theory—a recent theory of feedforward processing in sensory cortex (Anselmi et al., 2013, Theor. Comput. Sci. and arXiv:1311.4158; Anselmi et al., 2013, Magic materials: a theory of deep hierarchical architectures for learning sensory representations. CBCL Paper; Anselmi & Poggio, 2010, Representation learning in sensory cortex: a theory. CBMM Memo No. 26).National Science Foundation (U.S.) (Award CCF-1231216

DSpace@MIT

Archivio istituzionale della ricerca - Università di Genova

A Deep Representation for Invariance And Music Classification

Author: Evangelopoulos Georgios
Poggio Tomaso
Rosasco Lorenzo
Voinea Stephen
Zhang Chiyuan
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2014
Field of study

Representations in the auditory cortex might be based on mechanisms similar to the visual ventral stream; modules for building invariance to transformations and multiple layers for compositionality and selectivity. In this paper we propose the use of such computational modules for extracting invariant and discriminative audio representations. Building on a theory of invariance in hierarchical architectures, we propose a novel, mid-level representation for acoustical signals, using the empirical distributions of projections on a set of templates and their transformations. Under the assumption that, by construction, this dictionary of templates is composed from similar classes, and samples the orbit of variance-inducing signal transformations (such as shift and scale), the resulting signature is theoretically guaranteed to be unique, invariant to transformations and stable to deformations. Modules of projection and pooling can then constitute layers of deep networks, for learning composite representations. We present the main theoretical and computational aspects of a framework for unsupervised learning of invariant audio representations, empirically evaluated on music genre classification.Comment: 5 pages, CBMM Memo No. 002, (to appear) IEEE 2014 International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2014

arXiv.org e-Print Archive

CiteSeerX

DSpace@MIT

Crossref

View-tolerant face recognition and Hebbian learning imply mirror-symmetric neural tuning to head orientation

Author: Abbott
Adelson
Afraz
Anselmi
Bart
Bart
Berkes
Bruce
Cox
DiCarlo
Fabio Anselmi
Farzmahdi
Freiwald
Földiák
Hassoun
Hengen
Hung
Isik
Isik
Joel Z. Leibo
Keck
Ku
Leibo
Li
Meyers
Miyashita
Moeller
Oja
Oja
Poggio
Qianli Liao
Riesenhuber
Rolls
Sanger
Serre
Tan
Thorpe
Tomaso Poggio
Tsao
Tsao
Tsao
Turrigiano
Wallis
Winrich A. Freiwald
Wiskott
Publication venue
Publication date: 03/06/2016
Field of study

The primate brain contains a hierarchy of visual areas, dubbed the ventral stream, which rapidly computes object representations that are both specific for object identity and relatively robust against identity-preserving transformations like depth-rotations. Current computational models of object recognition, including recent deep learning networks, generate these properties through a hierarchy of alternating selectivity-increasing filtering and tolerance-increasing pooling operations, similar to simple-complex cells operations. While simulations of these models recapitulate the ventral stream's progression from early view-specific to late view-tolerant representations, they fail to generate the most salient property of the intermediate representation for faces found in the brain: mirror-symmetric tuning of the neural population to head orientation. Here we prove that a class of hierarchical architectures and a broad set of biologically plausible learning rules can provide approximate invariance at the top level of the network. While most of the learning rules do not yield mirror-symmetry in the mid-level representations, we characterize a specific biologically-plausible Hebb-type learning rule that is guaranteed to generate mirror-symmetric tuning to faces tuning at intermediate levels of the architecture

arXiv.org e-Print Archive

Archivio istituzionale della ricerca - Università di Trieste

DSpace@MIT

Crossref

PubMed Central

Unsupervised Learning of Individuals and Categories from Images

Author: Koch Christof
Waydo Stephen
Publication venue: 'MIT Press - Journals'
Publication date: 01/01/2008
Field of study

Motivated by the existence of highly selective, sparsely firing cells observed in the human medial temporal lobe (MTL), we present an unsupervised method for learning and recognizing object categories from unlabeled images. In our model, a network of nonlinear neurons learns a sparse representation of its inputs through an unsupervised expectation-maximization process. We show that the application of this strategy to an invariant feature-based description of natural images leads to the development of units displaying sparse, invariant selectivity for particular individuals or image categories much like those observed in the MTL data

Crossref

Caltech Authors