Search CORE

1,174 research outputs found

Evaluating Sparse Codes on Handwritten Digits

Author
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2013
Field of study

PADDLE: Proximal Algorithm for Dual Dictionaries LEarning

Author: Basso Curzio
Santoro Matteo
Verri Alessandro
Villa Silvia
Publication venue
Publication date: 16/11/2010
Field of study

Recently, considerable research efforts have been devoted to the design of methods to learn from data overcomplete dictionaries for sparse coding. However, learned dictionaries require the solution of an optimization problem for coding new data. In order to overcome this drawback, we propose an algorithm aimed at learning both a dictionary and its dual: a linear mapping directly performing the coding. By leveraging on proximal methods, our algorithm jointly minimizes the reconstruction error of the dictionary and the coding error of its dual; the sparsity of the representation is induced by an

\ell_1

-based penalty on its coefficients. The results obtained on synthetic data and real images show that the algorithm is capable of recovering the expected dictionaries. Furthermore, on a benchmark dataset, we show that the image features obtained from the dual matrix yield state-of-the-art classification performance while being much less computational intensive

arXiv.org e-Print Archive

Archivio istituzionale della ricerca - Università di Genova

Network Plasticity as Bayesian Inference

Author: Habenschuss Stefan
Kappel David
Legenstein Robert
Maass Wolfgang
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 20/04/2015
Field of study

General results from statistical learning theory suggest to understand not only brain computations, but also brain plasticity as probabilistic inference. But a model for that has been missing. We propose that inherently stochastic features of synaptic plasticity and spine motility enable cortical networks of neurons to carry out probabilistic inference by sampling from a posterior distribution of network configurations. This model provides a viable alternative to existing models that propose convergence of parameters to maximum likelihood values. It explains how priors on weight distributions and connection probabilities can be merged optimally with learned experience, how cortical networks can generalize learned information so well to novel experiences, and how they can compensate continuously for unforeseen disturbances of the network. The resulting new theory of network plasticity explains from a functional perspective a number of experimental data on stochastic aspects of synaptic plasticity that previously appeared to be quite puzzling.Comment: 33 pages, 5 figures, the supplement is available on the author's web page http://www.igi.tugraz.at/kappe

arXiv.org e-Print Archive

Directory of Open Access Journals

PubMed Central

A linear approach for sparse coding by a two-layer neural network

Author: Montalto Alessandro
Prevete Roberto
Tessitore Giovanni
Publication venue
Publication date: 01/01/2015
Field of study

Many approaches to transform classification problems from non-linear to linear by feature transformation have been recently presented in the literature. These notably include sparse coding methods and deep neural networks. However, many of these approaches require the repeated application of a learning process upon the presentation of unseen data input vectors, or else involve the use of large numbers of parameters and hyper-parameters, which must be chosen through cross-validation, thus increasing running time dramatically. In this paper, we propose and experimentally investigate a new approach for the purpose of overcoming limitations of both kinds. The proposed approach makes use of a linear auto-associative network (called SCNN) with just one hidden layer. The combination of this architecture with a specific error function to be minimized enables one to learn a linear encoder computing a sparse code which turns out to be as similar as possible to the sparse coding that one obtains by re-training the neural network. Importantly, the linearity of SCNN and the choice of the error function allow one to achieve reduced running time in the learning phase. The proposed architecture is evaluated on the basis of two standard machine learning tasks. Its performances are compared with those of recently proposed non-linear auto-associative neural networks. The overall results suggest that linear encoders can be profitably used to obtain sparse data representations in the context of machine learning problems, provided that an appropriate error function is used during the learning phase

arXiv.org e-Print Archive

Archivio della ricerca - Università degli studi di Napoli Federico II