Search CORE

7,574 research outputs found

Building high-level features using large scale unsupervised learning

Author: Chen Kai
Corrado Greg S.
Dean Jeff
Devin Matthieu
Le Quoc V.
Monga Rajat
Ng Andrew Y.
Ranzato Marc'Aurelio
Publication venue
Publication date: 01/01/2012
Field of study

We consider the problem of building high-level, class-specific feature detectors from only unlabeled data. For example, is it possible to learn a face detector using only unlabeled images? To answer this, we train a 9-layered locally connected sparse autoencoder with pooling and local contrast normalization on a large dataset of images (the model has 1 billion connections, the dataset has 10 million 200x200 pixel images downloaded from the Internet). We train this network using model parallelism and asynchronous SGD on a cluster with 1,000 machines (16,000 cores) for three days. Contrary to what appears to be a widely-held intuition, our experimental results reveal that it is possible to train a face detector without having to label images as containing a face or not. Control experiments show that this feature detector is robust not only to translation but also to scaling and out-of-plane rotation. We also find that the same network is sensitive to other high-level concepts such as cat faces and human bodies. Starting with these learned features, we trained our network to obtain 15.8% accuracy in recognizing 20,000 object categories from ImageNet, a leap of 70% relative improvement over the previous state-of-the-art

arXiv.org e-Print Archive

CiteSeerX

Subitizing with Variational Autoencoders

Author: A Nieder
A Nieder
BM Harvey
BR Jansen
C Arteta
D Burr
E Walach
EL Kaufman
G Lakoff
G Lemaître
H Davis
I Goodfellow
I Stoianov
J Zhang
L Feigenson
M Piazza
M Poncet
M-M Cheng
MC Franka
Olga Russakovsky
P Viswanathan
S Dehaene
T-Y Lin
WS Jevons
Y Hu
Y LeCun
Publication venue
Publication date: 01/08/2018
Field of study

Numerosity, the number of objects in a set, is a basic property of a given visual scene. Many animals develop the perceptual ability to subitize: the near-instantaneous identification of the numerosity in small sets of visual items. In computer vision, it has been shown that numerosity emerges as a statistical property in neural networks during unsupervised learning from simple synthetic images. In this work, we focus on more complex natural images using unsupervised hierarchical neural networks. Specifically, we show that variational autoencoders are able to spontaneously perform subitizing after training without supervision on a large amount images from the Salient Object Subitizing dataset. While our method is unable to outperform supervised convolutional networks for subitizing, we observe that the networks learn to encode numerosity as basic visual property. Moreover, we find that the learned representations are likely invariant to object area; an observation in alignment with studies on biological neural networks in cognitive neuroscience

arXiv.org e-Print Archive

Crossref

International Migration, Integration and Social Cohesion online publications

Unsupervised Learning of Individuals and Categories from Images

Author: Koch Christof
Waydo Stephen
Publication venue: 'MIT Press - Journals'
Publication date: 01/01/2008
Field of study

Motivated by the existence of highly selective, sparsely firing cells observed in the human medial temporal lobe (MTL), we present an unsupervised method for learning and recognizing object categories from unlabeled images. In our model, a network of nonlinear neurons learns a sparse representation of its inputs through an unsupervised expectation-maximization process. We show that the application of this strategy to an invariant feature-based description of natural images leads to the development of units displaying sparse, invariant selectivity for particular individuals or image categories much like those observed in the MTL data

Crossref

Caltech Authors

Learned versus Hand-Designed Feature Representations for 3d Agglomeration

Author: Bogovic John A.
Huang Gary B.
Jain Viren
Publication venue
Publication date: 20/12/2013
Field of study

For image recognition and labeling tasks, recent results suggest that machine learning methods that rely on manually specified feature representations may be outperformed by methods that automatically derive feature representations based on the data. Yet for problems that involve analysis of 3d objects, such as mesh segmentation, shape retrieval, or neuron fragment agglomeration, there remains a strong reliance on hand-designed feature descriptors. In this paper, we evaluate a large set of hand-designed 3d feature descriptors alongside features learned from the raw data using both end-to-end and unsupervised learning techniques, in the context of agglomeration of 3d neuron fragments. By combining unsupervised learning techniques with a novel dynamic pooling scheme, we show how pure learning-based methods are for the first time competitive with hand-designed 3d shape descriptors. We investigate data augmentation strategies for dramatically increasing the size of the training set, and show how combining both learned and hand-designed features leads to the highest accuracy

arXiv.org e-Print Archive

CiteSeerX

An emergentist perspective on the origin of number sense

Author
Publication venue: 'The Royal Society'
Publication date: 22/02/2018
Field of study

open2noopenZorzi, Marco; Testolin, AlbertoZorzi, Marco; Testolin, Albert

Archivio istituzionale della ricerca - Università di Padova

An Analysis of the Connections Between Layers of Deep Neural Networks

Author: Bates Jordan
Culurciello Eugenio
Dundar Aysegul
Jin Jonghoon
Publication venue
Publication date: 01/06/2013
Field of study

We present an analysis of different techniques for selecting the connection be- tween layers of deep neural networks. Traditional deep neural networks use ran- dom connection tables between layers to keep the number of connections small and tune to different image features. This kind of connection performs adequately in supervised deep networks because their values are refined during the training. On the other hand, in unsupervised learning, one cannot rely on back-propagation techniques to learn the connections between layers. In this work, we tested four different techniques for connecting the first layer of the network to the second layer on the CIFAR and SVHN datasets and showed that the accuracy can be im- proved up to 3% depending on the technique used. We also showed that learning the connections based on the co-occurrences of the features does not confer an advantage over a random connection table in small networks. This work is helpful to improve the efficiency of connections between the layers of unsupervised deep neural networks

arXiv.org e-Print Archive

CiteSeerX