Mining Object Parts from CNNs via Active Question-Answering
Given a convolutional neural network (CNN) that is pre-trained for object
classification, this paper proposes to use active question-answering to
semanticize neural patterns in conv-layers of the CNN and mine part concepts.
For each part concept, we mine neural patterns in the pre-trained CNN, which
are related to the target part, and use these patterns to construct an And-Or
graph (AOG) to represent a four-layer semantic hierarchy of the part. As an
interpretable model, the AOG associates different CNN units with different
explicit object parts. We use an active human-computer communication to
incrementally grow such an AOG on the pre-trained CNN as follows. We allow the
computer to actively identify objects, whose neural patterns cannot be
explained by the current AOG. Then, the computer asks human about the
unexplained objects, and uses the answers to automatically discover certain CNN
patterns corresponding to the missing knowledge. We incrementally grow the AOG
to encode new knowledge discovered during the active-learning process. In
experiments, our method exhibits high learning efficiency. Our method uses
about 1/6-1/3 of the part annotations for training, but achieves similar or
better part-localization performance than Fast R-CNN methods. Comment: Published in CVPR 201
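The active QA loop described above can be sketched in miniature: the computer flags pattern activations the current AOG cannot explain, queries an oracle, and grows the AOG with the answer. The pattern ids, the dict-based "AOG", and the oracle below are toy stand-ins, not the paper's actual data structures.

```python
# Toy sketch of the active question-answering loop: flag unexplained
# neural patterns, ask a (simulated) human, grow the AOG incrementally.
# All names here are illustrative assumptions.

# Each object is represented by the set of neural-pattern ids it activates.
objects = [{1, 2}, {2, 3}, {4, 5}, {1, 5}]

# Hypothetical human oracle: maps an unexplained pattern to a part label.
oracle = {1: "head", 2: "torso", 3: "leg", 4: "wing", 5: "tail"}

aog = {}  # pattern id -> part label; grows during active learning

for obj in objects:
    # Actively identify patterns the current AOG cannot explain.
    unexplained = obj - aog.keys()
    for pattern in unexplained:
        # Ask about the unexplained object; encode the answer as new knowledge.
        aog[pattern] = oracle[pattern]

print(len(aog))  # -> 5: all patterns explained after the loop
```

Because only unexplained patterns trigger a question, annotation effort scales with the missing knowledge rather than with the dataset size, which is the source of the reported label efficiency.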
Expanded Parts Model for Semantic Description of Humans in Still Images
We introduce an Expanded Parts Model (EPM) for recognizing human attributes
(e.g. young, short hair, wearing suit) and actions (e.g. running, jumping) in
still images. An EPM is a collection of part templates which are learnt
discriminatively to explain specific scale-space regions in the images (in
human-centric coordinates). This is in contrast to current models, which consist
of relatively few (i.e. a mixture of) 'average' templates. EPM uses only a
subset of the parts to score an image and scores the image sparsely in space,
i.e. it ignores redundant and random background in an image. To learn our
model, we propose an algorithm which automatically mines parts and learns
corresponding discriminative templates together with their respective locations
from a large number of candidate parts. We validate our method on three recent
challenging datasets of human attributes and actions. We obtain convincing
qualitative and state-of-the-art quantitative results on the three datasets. Comment: Accepted for publication in IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI)
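The sparse scoring idea in EPM can be sketched as follows: each part template takes its best response over candidate locations, and only the top-k responding parts contribute to the image score, so redundant background is ignored. The shapes and k below are illustrative assumptions, not the paper's settings.

```python
import numpy as np

# Sketch of EPM-style sparse scoring: best location per part template,
# then sum only the k strongest parts. Sizes are illustrative.
rng = np.random.default_rng(0)
n_parts, n_locations, k = 20, 50, 5

# responses[p, l]: score of part template p at candidate location l.
responses = rng.normal(size=(n_parts, n_locations))

best_per_part = responses.max(axis=1)   # best location for each part
top_k = np.sort(best_per_part)[-k:]     # keep only the k strongest parts
image_score = top_k.sum()               # sparse image-level score

print(image_score)
```

Summing only the top-k responses is what makes the scoring sparse in space: parts whose templates never fire well anywhere in the image contribute nothing.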
A Method to Distinguish Quiescent and Dusty Star-forming Galaxies with Machine Learning
Large photometric surveys provide a rich source of observations of quiescent galaxies, including a surprisingly large population at z > 1. However, identifying large, but clean, samples of quiescent galaxies has proven difficult because of their near-degeneracy with interlopers such as dusty, star-forming galaxies. We describe a new technique for selecting quiescent galaxies based upon t-distributed stochastic neighbor embedding (t-SNE), an unsupervised machine-learning algorithm for dimensionality reduction. This t-SNE selection improves both on UVJ, removing interlopers that would otherwise pass color selection, and on photometric template fitting, with the advantage growing toward high redshift. Due to the similarity between the colors of high- and low-redshift quiescent galaxies, under our assumptions, t-SNE outperforms template fitting in 63% of trials at redshifts where a large training sample already exists. It also may be able to select quiescent galaxies more efficiently at higher redshifts than the training sample.
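Once the t-SNE embedding of the photometric colors has been computed (e.g. with scikit-learn's TSNE), a selection step could classify each galaxy by a nearest-neighbor vote against the labeled training sample in the embedding plane. The coordinates and cluster positions below are synthetic stand-ins, not survey data.

```python
import numpy as np

# Sketch of selection in a precomputed 2-D t-SNE plane via k-NN voting.
# Cluster locations and labels are synthetic assumptions.
rng = np.random.default_rng(1)

# Labeled training galaxies: quiescent cluster near (+5, 0),
# dusty star-forming cluster near (-5, 0).
train_xy = np.vstack([rng.normal([5.0, 0.0], 1.0, (100, 2)),
                      rng.normal([-5.0, 0.0], 1.0, (100, 2))])
train_quiescent = np.array([True] * 100 + [False] * 100)

def knn_select(xy, k=7):
    """Majority vote among the k nearest labeled neighbors in t-SNE space."""
    d = np.linalg.norm(train_xy - xy, axis=1)
    nearest = np.argsort(d)[:k]
    return train_quiescent[nearest].mean() > 0.5

print(knn_select(np.array([4.5, 0.3])))    # True: quiescent cluster
print(knn_select(np.array([-5.2, -0.1])))  # False: star-forming cluster
```

The vote in embedding space is what removes interlopers: a dusty star-forming galaxy that passes a UVJ color cut still lands among star-forming neighbors in the t-SNE plane.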
View-tolerant face recognition and Hebbian learning imply mirror-symmetric neural tuning to head orientation
The primate brain contains a hierarchy of visual areas, dubbed the ventral
stream, which rapidly computes object representations that are both specific
for object identity and relatively robust against identity-preserving
transformations like depth-rotations. Current computational models of object
recognition, including recent deep learning networks, generate these properties
through a hierarchy of alternating selectivity-increasing filtering and
tolerance-increasing pooling operations, similar to simple-complex cells
operations. While simulations of these models recapitulate the ventral stream's
progression from early view-specific to late view-tolerant representations,
they fail to generate the most salient property of the intermediate
representation for faces found in the brain: mirror-symmetric tuning of the
neural population to head orientation. Here we prove that a class of
hierarchical architectures and a broad set of biologically plausible learning
rules can provide approximate invariance at the top level of the network. While
most of the learning rules do not yield mirror-symmetry in the mid-level
representations, we characterize a specific biologically-plausible Hebb-type
learning rule that is guaranteed to generate mirror-symmetric tuning to faces
at intermediate levels of the architecture.
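A minimal sketch of a Hebb-type rule of the kind the paper analyzes is Oja's rule, whose weight vector converges to the leading principal component of its inputs. The data below are synthetic with most variance along one axis, not face images, so this only illustrates the learning-rule family, not the mirror-symmetry result itself.

```python
import numpy as np

# Oja's rule: Hebbian update with a decay term that keeps the weight
# vector near unit norm while it aligns with the top principal direction.
rng = np.random.default_rng(2)

# Synthetic inputs with most variance along the first axis.
X = rng.normal(size=(5000, 3)) * np.array([3.0, 1.0, 0.5])

w = rng.normal(size=3)
w /= np.linalg.norm(w)   # start from a unit-norm weight vector
eta = 0.01
for x in X:
    y = w @ x
    w += eta * y * (x - y * w)   # Hebbian term y*x minus decay y^2*w

print(np.round(np.abs(w), 2))  # approx. unit vector along the first axis
```

The key property for the paper's argument is that such rules are sensitive only to the second-order statistics of their inputs, which for mirrored face views are symmetric under reflection.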