14,701 research outputs found
Unsupervised Learning of Individuals and Categories from Images
Motivated by the existence of highly selective, sparsely firing cells observed in the human medial temporal lobe (MTL), we present an unsupervised method for learning and recognizing object categories from unlabeled images. In our model, a network of nonlinear neurons learns a sparse representation of its inputs through an unsupervised expectation-maximization process. We show that the application of this strategy to an invariant feature-based description of natural images leads to the development of units displaying sparse, invariant selectivity for particular individuals or image categories much like those observed in the MTL data
Automatic Recognition of Mammal Genera on Camera-Trap Images using Multi-Layer Robust Principal Component Analysis and Mixture Neural Networks
The segmentation and classification of animals from camera-trap images is due
to the conditions under which the images are taken, a difficult task. This work
presents a method for classifying and segmenting mammal genera from camera-trap
images. Our method uses Multi-Layer Robust Principal Component Analysis (RPCA)
for segmenting, Convolutional Neural Networks (CNNs) for extracting features,
Least Absolute Shrinkage and Selection Operator (LASSO) for selecting features,
and Artificial Neural Networks (ANNs) or Support Vector Machines (SVM) for
classifying mammal genera present in the Colombian forest. We evaluated our
method with the camera-trap images from the Alexander von Humboldt Biological
Resources Research Institute. We obtained an accuracy of 92.65% classifying 8
mammal genera and a False Positive (FP) class, using automatic-segmented
images. On the other hand, we reached 90.32% of accuracy classifying 10 mammal
genera, using ground-truth images only. Unlike almost all previous works, we
confront the animal segmentation and genera classification in the camera-trap
recognition. This method shows a new approach toward a fully-automatic
detection of animals from camera-trap images
Learning Sparse Adversarial Dictionaries For Multi-Class Audio Classification
Audio events are quite often overlapping in nature, and more prone to noise
than visual signals. There has been increasing evidence for the superior
performance of representations learned using sparse dictionaries for
applications like audio denoising and speech enhancement. This paper
concentrates on modifying the traditional reconstructive dictionary learning
algorithms, by incorporating a discriminative term into the objective function
in order to learn class-specific adversarial dictionaries that are good at
representing samples of their own class at the same time poor at representing
samples belonging to any other class. We quantitatively demonstrate the
effectiveness of our learned dictionaries as a stand-alone solution for both
binary as well as multi-class audio classification problems.Comment: Accepted in Asian Conference of Pattern Recognition (ACPR-2017
- …