Search CORE

53 research outputs found

Sparse Image Representation with Epitomes

Author: Bach Francis
Benoît Louise
Mairal Julien
Ponce Jean
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 20/06/2011
Field of study

Sparse coding, which is the decomposition of a vector using only a few basis elements, is widely used in machine learning and image processing. The basis set, also called dictionary, is learned to adapt to specific data. This approach has proven to be very effective in many image processing tasks. Traditionally, the dictionary is an unstructured "flat" set of atoms. In this paper, we study structured dictionaries which are obtained from an epitome, or a set of epitomes. The epitome is itself a small image, and the atoms are all the patches of a chosen size inside this image. This considerably reduces the number of parameters to learn and provides sparse image decompositions with shiftinvariance properties. We propose a new formulation and an algorithm for learning the structured dictionaries associated with epitomes, and illustrate their use in image denoising tasks.Comment: Computer Vision and Pattern Recognition, Colorado Springs : United States (2011

arXiv.org e-Print Archive

Crossref

INRIA a CCSD electronic archive server

HAL-Rennes 1

Feature Learning from Spectrograms for Assessment of Personality Traits

Author: Attabi Yazid
Carbonneau Marc-André
Gagnon Ghyslain
Granger Eric
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 04/10/2016
Field of study

Several methods have recently been proposed to analyze speech and automatically infer the personality of the speaker. These methods often rely on prosodic and other hand crafted speech processing features extracted with off-the-shelf toolboxes. To achieve high accuracy, numerous features are typically extracted using complex and highly parameterized algorithms. In this paper, a new method based on feature learning and spectrogram analysis is proposed to simplify the feature extraction process while maintaining a high level of accuracy. The proposed method learns a dictionary of discriminant features from patches extracted in the spectrogram representations of training speech segments. Each speech segment is then encoded using the dictionary, and the resulting feature set is used to perform classification of personality traits. Experiments indicate that the proposed method achieves state-of-the-art results with a significant reduction in complexity when compared to the most recent reference methods. The number of features, and difficulties linked to the feature extraction process are greatly reduced as only one type of descriptors is used, for which the 6 parameters can be tuned automatically. In contrast, the simplest reference method uses 4 types of descriptors to which 6 functionals are applied, resulting in over 20 parameters to be tuned.Comment: 12 pages, 3 figure

arXiv.org e-Print Archive

The Sample Complexity of Dictionary Learning

Author: Aharon
Amaldi
Baraniuk
Bruckstein
Burges
Campadelli
Campbell
Campbell
Candes
Candès
Chapelle
Dehak
Donoho
Eliathamby Ambikairajah
Fauve
Figueiredo
Friedman
Georghiades
Huang
Ji
Jia Min Karen Kua
Julien Epps
Kinnunen
Kreutz-Delgado
Mairal
McLaren
Reynolds
Reynolds
Tao
Tibshirani
Tikhonov
Webb
Wright
Zou
Publication venue: 'Elsevier BV'
Publication date: 24/11/2010
Field of study

A large set of signals can sometimes be described sparsely using a dictionary, that is, every element can be represented as a linear combination of few elements from the dictionary. Algorithms for various signal processing applications, including classification, denoising and signal separation, learn a dictionary from a set of signals to be represented. Can we expect that the representation found by such a dictionary for a previously unseen example from the same source will have L_2 error of the same magnitude as those for the given examples? We assume signals are generated from a fixed distribution, and study this questions from a statistical learning theory perspective. We develop generalization bounds on the quality of the learned dictionary for two types of constraints on the coefficient selection, as measured by the expected L_2 error in representation when the dictionary is used. For the case of l_1 regularized coefficient selection we provide a generalization bound of the order of O(sqrt(np log(m lambda)/m)), where n is the dimension, p is the number of elements in the dictionary, lambda is a bound on the l_1 norm of the coefficient vector and m is the number of samples, which complements existing results. For the case of representing a new signal as a combination of at most k dictionary elements, we provide a bound of the order O(sqrt(np log(m k)/m)) under an assumption on the level of orthogonality of the dictionary (low Babel function). We further show that this assumption holds for most dictionaries in high dimensions in a strong probabilistic sense. Our results further yield fast rates of order 1/m as opposed to 1/sqrt(m) using localized Rademacher complexity. We provide similar results in a general setting using kernels with weak smoothness requirements

arXiv.org e-Print Archive

CiteSeerX

Crossref

Dictionary Snakes

Author: Dahl Anders Bjorholm
Dahl Vedrana Andersen
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2014
Field of study

Crossref

Online Research Database In Technology

Learning Dictionaries of Discriminative Image Patches

Author: Dahl Anders Lindbjerg
Larsen Rasmus
Publication venue: 'British Machine Vision Association and Society for Pattern Recognition'
Publication date: 01/01/2011
Field of study

Crossref

Online Research Database In Technology

Sparse representation for pose invariant face recognition

Author: Wei Shen
Yumin Zeng
Zhi Chen
Publication venue: Faculty of Civil Engineering, Architecture and Geodesy ; Faculty of Electrical Engineering, Mechanical Engineering and Naval Architecture
Publication date: 01/01/2017
Field of study

Face recognition is easily affected by pose angle. In order to improve the obustness to pose angle, we need to solve the pose estimation, face synthesis and recognition problem. Sparse representation can represent a face image with linear combination of atom faces. In this paper, we construct different pose dictionaries using face images captured under the same pose angle to estimate pose angle and synthesize front face images for recognition. Experimental results show that sparse representation can estimate pose angle accurately, synthesize near frontal faces very well and significantly improve the recognition rate for large pose angles

HRČAK - Portal of Croatian Scientific and Professional Journals

Hrčak - Portal of scientific journals of Croatia