
    Sparse Modeling for Image and Vision Processing

    In recent years, a large amount of multi-disciplinary research has been conducted on sparse models and their applications. In statistics and machine learning, the sparsity principle is used to perform model selection---that is, automatically selecting a simple model among a large collection of them. In signal processing, sparse coding consists of representing data with linear combinations of a few dictionary elements. Subsequently, the corresponding tools have been widely adopted by several scientific communities such as neuroscience, bioinformatics, and computer vision. The goal of this monograph is to offer a self-contained view of sparse modeling for visual recognition and image processing. More specifically, we focus on applications where the dictionary is learned and adapted to data, yielding a compact representation that has been successful in various contexts. Comment: 205 pages, to appear in Foundations and Trends in Computer Graphics and Vision
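To make the sparse coding idea above concrete, here is a minimal sketch, not taken from the monograph: it learns a small dictionary from image patches with scikit-learn and encodes each patch as a sparse combination of dictionary atoms. The dataset, patch size, and hyperparameters are illustrative choices.

```python
# Minimal sketch of dictionary learning and sparse coding; all settings are illustrative.
import numpy as np
from sklearn.datasets import load_sample_image
from sklearn.feature_extraction.image import extract_patches_2d
from sklearn.decomposition import MiniBatchDictionaryLearning

# Extract small grayscale patches from a bundled sample image.
image = load_sample_image("china.jpg").mean(axis=2) / 255.0
patches = extract_patches_2d(image, (8, 8), max_patches=5000, random_state=0)
X = patches.reshape(len(patches), -1)
X -= X.mean(axis=1, keepdims=True)   # remove each patch's DC component

# Learn a dictionary D and codes A, roughly minimising ||x - D a||^2 + alpha * ||a||_1.
dico = MiniBatchDictionaryLearning(n_components=128, alpha=1.0,
                                   batch_size=256, random_state=0)
codes = dico.fit(X).transform(X)     # each row is a sparse coefficient vector

print("dictionary shape:", dico.components_.shape)                  # (128, 64)
print("avg non-zeros per patch:", np.count_nonzero(codes, axis=1).mean())
```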

    Manifold Learning Approaches to Compressing Latent Spaces of Unsupervised Feature Hierarchies

    Field robots encounter dynamic unstructured environments containing a vast array of unique objects. In order to make sense of the world in which they are placed, they collect large quantities of unlabelled data with a variety of sensors. Producing robust and reliable applications depends entirely on the ability of the robot to understand the unlabelled data it obtains. Deep Learning techniques have had a high level of success in learning powerful unsupervised representations for a variety of discriminative and generative models. Applying these techniques to problems encountered in field robotics remains a challenging endeavour. Modern Deep Learning methods are typically trained with a substantial labelled dataset, while datasets produced in a field robotics context contain limited labelled training data. The primary motivation for this thesis stems from the problem of applying large scale Deep Learning models to field robotics datasets that are label poor. While the lack of labelled ground truth data drives the desire for unsupervised methods, the need for improved model scaling is driven by two factors: performance and computational requirements. When utilising unsupervised layer outputs as representations for classification, the classification performance increases with layer size. Scaling up models with multiple large layers of features is problematic, as the size of each subsequent hidden layer scales with the size of the previous layer. This quadratic scaling, and the associated time required to train such networks, has prevented the adoption of large Deep Learning models beyond cluster computing.
The contributions in this thesis are developed from the observation that parameters or filter elements learnt in Deep Learning systems are typically highly structured and contain related elements. Firstly, the structure of unsupervised filters is utilised to construct a mapping from the high dimensional filter space to a low dimensional manifold. This creates a significantly smaller representation for subsequent feature learning. This mapping, and its effect on the resulting encodings, highlights the need for the ability to learn highly overcomplete sets of convolutional features. Driven by this need, the unsupervised pretraining of Deep Convolutional Networks is developed to include a number of modern training and regularisation methods. These pretrained models are then used to provide initialisations for supervised convolutional models trained on small quantities of labelled data. By utilising pretraining, a significant increase in classification performance on a number of publicly available datasets is achieved. In order to apply these techniques to outdoor 3D Laser Illuminated Detection and Ranging (LiDAR) data, we develop a set of resampling techniques to provide uniform input to Deep Learning models. The features learnt in these systems outperform the high-effort hand-engineered features developed specifically for 3D data. The representation of a given signal is then reinterpreted as a combination of modes that exist on the learnt low dimensional filter manifold. From this, we develop an encoding technique that allows the high dimensional layer output to be represented as a combination of low dimensional components. This allows the growth of subsequent layers to depend only on the intrinsic dimensionality of the filter manifold and not on the number of elements contained in the previous layer.
Finally, the resulting unsupervised convolutional model, the encoding frameworks and the embedding methodology are used to produce a new unsupervised learning strategy that is able to encode images in terms of overcomplete filter spaces without producing an explosion in the size of the intermediate parameter spaces. This model produces classification results on par with state-of-the-art models, yet requires significantly fewer computational resources and is suitable for use in the constrained computation environment of a field robot.
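The thesis's own filter-manifold construction is not reproduced here; the sketch below only illustrates the general principle of compressing a structured bank of learned filters into low-dimensional coordinates, with a randomly generated (but deliberately low-rank) filter bank standing in for real learned weights and plain PCA substituting for the manifold learning step.

```python
# Sketch of filter-space compression: each filter is replaced by its coordinates
# in a low-dimensional basis. The "filters" here are synthetic stand-ins built
# with hidden low-dimensional structure, mimicking the observation that learned
# filters are typically highly structured.
import numpy as np
from sklearn.decomposition import PCA

rng = np.random.default_rng(0)
basis = rng.standard_normal((10, 3 * 7 * 7))                     # hidden structure
filters = (rng.standard_normal((256, 10)) @ basis).reshape(256, 3, 7, 7)
flat = filters.reshape(len(filters), -1)                         # one row per filter

pca = PCA(n_components=10)                                       # assumed intrinsic dimensionality
coords = pca.fit_transform(flat)                                 # low-dimensional filter coordinates
approx = pca.inverse_transform(coords).reshape(filters.shape)

rel_err = np.linalg.norm(filters - approx) / np.linalg.norm(filters)
print("parameters per filter:", flat.shape[1], "->", coords.shape[1])
print("relative reconstruction error:", round(float(rel_err), 4))
```

Because the stand-in filters really do lie near a 10-dimensional subspace, the reconstruction error is near zero while the per-filter parameter count drops from 147 to 10, which is the scaling benefit the thesis exploits.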

    Super-resolution: A comprehensive survey


    Investigation of new learning methods for visual recognition

    Visual recognition is one of the most difficult and prevailing problems in computer vision and pattern recognition due to the challenges in understanding the semantics and contents of digital images. Two major components of a visual recognition system are discriminative feature representation and efficient and accurate pattern classification. This dissertation therefore focuses on developing new learning methods for visual recognition. Based on the conventional sparse representation, which has shown its robustness for visual recognition problems, a series of new methods is proposed. Specifically, first, a new locally linear K nearest neighbor method, or LLK method, is presented. The LLK method derives a new representation, which is an approximation to the ideal representation, by optimizing an objective function based on a host of criteria for sparsity, locality, and reconstruction. The novel representation is further processed by two new classifiers, namely, an LLK based classifier (LLKc) and a locally linear nearest mean based classifier (LLNc), for visual recognition. The proposed classifiers are shown to connect to the Bayes decision rule for minimum error. Second, a new generative and discriminative sparse representation (GDSR) method is proposed by taking advantage of both a coarse modeling of the generative information and a modeling of the discriminative information. The proposed GDSR method integrates two new criteria, namely, a discriminative criterion and a generative criterion, into the conventional sparse representation criterion. A new generative and discriminative sparse representation based classification (GDSRc) method is then presented based on the derived new representation. Finally, a new Score space based multiple Metric Learning (SML) method is presented for a challenging visual recognition application, namely, recognizing kinship relations, or kinship verification. The proposed SML method, which goes beyond conventional Mahalanobis distance metric learning, not only learns the distance metric but also models the generative process of features by taking advantage of the score space. The SML method is optimized by solving a constrained, non-negative, and weighted variant of the sparse representation problem. To assess the feasibility of the proposed new learning methods, they are evaluated on several visual recognition tasks, such as face recognition, scene recognition, object recognition, computational fine art analysis, action recognition, fine-grained recognition, and kinship verification. The experimental results show that the proposed new learning methods achieve better performance than other popular methods.
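The LLK, GDSR, and SML methods themselves are not reproduced here; the following sketch only illustrates the conventional sparse-representation-based classification baseline that they build on, using synthetic data: a test sample is coded over the training samples with an l1 penalty and assigned to the class with the smallest class-wise reconstruction residual.

```python
# Sketch of conventional sparse-representation-based classification (SRC);
# the data and hyperparameters are synthetic and illustrative.
import numpy as np
from sklearn.linear_model import Lasso

def src_predict(X_train, y_train, x_test, alpha=0.01):
    """Code x_test over the training samples, then pick the class whose atoms
    give the smallest reconstruction residual."""
    coder = Lasso(alpha=alpha, fit_intercept=False, max_iter=10000)
    coder.fit(X_train.T, x_test)                  # columns of X_train.T are training samples
    coefs = coder.coef_
    residuals = {}
    for c in np.unique(y_train):
        mask = (y_train == c)
        recon = X_train[mask].T @ coefs[mask]     # reconstruction from class-c atoms only
        residuals[c] = np.linalg.norm(x_test - recon)
    return min(residuals, key=residuals.get)

# Tiny synthetic example: two Gaussian classes in 20 dimensions.
rng = np.random.default_rng(0)
X_train = np.vstack([rng.normal(0, 1, (30, 20)), rng.normal(3, 1, (30, 20))])
y_train = np.array([0] * 30 + [1] * 30)
x_test = rng.normal(3, 1, 20)                     # drawn from the second class
print("predicted class:", src_predict(X_train, y_train, x_test))
```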

    Trends in Mathematical Imaging and Surface Processing

    Motivated both by industrial applications and by the challenge of new problems, there has been increasing interest in the field of image and surface processing in recent years. It has become clear that even though the application areas differ significantly, the methodological overlap is enormous. Although contributions to the field come from almost every discipline in mathematics, a major role is played by partial differential equations, and in particular by geometric and variational modeling and their numerical counterparts. The aim of the workshop was to gather a group of leading experts from mathematics, engineering and computer graphics to cover the main developments.

    Motion capture data processing, retrieval and recognition.

    Character animation plays an essential role in feature film and computer games. Manually creating character animation is both tedious and inefficient, so motion capture (MoCap) techniques have been developed and have become the most popular method for creating realistic character animation products. Commercial MoCap systems are expensive, and the capturing process itself usually requires an indoor studio environment. Procedural animation creation often lacks extensive user control during the generation process. Therefore, efficiently and effectively reusing MoCap data can bring significant benefits, which has motivated wider research in terms of machine learning based MoCap data processing. A typical workflow of MoCap data reuse can be divided into three stages: data capture, data management and data reuse. There are still many challenges at each stage. For instance, data capture and management often suffer from data quality problems. Efficient and effective retrieval is also demanding due to the large amount of data involved. In addition, classification and understanding of actions are the fundamental basis of data reuse.
This thesis proposes to use machine learning on MoCap data for reuse purposes, and a framework for motion capture data processing is designed. The modular design of this framework enables motion data refinement, retrieval and recognition. The first part of this thesis introduces various methods used in existing motion capture processing approaches in the literature and gives a brief introduction to the relevant machine learning methods used in this framework. In general, the frameworks related to refinement, retrieval and recognition are discussed. A motion refinement algorithm based on dictionary learning is then presented, where kinematic structural and temporal information is exploited. The designed optimization method and data preprocessing technique ensure a smooth property for the recovered result. After that, a motion refinement algorithm based on matrix completion is presented, where the low-rank property and spatio-temporal information are exploited. Such a model does not require data to be prepared for training. The designed optimization method outperforms existing approaches in regard to both effectiveness and efficiency. A motion retrieval method based on multi-view feature selection is also proposed, where the intrinsic relations between visual words in each motion feature subspace are discovered as a means of improving the retrieval performance. A provisional trace-ratio objective function and an iterative optimization method are also included. A non-negative matrix factorization based motion data clustering method is proposed for recognition purposes, which aims to deal with large scale unsupervised/semi-supervised problems. In addition, deep learning models are used for motion data recognition, e.g. 2D gait recognition and 3D MoCap recognition.
To sum up, research on motion data refinement, retrieval and recognition is presented in this thesis with the aim of tackling the major challenges in motion reuse. The proposed motion refinement methods aim to provide high quality clean motion data for downstream applications. The designed multi-view feature selection algorithm aims to improve motion retrieval performance. The proposed motion recognition methods are equally essential for motion understanding. A collection of publications by the author of this thesis is noted in the publications section.
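The thesis's refinement algorithms are not reproduced here; the sketch below only illustrates the low-rank matrix completion idea that the matrix-completion-based refinement builds on, using a simple soft-impute style iteration (singular value soft-thresholding) on a synthetic motion-like matrix with missing entries.

```python
# Sketch of low-rank matrix completion via a soft-impute style iteration;
# the data are synthetic and the threshold/iteration count are illustrative.
import numpy as np

rng = np.random.default_rng(0)

# Synthetic low-rank "motion" data: frames x joint coordinates.
frames, dims, rank = 200, 60, 5
M = rng.standard_normal((frames, rank)) @ rng.standard_normal((rank, dims))

mask = rng.random(M.shape) > 0.3              # ~30% of entries are missing
observed = np.where(mask, M, 0.0)

def soft_impute(observed, mask, tau=5.0, iters=100):
    """Repeatedly fill missing entries with a singular-value-thresholded estimate."""
    X = observed.copy()
    for _ in range(iters):
        U, s, Vt = np.linalg.svd(X, full_matrices=False)
        s = np.maximum(s - tau, 0.0)             # soft-threshold the singular values
        low_rank = (U * s) @ Vt
        X = np.where(mask, observed, low_rank)   # keep observed entries, fill the rest
    return np.where(mask, observed, low_rank)

recovered = soft_impute(observed, mask)
err = np.linalg.norm((recovered - M)[~mask]) / np.linalg.norm(M[~mask])
print("relative error on the missing entries:", round(float(err), 4))
```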