67,207 research outputs found
Nuclear Norm based Matrix Regression with Applications to Face Recognition with Occlusion and Illumination Changes
Recently regression analysis becomes a popular tool for face recognition. The
existing regression methods all use the one-dimensional pixel-based error
model, which characterizes the representation error pixel by pixel individually
and thus neglects the whole structure of the error image. We observe that
occlusion and illumination changes generally lead to a low-rank error image. To
make use of this low-rank structural information, this paper presents a
two-dimensional image matrix based error model, i.e. matrix regression, for
face representation and classification. Our model uses the minimal nuclear norm
of representation error image as a criterion, and the alternating direction
method of multipliers method to calculate the regression coefficients. Compared
with the current regression methods, the proposed Nuclear Norm based Matrix
Regression (NMR) model is more robust for alleviating the effect of
illumination, and more intuitive and powerful for removing the structural noise
caused by occlusion. We experiment using four popular face image databases, the
Extended Yale B database, the AR database, the Multi-PIE and the FRGC database.
Experimental results demonstrate the performance advantage of NMR over the
state-of-the-art regression based face recognition methods.Comment: 30 page
A Kernel Classification Framework for Metric Learning
Learning a distance metric from the given training samples plays a crucial
role in many machine learning tasks, and various models and optimization
algorithms have been proposed in the past decade. In this paper, we generalize
several state-of-the-art metric learning methods, such as large margin nearest
neighbor (LMNN) and information theoretic metric learning (ITML), into a kernel
classification framework. First, doublets and triplets are constructed from the
training samples, and a family of degree-2 polynomial kernel functions are
proposed for pairs of doublets or triplets. Then, a kernel classification
framework is established, which can not only generalize many popular metric
learning methods such as LMNN and ITML, but also suggest new metric learning
methods, which can be efficiently implemented, interestingly, by using the
standard support vector machine (SVM) solvers. Two novel metric learning
methods, namely doublet-SVM and triplet-SVM, are then developed under the
proposed framework. Experimental results show that doublet-SVM and triplet-SVM
achieve competitive classification accuracies with state-of-the-art metric
learning methods such as ITML and LMNN but with significantly less training
time.Comment: 11 pages, 7 figure
Real Time Surveillance for Low Resolution and Limited-Data Scenarios: An Image Set Classification Approach
This paper proposes a novel image set classification technique based on the
concept of linear regression. Unlike most other approaches, the proposed
technique does not involve any training or feature extraction. The gallery
image sets are represented as subspaces in a high dimensional space. Class
specific gallery subspaces are used to estimate regression models for each
image of the test image set. Images of the test set are then projected on the
gallery subspaces. Residuals, calculated using the Euclidean distance between
the original and the projected test images, are used as the distance metric.
Three different strategies are devised to decide on the final class of the test
image set. We performed extensive evaluations of the proposed technique under
the challenges of low resolution, noise and less gallery data for the tasks of
surveillance, video-based face recognition and object recognition. Experiments
show that the proposed technique achieves a better classification accuracy and
a faster execution time compared to existing techniques especially under the
challenging conditions of low resolution and small gallery and test data
Population-Guided Large Margin Classifier for High-Dimension Low -Sample-Size Problems
Various applications in different fields, such as gene expression analysis or
computer vision, suffer from data sets with high-dimensional low-sample-size
(HDLSS), which has posed significant challenges for standard statistical and
modern machine learning methods. In this paper, we propose a novel linear
binary classifier, denoted by population-guided large margin classifier
(PGLMC), which is applicable to any sorts of data, including HDLSS. PGLMC is
conceived with a projecting direction w given by the comprehensive
consideration of local structural information of the hyperplane and the
statistics of the training samples. Our proposed model has several advantages
compared to those widely used approaches. First, it is not sensitive to the
intercept term b. Second, it operates well with imbalanced data. Third, it is
relatively simple to be implemented based on Quadratic Programming. Fourth, it
is robust to the model specification for various real applications. The
theoretical properties of PGLMC are proven. We conduct a series of evaluations
on two simulated and six real-world benchmark data sets, including DNA
classification, digit recognition, medical image analysis, and face
recognition. PGLMC outperforms the state-of-the-art classification methods in
most cases, or at least obtains comparable results.Comment: 48 Page
Facial expression recognition based on local region specific features and support vector machines
Facial expressions are one of the most powerful, natural and immediate means
for human being to communicate their emotions and intensions. Recognition of
facial expression has many applications including human-computer interaction,
cognitive science, human emotion analysis, personality development etc. In this
paper, we propose a new method for the recognition of facial expressions from
single image frame that uses combination of appearance and geometric features
with support vector machines classification. In general, appearance features
for the recognition of facial expressions are computed by dividing face region
into regular grid (holistic representation). But, in this paper we extracted
region specific appearance features by dividing the whole face region into
domain specific local regions. Geometric features are also extracted from
corresponding domain specific regions. In addition, important local regions are
determined by using incremental search approach which results in the reduction
of feature dimension and improvement in recognition accuracy. The results of
facial expressions recognition using features from domain specific regions are
also compared with the results obtained using holistic representation. The
performance of the proposed facial expression recognition system has been
validated on publicly available extended Cohn-Kanade (CK+) facial expression
data sets.Comment: Facial expressions, Local representation, Appearance features,
Geometric features, Support vector machine
Learning with Privileged Information for Multi-Label Classification
In this paper, we propose a novel approach for learning multi-label
classifiers with the help of privileged information. Specifically, we use
similarity constraints to capture the relationship between available
information and privileged information, and use ranking constraints to capture
the dependencies among multiple labels. By integrating similarity constraints
and ranking constraints into the learning process of classifiers, the
privileged information and the dependencies among multiple labels are exploited
to construct better classifiers during training. A maximum margin classifier is
adopted, and an efficient learning algorithm of the proposed method is also
developed. We evaluate the proposed method on two applications: multiple object
recognition from images with the help of implicit information about object
importance conveyed by the list of manually annotated image tags; and multiple
facial action unit detection from low-resolution images augmented by
high-resolution images. Experimental results demonstrate that the proposed
method can effectively take full advantage of privileged information and
dependencies among multiple labels for better object recognition and better
facial action unit detection
Fast Approximate L_infty Minimization: Speeding Up Robust Regression
Minimization of the norm, which can be viewed as approximately
solving the non-convex least median estimation problem, is a powerful method
for outlier removal and hence robust regression. However, current techniques
for solving the problem at the heart of norm minimization are slow,
and therefore cannot scale to large problems. A new method for the minimization
of the norm is presented here, which provides a speedup of multiple
orders of magnitude for data with high dimension. This method, termed Fast
Minimization, allows robust regression to be applied to a class of
problems which were previously inaccessible. It is shown how the
norm minimization problem can be broken up into smaller sub-problems, which can
then be solved extremely efficiently. Experimental results demonstrate the
radical reduction in computation time, along with robustness against large
numbers of outliers in a few model-fitting problems.Comment: 11 page
A survey of sparse representation: algorithms and applications
Sparse representation has attracted much attention from researchers in fields
of signal processing, image processing, computer vision and pattern
recognition. Sparse representation also has a good reputation in both
theoretical research and practical applications. Many different algorithms have
been proposed for sparse representation. The main purpose of this article is to
provide a comprehensive study and an updated review on sparse representation
and to supply a guidance for researchers. The taxonomy of sparse representation
methods can be studied from various viewpoints. For example, in terms of
different norm minimizations used in sparsity constraints, the methods can be
roughly categorized into five groups: sparse representation with -norm
minimization, sparse representation with -norm (0p1) minimization,
sparse representation with -norm minimization and sparse representation
with -norm minimization. In this paper, a comprehensive overview of
sparse representation is provided. The available sparse representation
algorithms can also be empirically categorized into four groups: greedy
strategy approximation, constrained optimization, proximity algorithm-based
optimization, and homotopy algorithm-based sparse representation. The
rationales of different algorithms in each category are analyzed and a wide
range of sparse representation applications are summarized, which could
sufficiently reveal the potential nature of the sparse representation theory.
Specifically, an experimentally comparative study of these sparse
representation algorithms was presented. The Matlab code used in this paper can
be available at: http://www.yongxu.org/lunwen.html.Comment: Published on IEEE Access, Vol. 3, pp. 490-530, 201
Face Recognition: A Novel Multi-Level Taxonomy based Survey
In a world where security issues have been gaining growing importance, face
recognition systems have attracted increasing attention in multiple application
areas, ranging from forensics and surveillance to commerce and entertainment.
To help understanding the landscape and abstraction levels relevant for face
recognition systems, face recognition taxonomies allow a deeper dissection and
comparison of the existing solutions. This paper proposes a new, more
encompassing and richer multi-level face recognition taxonomy, facilitating the
organization and categorization of available and emerging face recognition
solutions; this taxonomy may also guide researchers in the development of more
efficient face recognition solutions. The proposed multi-level taxonomy
considers levels related to the face structure, feature support and feature
extraction approach. Following the proposed taxonomy, a comprehensive survey of
representative face recognition solutions is presented. The paper concludes
with a discussion on current algorithmic and application related challenges
which may define future research directions for face recognition.Comment: This paper is a preprint of a paper submitted to IET Biometrics. If
accepted, the copy of record will be available at the IET Digital Librar
Evaluation of the Spatio-Temporal features and GAN for Micro-expression Recognition System
Owing to the development and advancement of artificial intelligence, numerous
works were established in the human facial expression recognition system.
Meanwhile, the detection and classification of micro-expressions are attracting
attentions from various research communities in the recent few years. In this
paper, we first review the processes of a conventional optical-flow-based
recognition system, which comprised of facial landmarks annotations, optical
flow guided images computation, features extraction and emotion class
categorization. Secondly, a few approaches have been proposed to improve the
feature extraction part, such as exploiting GAN to generate more image samples.
Particularly, several variations of optical flow are computed in order to
generate optimal images to lead to high recognition accuracy. Next, GAN, a
combination of Generator and Discriminator, is utilized to generate new "fake"
images to increase the sample size. Thirdly, a modified state-of-the-art
Convolutional neural networks is proposed. To verify the effectiveness of the
the proposed method, the results are evaluated on spontaneous micro-expression
databases, namely SMIC, CASME II and SAMM. Both the F1-score and accuracy
performance metrics are reported in this paper.Comment: 15 pages, 16 figures, 6 table
- …