9,906 research outputs found
A Unified Weight Learning and Low-Rank Regression Model for Robust Complex Error Modeling
One of the most important problems in regression-based error model is
modeling the complex representation error caused by various corruptions and
environment changes in images. For example, in robust face recognition, images
are often affected by varying types and levels of corruptions, such as random
pixel corruptions, block occlusions, or disguises. However, existing works are
not robust enough to solve this problem due to they cannot model the complex
corrupted errors very well. In this paper, we address this problem by a unified
sparse weight learning and low-rank approximation regression model, which
enables the random noises and contiguous occlusions in images to be treated
simultaneously. For the random noise, we define a generalized correntropy (GC)
function to match the error distribution. For the structured error caused by
occlusions or disguises, we propose a GC function based rank approximation to
measure the rank of error matrices. Since the proposed objective function is
non-convex, an effective iterative optimization algorithm is developed to
achieve the optimal weight learning and low-rank approximation. Extensive
experimental results on three public face databases show that the proposed
model can fit the error distribution and structure very well, thus obtain
better recognition accuracies in comparison with the existing methods
Joint Label Prediction based Semi-Supervised Adaptive Concept Factorization for Robust Data Representation
Constrained Concept Factorization (CCF) yields the enhanced representation
ability over CF by incorporating label information as additional constraints,
but it cannot classify and group unlabeled data appropriately. Minimizing the
difference between the original data and its reconstruction directly can enable
CCF to model a small noisy perturbation, but is not robust to gross sparse
errors. Besides, CCF cannot preserve the manifold structures in new
representation space explicitly, especially in an adaptive manner. In this
paper, we propose a joint label prediction based Robust Semi-Supervised
Adaptive Concept Factorization (RS2ACF) framework. To obtain robust
representation, RS2ACF relaxes the factorization to make it simultaneously
stable to small entrywise noise and robust to sparse errors. To enrich prior
knowledge to enhance the discrimination, RS2ACF clearly uses class information
of labeled data and more importantly propagates it to unlabeled data by jointly
learning an explicit label indicator for unlabeled data. By the label
indicator, RS2ACF can ensure the unlabeled data of the same predicted label to
be mapped into the same class in feature space. Besides, RS2ACF incorporates
the joint neighborhood reconstruction error over the new representations and
predicted labels of both labeled and unlabeled data, so the manifold structures
can be preserved explicitly and adaptively in the representation space and
label space at the same time. Owing to the adaptive manner, the tricky process
of determining the neighborhood size or kernel width can be avoided. Extensive
results on public databases verify that our RS2ACF can deliver state-of-the-art
data representation, compared with other related methods.Comment: Accepted at IEEE TKD
Low-Rank Modeling and Its Applications in Image Analysis
Low-rank modeling generally refers to a class of methods that solve problems
by representing variables of interest as low-rank matrices. It has achieved
great success in various fields including computer vision, data mining, signal
processing and bioinformatics. Recently, much progress has been made in
theories, algorithms and applications of low-rank modeling, such as exact
low-rank matrix recovery via convex programming and matrix completion applied
to collaborative filtering. These advances have brought more and more
attentions to this topic. In this paper, we review the recent advance of
low-rank modeling, the state-of-the-art algorithms, and related applications in
image analysis. We first give an overview to the concept of low-rank modeling
and challenging problems in this area. Then, we summarize the models and
algorithms for low-rank matrix recovery and illustrate their advantages and
limitations with numerical experiments. Next, we introduce a few applications
of low-rank modeling in the context of image analysis. Finally, we conclude
this paper with some discussions.Comment: To appear in ACM Computing Survey
A survey of sparse representation: algorithms and applications
Sparse representation has attracted much attention from researchers in fields
of signal processing, image processing, computer vision and pattern
recognition. Sparse representation also has a good reputation in both
theoretical research and practical applications. Many different algorithms have
been proposed for sparse representation. The main purpose of this article is to
provide a comprehensive study and an updated review on sparse representation
and to supply a guidance for researchers. The taxonomy of sparse representation
methods can be studied from various viewpoints. For example, in terms of
different norm minimizations used in sparsity constraints, the methods can be
roughly categorized into five groups: sparse representation with -norm
minimization, sparse representation with -norm (0p1) minimization,
sparse representation with -norm minimization and sparse representation
with -norm minimization. In this paper, a comprehensive overview of
sparse representation is provided. The available sparse representation
algorithms can also be empirically categorized into four groups: greedy
strategy approximation, constrained optimization, proximity algorithm-based
optimization, and homotopy algorithm-based sparse representation. The
rationales of different algorithms in each category are analyzed and a wide
range of sparse representation applications are summarized, which could
sufficiently reveal the potential nature of the sparse representation theory.
Specifically, an experimentally comparative study of these sparse
representation algorithms was presented. The Matlab code used in this paper can
be available at: http://www.yongxu.org/lunwen.html.Comment: Published on IEEE Access, Vol. 3, pp. 490-530, 201
Low-rank Kernel Learning for Graph-based Clustering
Constructing the adjacency graph is fundamental to graph-based clustering.
Graph learning in kernel space has shown impressive performance on a number of
benchmark data sets. However, its performance is largely determined by the
chosen kernel matrix. To address this issue, the previous multiple kernel
learning algorithm has been applied to learn an optimal kernel from a group of
predefined kernels. This approach might be sensitive to noise and limits the
representation ability of the consensus kernel. In contrast to existing
methods, we propose to learn a low-rank kernel matrix which exploits the
similarity nature of the kernel matrix and seeks an optimal kernel from the
neighborhood of candidate kernels. By formulating graph construction and kernel
learning in a unified framework, the graph and consensus kernel can be
iteratively enhanced by each other. Extensive experimental results validate the
efficacy of the proposed method
cvpaper.challenge in 2016: Futuristic Computer Vision through 1,600 Papers Survey
The paper gives futuristic challenges disscussed in the cvpaper.challenge. In
2015 and 2016, we thoroughly study 1,600+ papers in several
conferences/journals such as CVPR/ICCV/ECCV/NIPS/PAMI/IJCV
Dual-reference Face Retrieval
Face retrieval has received much attention over the past few decades, and
many efforts have been made in retrieving face images against pose,
illumination, and expression variations. However, the conventional works fail
to meet the requirements of a potential and novel task --- retrieving a
person's face image at a specific age, especially when the specific 'age' is
not given as a numeral, i.e. 'retrieving someone's image at the similar age
period shown by another person's image'. To tackle this problem, we propose a
dual reference face retrieval framework in this paper, where the system takes
two inputs: an identity reference image which indicates the target identity and
an age reference image which reflects the target age. In our framework, the raw
images are first projected on a joint manifold, which preserves both the age
and identity locality. Then two similarity metrics of age and identity are
exploited and optimized by utilizing our proposed quartet-based model. The
experiments show promising results, outperforming hierarchical methods.Comment: Accepted at AAAI 201
Multi-modal Face Pose Estimation with Multi-task Manifold Deep Learning
Human face pose estimation aims at estimating the gazing direction or head
postures with 2D images. It gives some very important information such as
communicative gestures, saliency detection and so on, which attracts plenty of
attention recently. However, it is challenging because of complex background,
various orientations and face appearance visibility. Therefore, a descriptive
representation of face images and mapping it to poses are critical. In this
paper, we make use of multi-modal data and propose a novel face pose estimation
method that uses a novel deep learning framework named Multi-task Manifold Deep
Learning . It is based on feature extraction with improved deep neural
networks and multi-modal mapping relationship with multi-task learning. In the
proposed deep learning based framework, Manifold Regularized Convolutional
Layers (MRCL) improve traditional convolutional layers by learning the
relationship among outputs of neurons. Besides, in the proposed mapping
relationship learning method, different modals of face representations are
naturally combined to learn the mapping function from face images to poses. In
this way, the computed mapping model with multiple tasks is improved.
Experimental results on three challenging benchmark datasets DPOSE, HPID and
BKHPD demonstrate the outstanding performance of
Representation Learning by Rotating Your Faces
The large pose discrepancy between two face images is one of the fundamental
challenges in automatic face recognition. Conventional approaches to
pose-invariant face recognition either perform face frontalization on, or learn
a pose-invariant representation from, a non-frontal face image. We argue that
it is more desirable to perform both tasks jointly to allow them to leverage
each other. To this end, this paper proposes a Disentangled Representation
learning-Generative Adversarial Network (DR-GAN) with three distinct novelties.
First, the encoder-decoder structure of the generator enables DR-GAN to learn a
representation that is both generative and discriminative, which can be used
for face image synthesis and pose-invariant face recognition. Second, this
representation is explicitly disentangled from other face variations such as
pose, through the pose code provided to the decoder and pose estimation in the
discriminator. Third, DR-GAN can take one or multiple images as the input, and
generate one unified identity representation along with an arbitrary number of
synthetic face images. Extensive quantitative and qualitative evaluation on a
number of controlled and in-the-wild databases demonstrate the superiority of
DR-GAN over the state of the art in both learning representations and rotating
large-pose face images.Comment: IEEE Transactions on Pattern Analysis and Machine Intelligence
(TPAMI
Robust and Low-Rank Representation for Fast Face Identification with Occlusions
In this paper we propose an iterative method to address the face
identification problem with block occlusions. Our approach utilizes a robust
representation based on two characteristics in order to model contiguous errors
(e.g., block occlusion) effectively. The first fits to the errors a
distribution described by a tailored loss function. The second describes the
error image as having a specific structure (resulting in low-rank in comparison
to image size). We will show that this joint characterization is effective for
describing errors with spatial continuity. Our approach is computationally
efficient due to the utilization of the Alternating Direction Method of
Multipliers (ADMM). A special case of our fast iterative algorithm leads to the
robust representation method which is normally used to handle non-contiguous
errors (e.g., pixel corruption). Extensive results on representative face
databases (in constrained and unconstrained environments) document the
effectiveness of our method over existing robust representation methods with
respect to both identification rates and computational time.
Code is available at Github, where you can find implementations of the
F-LR-IRNNLS and F-IRNNLS (fast version of the RRC) :
https://github.com/miliadis/FIRCComment: IEEE Transactions on Image Processing (TIP), 201
- …