1,459 research outputs found
HEp-2 Cell Classification: The Role of Gaussian Scale Space Theory as A Pre-processing Approach
\textit{Indirect Immunofluorescence Imaging of Human Epithelial Type 2}
(HEp-2) cells is an effective way to identify the presence of Anti-Nuclear
Antibody (ANA). Most existing works on HEp-2 cell classification mainly focus
on feature extraction, feature encoding and classifier design. Very few efforts
have been devoted to study the importance of the pre-processing techniques. In
this paper, we analyze the importance of the pre-processing, and investigate
the role of Gaussian Scale Space (GSS) theory as a pre-processing approach for
the HEp-2 cell classification task. We validate the GSS pre-processing under
the Local Binary Pattern (LBP) and the Bag-of-Words (BoW) frameworks. Under the
BoW framework, the introduced pre-processing approach, using only one Local
Orientation Adaptive Descriptor (LOAD), achieved superior performance on the
Executable Thematic on Pattern Recognition Techniques for Indirect
Immunofluorescence (ET-PRT-IIF) image analysis. Our system, using only one
feature, outperformed the winner of the ICPR 2014 contest that combined four
types of features. Meanwhile, the proposed pre-processing method is not
restricted to this work; it can be generalized to many existing works.Comment: 9 pages, 6 figure
Probing the Intra-Component Correlations within Fisher Vector for Material Classification
Fisher vector (FV) has become a popular image representation. One notable
underlying assumption of the FV framework is that local descriptors are well
decorrelated within each cluster so that the covariance matrix for each
Gaussian can be simplified to be diagonal. Though the FV usually relies on the
Principal Component Analysis (PCA) to decorrelate local features, the PCA is
applied to the entire training data and hence it only diagonalizes the
\textit{universal} covariance matrix, rather than those w.r.t. the local
components. As a result, the local decorrelation assumption is usually not
supported in practice.
To relax this assumption, this paper proposes a completed model of the Fisher
vector, which is termed as the Completed Fisher vector (CFV). The CFV is a more
general framework of the FV, since it encodes not only the variances but also
the correlations of the whitened local descriptors. The CFV thus leads to
improved discriminative power. We take the task of material categorization as
an example and experimentally show that: 1) the CFV outperforms the FV under
all parameter settings; 2) the CFV is robust to the changes in the number of
components in the mixture; 3) even with a relatively small visual vocabulary
the CFV still works well on two challenging datasets.Comment: It is manuscript submitted to Neurocomputing on the end of April,
2015 (!). One year past but no review comments we received yet
Recurrent Convolutional Neural Network Regression for Continuous Pain Intensity Estimation in Video
Automatic pain intensity estimation possesses a significant position in
healthcare and medical field. Traditional static methods prefer to extract
features from frames separately in a video, which would result in unstable
changes and peaks among adjacent frames. To overcome this problem, we propose a
real-time regression framework based on the recurrent convolutional neural
network for automatic frame-level pain intensity estimation. Given vector
sequences of AAM-warped facial images, we used a sliding-window strategy to
obtain fixed-length input samples for the recurrent network. We then carefully
design the architecture of the recurrent network to output continuous-valued
pain intensity. The proposed end-to-end pain intensity regression framework can
predict the pain intensity of each frame by considering a sufficiently large
historical frames while limiting the scale of the parameters within the model.
Our method achieves promising results regarding both accuracy and running speed
on the published UNBC-McMaster Shoulder Pain Expression Archive Database.Comment: This paper is the pre-print technical report of the paper accepted by
the IEEE CVPR Workshop of Affect "in-the-wild". The final version will be
available after the worksho
Micro-Expression Spotting: A Benchmark
Micro-expressions are rapid and involuntary facial expressions, which
indicate the suppressed or concealed emotions. Recently, the research on
automatic micro-expression (ME) spotting obtains increasing attention. ME
spotting is a crucial step prior to further ME analysis tasks. The spotting
results can be used as important cues to assist many other human-oriented tasks
and thus have many potential applications. In this paper, by investigating
existing ME spotting methods, we recognize the immediacy of standardizing the
performance evaluation of micro-expression spotting methods. To this end, we
construct a micro-expression spotting benchmark (MESB). Firstly, we set up a
sliding window based multi-scale evaluation framework. Secondly, we introduce a
series of protocols. Thirdly, we also provide baseline results of popular
methods. The MESB facilitates the research on ME spotting with fairer and more
comprehensive evaluation and also enables to leverage the cutting-edge machine
learning tools widely
Automatic 4D Facial Expression Recognition via Collaborative Cross-domain Dynamic Image Network
This paper proposes a novel 4D Facial Expression Recognition (FER) method
using Collaborative Cross-domain Dynamic Image Network (CCDN). Given a 4D data
of face scans, we first compute its geometrical images, and then combine their
correlated information in the proposed cross-domain image representations. The
acquired set is then used to generate cross-domain dynamic images (CDI) via
rank pooling that encapsulates facial deformations over time in terms of a
single image. For the training phase, these CDIs are fed into an end-to-end
deep learning model, and the resultant predictions collaborate over multi-views
for performance gain in expression classification. Furthermore, we propose a 4D
augmentation scheme that not only expands the training data scale but also
introduces significant facial muscle movement patterns to improve the FER
performance. Results from extensive experiments on the commonly used BU-4DFE
dataset under widely adopted settings show that our proposed method outperforms
the state-of-the-art 4D FER methods by achieving an accuracy of 96.5%
indicating its effectiveness.Comment: Published in the 30th British Machine Vision Conference (BMVC) 201
HEp-2 Cell Classification via Fusing Texture and Shape Information
Indirect Immunofluorescence (IIF) HEp-2 cell image is an effective evidence
for diagnosis of autoimmune diseases. Recently computer-aided diagnosis of
autoimmune diseases by IIF HEp-2 cell classification has attracted great
attention. However the HEp-2 cell classification task is quite challenging due
to large intra-class variation and small between-class variation. In this paper
we propose an effective and efficient approach for the automatic classification
of IIF HEp-2 cell image by fusing multi-resolution texture information and
richer shape information. To be specific, we propose to: a) capture the
multi-resolution texture information by a novel Pairwise Rotation Invariant
Co-occurrence of Local Gabor Binary Pattern (PRICoLGBP) descriptor, b) depict
the richer shape information by using an Improved Fisher Vector (IFV) model
with RootSIFT features which are sampled from large image patches in multiple
scales, and c) combine them properly. We evaluate systematically the proposed
approach on the IEEE International Conference on Pattern Recognition (ICPR)
2012, IEEE International Conference on Image Processing (ICIP) 2013 and ICPR
2014 contest data sets. The experimental results for the proposed methods
significantly outperform the winners of ICPR 2012 and ICIP 2013 contest, and
achieve comparable performance with the winner of the newly released ICPR 2014
contest.Comment: 11 pages, 7 figure
From BoW to CNN: Two Decades of Texture Representation for Texture Classification
Texture is a fundamental characteristic of many types of images, and texture
representation is one of the essential and challenging problems in computer
vision and pattern recognition which has attracted extensive research
attention. Since 2000, texture representations based on Bag of Words (BoW) and
on Convolutional Neural Networks (CNNs) have been extensively studied with
impressive performance. Given this period of remarkable evolution, this paper
aims to present a comprehensive survey of advances in texture representation
over the last two decades. More than 200 major publications are cited in this
survey covering different aspects of the research, which includes (i) problem
description; (ii) recent advances in the broad categories of BoW-based,
CNN-based and attribute-based methods; and (iii) evaluation issues,
specifically benchmark datasets and state of the art results. In retrospect of
what has been achieved so far, the survey discusses open challenges and
directions for future research.Comment: Accepted by IJC
Fermi Surface and Carriers Compensation of pyrite-type PtBi Revealed by Quantum Oscillations
Large non-saturating magnetoresistance has been observed in various materials
and electron-hole compensation has been regarded as one of the main mechanisms.
Here we present a detailed study of the angle-dependent Shubnikov -de Haas
effect on large magnetoresistance material pyrite-type PtBi, which allows
us to experimentally reconstruct its Fermi-surface structure and extract the
physical properties of each pocket. We find its Fermi surface contains four
types of pockets in the Brillouin zone: three ellipsoid-like hole pockets
with C symmetry located on the edges (M points), one intricate
electron pocket merged from four ellipsoids along [111] located on the
corners (R points), two smooth and cambered octahedrons (electron) and
(hole) on the center ( point). The deduced carrier densities
of electrons and holes from the volume of pockets prove carrier compensation.
This compensation at low temperatures is also supported by fitting the field
dependence of Hall and magnetoresistance at different temperatures. We conclude
that the compensation is the main mechanism for the large non-saturating
magnetoresistance in pyrite-type PtBi. We found the hole pockets {\alpha}
may contribute major mobility because of their light masses and anisotropy to
relatively avoid large-angle scattering at low temperature. This may be a
common feature of semimetals with large magnetoresistance. The found
sub-quadratic magnetoresistance in high field is probably due to
field-dependent mobilities, another feature of semimetals under high magnetic
fields.Comment: 5 pages, 4 figure
LOAD: Local Orientation Adaptive Descriptor for Texture and Material Classification
In this paper, we propose a novel local feature, called Local Orientation
Adaptive Descriptor (LOAD), to capture regional texture in an image. In LOAD,
we proposed to define point description on an Adaptive Coordinate System (ACS),
adopt a binary sequence descriptor to capture relationships between one point
and its neighbors and use multi-scale strategy to enhance the discriminative
power of the descriptor. The proposed LOAD enjoys not only discriminative power
to capture the texture information, but also has strong robustness to
illumination variation and image rotation. Extensive experiments on benchmark
data sets of texture classification and real-world material recognition show
that the proposed LOAD yields the state-of-the-art performance. It is worth to
mention that we achieve a 65.4\% classification accuracy-- which is, to the
best of our knowledge, the highest record by far --on Flickr Material Database
by using a single feature. Moreover, by combining LOAD with the feature
extracted by Convolutional Neural Networks (CNN), we obtain significantly
better performance than both the LOAD and CNN. This result confirms that the
LOAD is complementary to the learning-based features.Comment: 13 pages, 7 figure
A Global Alignment Kernel based Approach for Group-level Happiness Intensity Estimation
With the progress in automatic human behavior understanding, analysing the
perceived affect of multiple people has been recieved interest in affective
computing community. Unlike conventional facial expression analysis, this paper
primarily focuses on analysing the behaviour of multiple people in an image.
The proposed method is based on support vector regression with the combined
global alignment kernels (GAKs) to estimate the happiness intensity of a group
of people. We first exploit Riesz-based volume local binary pattern (RVLBP) and
deep convolutional neural network (CNN) based features for characterizing
facial images. Furthermore, we propose to use the GAK for RVLBP and deep CNN
features, respectively for explicitly measuring the similarity of two
group-level images. Specifically, we exploit the global weight sort scheme to
sort the face images from group-level image according to their spatial weights,
making an efficient data structure to GAK. Lastly, we propose Multiple kernel
learning based on three combination strategies for combining two respective
GAKs based on RVLBP and deep CNN features, such that enhancing the
discriminative ability of each GAK. Intensive experiments are performed on the
challenging group-level happiness intensity database, namely HAPPEI. Our
experimental results demonstrate that the proposed approach achieves promising
performance for group happiness intensity analysis, when compared with the
recent state-of-the-art methods
- …
