Search CORE

1,459 research outputs found

HEp-2 Cell Classification: The Role of Gaussian Scale Space Theory as A Pre-processing Approach

Author: Chen Jie
Pietikäinen Matti
Qi Xianbiao
Zhao Guoying
Publication venue
Publication date: 08/09/2015
Field of study

\textit{Indirect Immunofluorescence Imaging of Human Epithelial Type 2} (HEp-2) cells is an effective way to identify the presence of Anti-Nuclear Antibody (ANA). Most existing works on HEp-2 cell classification mainly focus on feature extraction, feature encoding and classifier design. Very few efforts have been devoted to study the importance of the pre-processing techniques. In this paper, we analyze the importance of the pre-processing, and investigate the role of Gaussian Scale Space (GSS) theory as a pre-processing approach for the HEp-2 cell classification task. We validate the GSS pre-processing under the Local Binary Pattern (LBP) and the Bag-of-Words (BoW) frameworks. Under the BoW framework, the introduced pre-processing approach, using only one Local Orientation Adaptive Descriptor (LOAD), achieved superior performance on the Executable Thematic on Pattern Recognition Techniques for Indirect Immunofluorescence (ET-PRT-IIF) image analysis. Our system, using only one feature, outperformed the winner of the ICPR 2014 contest that combined four types of features. Meanwhile, the proposed pre-processing method is not restricted to this work; it can be generalized to many existing works.Comment: 9 pages, 6 figure

arXiv.org e-Print Archive

Probing the Intra-Component Correlations within Fisher Vector for Material Classification

Author: Hong Xiaopeng
Pietikäinen Matti
Qi Xianbiao
Zhao Guoying
Publication venue
Publication date: 15/04/2016
Field of study

Fisher vector (FV) has become a popular image representation. One notable underlying assumption of the FV framework is that local descriptors are well decorrelated within each cluster so that the covariance matrix for each Gaussian can be simplified to be diagonal. Though the FV usually relies on the Principal Component Analysis (PCA) to decorrelate local features, the PCA is applied to the entire training data and hence it only diagonalizes the \textit{universal} covariance matrix, rather than those w.r.t. the local components. As a result, the local decorrelation assumption is usually not supported in practice. To relax this assumption, this paper proposes a completed model of the Fisher vector, which is termed as the Completed Fisher vector (CFV). The CFV is a more general framework of the FV, since it encodes not only the variances but also the correlations of the whitened local descriptors. The CFV thus leads to improved discriminative power. We take the task of material categorization as an example and experimentally show that: 1) the CFV outperforms the FV under all parameter settings; 2) the CFV is robust to the changes in the number of components in the mixture; 3) even with a relatively small visual vocabulary the CFV still works well on two challenging datasets.Comment: It is manuscript submitted to Neurocomputing on the end of April, 2015 (!). One year past but no review comments we received yet

arXiv.org e-Print Archive

Recurrent Convolutional Neural Network Regression for Continuous Pain Intensity Estimation in Video

Author: Hong Xiaopeng
Su Fei
Zhao Guoying
Zhou Jing
Publication venue
Publication date: 03/05/2016
Field of study

Automatic pain intensity estimation possesses a significant position in healthcare and medical field. Traditional static methods prefer to extract features from frames separately in a video, which would result in unstable changes and peaks among adjacent frames. To overcome this problem, we propose a real-time regression framework based on the recurrent convolutional neural network for automatic frame-level pain intensity estimation. Given vector sequences of AAM-warped facial images, we used a sliding-window strategy to obtain fixed-length input samples for the recurrent network. We then carefully design the architecture of the recurrent network to output continuous-valued pain intensity. The proposed end-to-end pain intensity regression framework can predict the pain intensity of each frame by considering a sufficiently large historical frames while limiting the scale of the parameters within the model. Our method achieves promising results regarding both accuracy and running speed on the published UNBC-McMaster Shoulder Pain Expression Archive Database.Comment: This paper is the pre-print technical report of the paper accepted by the IEEE CVPR Workshop of Affect "in-the-wild". The final version will be available after the worksho

arXiv.org e-Print Archive

Micro-Expression Spotting: A Benchmark

Author: Hong Xiaopeng
Tran Thuong-Khanh
Zhao Guoying
Publication venue
Publication date: 08/10/2017
Field of study

Micro-expressions are rapid and involuntary facial expressions, which indicate the suppressed or concealed emotions. Recently, the research on automatic micro-expression (ME) spotting obtains increasing attention. ME spotting is a crucial step prior to further ME analysis tasks. The spotting results can be used as important cues to assist many other human-oriented tasks and thus have many potential applications. In this paper, by investigating existing ME spotting methods, we recognize the immediacy of standardizing the performance evaluation of micro-expression spotting methods. To this end, we construct a micro-expression spotting benchmark (MESB). Firstly, we set up a sliding window based multi-scale evaluation framework. Secondly, we introduce a series of protocols. Thirdly, we also provide baseline results of popular methods. The MESB facilitates the research on ME spotting with fairer and more comprehensive evaluation and also enables to leverage the cutting-edge machine learning tools widely

arXiv.org e-Print Archive

Automatic 4D Facial Expression Recognition via Collaborative Cross-domain Dynamic Image Network

Author: Behzad Muzammil
Li Xiaobai
Vo Nhat
Zhao Guoying
Publication venue
Publication date: 07/02/2020
Field of study

This paper proposes a novel 4D Facial Expression Recognition (FER) method using Collaborative Cross-domain Dynamic Image Network (CCDN). Given a 4D data of face scans, we first compute its geometrical images, and then combine their correlated information in the proposed cross-domain image representations. The acquired set is then used to generate cross-domain dynamic images (CDI) via rank pooling that encapsulates facial deformations over time in terms of a single image. For the training phase, these CDIs are fed into an end-to-end deep learning model, and the resultant predictions collaborate over multi-views for performance gain in expression classification. Furthermore, we propose a 4D augmentation scheme that not only expands the training data scale but also introduces significant facial muscle movement patterns to improve the FER performance. Results from extensive experiments on the commonly used BU-4DFE dataset under widely adopted settings show that our proposed method outperforms the state-of-the-art 4D FER methods by achieving an accuracy of 96.5% indicating its effectiveness.Comment: Published in the 30th British Machine Vision Conference (BMVC) 201

arXiv.org e-Print Archive

HEp-2 Cell Classification via Fusing Texture and Shape Information

Author: Guo Jun
Li Chun-Guang
Pietikäinen Matti
Qi Xianbiao
Zhao Guoying
Publication venue
Publication date: 16/02/2015
Field of study

Indirect Immunofluorescence (IIF) HEp-2 cell image is an effective evidence for diagnosis of autoimmune diseases. Recently computer-aided diagnosis of autoimmune diseases by IIF HEp-2 cell classification has attracted great attention. However the HEp-2 cell classification task is quite challenging due to large intra-class variation and small between-class variation. In this paper we propose an effective and efficient approach for the automatic classification of IIF HEp-2 cell image by fusing multi-resolution texture information and richer shape information. To be specific, we propose to: a) capture the multi-resolution texture information by a novel Pairwise Rotation Invariant Co-occurrence of Local Gabor Binary Pattern (PRICoLGBP) descriptor, b) depict the richer shape information by using an Improved Fisher Vector (IFV) model with RootSIFT features which are sampled from large image patches in multiple scales, and c) combine them properly. We evaluate systematically the proposed approach on the IEEE International Conference on Pattern Recognition (ICPR) 2012, IEEE International Conference on Image Processing (ICIP) 2013 and ICPR 2014 contest data sets. The experimental results for the proposed methods significantly outperform the winners of ICPR 2012 and ICIP 2013 contest, and achieve comparable performance with the winner of the newly released ICPR 2014 contest.Comment: 11 pages, 7 figure

arXiv.org e-Print Archive

From BoW to CNN: Two Decades of Texture Representation for Texture Classification

Author: Chellappa Rama
Chen Jie
Fieguth Paul
Liu Li
Pietikainen Matti
Zhao Guoying
Publication venue
Publication date: 03/10/2018
Field of study

Texture is a fundamental characteristic of many types of images, and texture representation is one of the essential and challenging problems in computer vision and pattern recognition which has attracted extensive research attention. Since 2000, texture representations based on Bag of Words (BoW) and on Convolutional Neural Networks (CNNs) have been extensively studied with impressive performance. Given this period of remarkable evolution, this paper aims to present a comprehensive survey of advances in texture representation over the last two decades. More than 200 major publications are cited in this survey covering different aspects of the research, which includes (i) problem description; (ii) recent advances in the broad categories of BoW-based, CNN-based and attribute-based methods; and (iii) evaluation issues, specifically benchmark datasets and state of the art results. In retrospect of what has been achieved so far, the survey discusses open challenges and directions for future research.Comment: Accepted by IJC

arXiv.org e-Print Archive

Fermi Surface and Carriers Compensation of pyrite-type PtBi $_{2}$ Revealed by Quantum Oscillations

Author: Gao Guoying
Wu Xuming
Xu Liangcai
Zhao Lingxiao
Zhu Zengwei
Zuo Huakun
Publication venue: 'American Physical Society (APS)'
Publication date: 14/09/2018
Field of study

Large non-saturating magnetoresistance has been observed in various materials and electron-hole compensation has been regarded as one of the main mechanisms. Here we present a detailed study of the angle-dependent Shubnikov -de Haas effect on large magnetoresistance material pyrite-type PtBi

_{2}

, which allows us to experimentally reconstruct its Fermi-surface structure and extract the physical properties of each pocket. We find its Fermi surface contains four types of pockets in the Brillouin zone: three ellipsoid-like hole pockets

\alpha

with C

_4

symmetry located on the edges (M points), one intricate electron pocket

\beta

merged from four ellipsoids along [111] located on the corners (R points), two smooth and cambered octahedrons

\gamma

(electron) and

\delta

(hole) on the center (

\Gamma

point). The deduced carrier densities of electrons and holes from the volume of pockets prove carrier compensation. This compensation at low temperatures is also supported by fitting the field dependence of Hall and magnetoresistance at different temperatures. We conclude that the compensation is the main mechanism for the large non-saturating magnetoresistance in pyrite-type PtBi

_{2}

. We found the hole pockets {\alpha} may contribute major mobility because of their light masses and anisotropy to relatively avoid large-angle scattering at low temperature. This may be a common feature of semimetals with large magnetoresistance. The found sub-quadratic magnetoresistance in high field is probably due to field-dependent mobilities, another feature of semimetals under high magnetic fields.Comment: 5 pages, 4 figure

arXiv.org e-Print Archive

LOAD: Local Orientation Adaptive Descriptor for Texture and Material Classification

Author: Li Qingquan
Pietikainen Matti
Qi Xianbiao
Shen Linlin
Zhao Guoying
Publication venue
Publication date: 22/04/2015
Field of study

In this paper, we propose a novel local feature, called Local Orientation Adaptive Descriptor (LOAD), to capture regional texture in an image. In LOAD, we proposed to define point description on an Adaptive Coordinate System (ACS), adopt a binary sequence descriptor to capture relationships between one point and its neighbors and use multi-scale strategy to enhance the discriminative power of the descriptor. The proposed LOAD enjoys not only discriminative power to capture the texture information, but also has strong robustness to illumination variation and image rotation. Extensive experiments on benchmark data sets of texture classification and real-world material recognition show that the proposed LOAD yields the state-of-the-art performance. It is worth to mention that we achieve a 65.4\% classification accuracy-- which is, to the best of our knowledge, the highest record by far --on Flickr Material Database by using a single feature. Moreover, by combining LOAD with the feature extracted by Convolutional Neural Networks (CNN), we obtain significantly better performance than both the LOAD and CNN. This result confirms that the LOAD is complementary to the learning-based features.Comment: 13 pages, 7 figure

arXiv.org e-Print Archive

A Global Alignment Kernel based Approach for Group-level Happiness Intensity Estimation

Author: Dhall Abhinav
Goecke Roland
Huang Xiaohua
Pietikainen Matti
Zhao Guoying
Publication venue
Publication date: 03/09/2018
Field of study

With the progress in automatic human behavior understanding, analysing the perceived affect of multiple people has been recieved interest in affective computing community. Unlike conventional facial expression analysis, this paper primarily focuses on analysing the behaviour of multiple people in an image. The proposed method is based on support vector regression with the combined global alignment kernels (GAKs) to estimate the happiness intensity of a group of people. We first exploit Riesz-based volume local binary pattern (RVLBP) and deep convolutional neural network (CNN) based features for characterizing facial images. Furthermore, we propose to use the GAK for RVLBP and deep CNN features, respectively for explicitly measuring the similarity of two group-level images. Specifically, we exploit the global weight sort scheme to sort the face images from group-level image according to their spatial weights, making an efficient data structure to GAK. Lastly, we propose Multiple kernel learning based on three combination strategies for combining two respective GAKs based on RVLBP and deep CNN features, such that enhancing the discriminative ability of each GAK. Intensive experiments are performed on the challenging group-level happiness intensity database, namely HAPPEI. Our experimental results demonstrate that the proposed approach achieves promising performance for group happiness intensity analysis, when compared with the recent state-of-the-art methods

arXiv.org e-Print Archive