2,937 research outputs found
Automatic depression scale prediction using facial expression dynamics and regression
Depression is a state of low mood and aversion to activity that can affect a person's thoughts, behaviour, feelings and sense of well-being. In such a low mood, both the facial expression and voice appear different from the ones in normal states. In this paper, an automatic system is proposed to predict the scales of Beck Depression Inventory from naturalistic facial expression of the patients with depression. Firstly, features are extracted from corresponding video and audio signals to represent characteristics of facial and vocal expression under depression. Secondly, dynamic features generation method is proposed in the extracted video feature space based on the idea of Motion History Histogram (MHH) for 2-D video motion extraction. Thirdly, Partial Least Squares (PLS) and Linear regression are applied to learn the relationship between the dynamic features and depression scales using training data, and then to predict the depression scale for unseen ones. Finally, decision level fusion was done for combining predictions from both video and audio modalities. The proposed approach is evaluated on the AVEC2014 dataset and the experimental results demonstrate its effectiveness.The work by Asim Jan was supported by School of Engineering & Design/Thomas Gerald Gray PGR Scholarship. The work by Hongying Meng and Saeed Turabzadeh was partially funded by the award of the Brunel Research Initiative and Enterprise Fund (BRIEF). The work by Yona Falinie Binti Abd Gaus was supported by Majlis Amanah Rakyat (MARA) Scholarship
Recommended from our members
Automatic affective dimension recognition from naturalistic facial expressions based on wavelet filtering and PLS regression
Automatic affective dimension recognition from facial expression continuously in naturalistic contexts is a very challenging research topic but very important in human-computer interaction. In this paper, an automatic recognition system was proposed to predict the affective dimensions such as Arousal, Valence and Dominance continuously in naturalistic facial expression videos. Firstly, visual and vocal features are extracted from image frames and audio segments in facial expression videos. Secondly, a wavelet transform based digital filtering method is applied to remove the irrelevant noise information in the feature space. Thirdly, Partial Least Squares regression is used to predict the affective dimensions from both video and audio modalities. Finally, two modalities are combined to boost overall performance in the decision fusion process. The proposed method is tested in the fourth international Audio/Visual Emotion Recognition Challenge (AVEC2014) dataset and compared to other state-of-the-art methods in the affect recognition sub-challenge with a good performance
Towards Effective Codebookless Model for Image Classification
The bag-of-features (BoF) model for image classification has been thoroughly
studied over the last decade. Different from the widely used BoF methods which
modeled images with a pre-trained codebook, the alternative codebook free image
modeling method, which we call Codebookless Model (CLM), attracted little
attention. In this paper, we present an effective CLM that represents an image
with a single Gaussian for classification. By embedding Gaussian manifold into
a vector space, we show that the simple incorporation of our CLM into a linear
classifier achieves very competitive accuracy compared with state-of-the-art
BoF methods (e.g., Fisher Vector). Since our CLM lies in a high dimensional
Riemannian manifold, we further propose a joint learning method of low-rank
transformation with support vector machine (SVM) classifier on the Gaussian
manifold, in order to reduce computational and storage cost. To study and
alleviate the side effect of background clutter on our CLM, we also present a
simple yet effective partial background removal method based on saliency
detection. Experiments are extensively conducted on eight widely used databases
to demonstrate the effectiveness and efficiency of our CLM method
Hyperspectral Imaging and Their Applications in the Nondestructive Quality Assessment of Fruits and Vegetables
Over the past decade, hyperspectral imaging has been rapidly developing and widely used as an emerging scientific tool in nondestructive fruit and vegetable quality assessment. Hyperspectral imaging technique integrates both the imaging and spectroscopic techniques into one system, and it can acquire a set of monochromatic images at almost continuous hundreds of thousands of wavelengths. Many researches based on spatial image and/or spectral image processing and analysis have been published proposing the use of hyperspectral imaging technique in the field of quality assessment of fruits and vegetables. This chapter presents a detailed overview of the introduction, latest developments and applications of hyperspectral imaging in the nondestructive assessment of fruits and vegetables. Additionally, the principal components, basic theories, and corresponding processing and analytical methods are also reported in this chapter
A study on non-destructive method for detecting Toxin in pepper using Neural networks
Mycotoxin contamination in certain agricultural systems have been a serious
concern for human and animal health. Mycotoxins are toxic substances produced
mostly as secondary metabolites by fungi that grow on seeds and feed in the
field, or in storage. The food-borne Mycotoxins likely to be of greatest
significance for human health in tropical developing countries are Aflatoxins
and Fumonisins. Chili pepper is also prone to Aflatoxin contamination during
harvesting, production and storage periods.Various methods used for detection
of Mycotoxins give accurate results, but they are slow, expensive and
destructive. Destructive method is testing a material that degrades the sample
under investigation. Whereas, non-destructive testing will, after testing,
allow the part to be used for its intended purpose. Ultrasonic methods,
Multispectral image processing methods, Terahertz methods, X-ray and
Thermography have been very popular in nondestructive testing and
characterization of materials and health monitoring. Image processing methods
are used to improve the visual quality of the pictures and to extract useful
information from them. In this proposed work, the chili pepper samples will be
collected, and the X-ray, multispectral images of the samples will be processed
using image processing methods. The term "Computational Intelligence" referred
as simulation of human intelligence on computers. It is also called as
"Artificial Intelligence" (AI) approach. The techniques used in AI approach are
Neural network, Fuzzy logic and evolutionary computation. Finally, the
computational intelligence method will be used in addition to image processing
to provide best, high performance and accurate results for detecting the
Mycotoxin level in the samples collected.Comment: 11 pages,1 figure; International Journal of Artificial Intelligence &
Applications (IJAIA), Vol.3, No.4, July 201
Facial Image Analysis for Body Mass Index, Makeup and Identity
The principal aim of facial image analysis in computer vision is to extract valuable information(e.g., age, gender, ethnicity, and identity) by interpreting perceived electronic signals from face images. In this dissertation, we develop facial image analysis systems for body mass index (BMI) prediction, makeup detection, as well as facial identity with makeup changes and BMI variations.;BMI is a commonly used measure of body fatness. In the first part of this thesis, we study BMI related topics. At first, we develop a computational method to predict BMI information from face images automatically. We formulate the BMI prediction from facial features as a machine vision problem. Three regression methods, including least square estimation, Gaussian processes for regression, and support vector regression are employed to predict the BMI value. Our preliminary results show that it is feasible to develop a computational system for BMI prediction from face images. Secondly, we address the influence of BMI changes on face identity. Both synthesized and real face images are assembled as the databases to facilitate our study. Empirically, we found that large BMI alterations can significantly reduce the matching accuracy of the face recognition system. Then we study if the influence of BMI changes can be reduced to improve the face recognition performance. The partial least squares (PLS) method is applied for this purpose. Experimental results show the feasibility to develop algorithms to address the influence of facial adiposity variations on face recognition, caused by BMI changes.;Makeup can affect facial appearance obviously. In the second part of this thesis, we deal with makeup influence on face identity. It is principal to perform makeup detection at first to address makeup influence. Four categories of features are proposed to characterize facial makeup cues in our study, including skin color tone, skin smoothness, texture, and highlight. A patch selection scheme and discriminative mapping are presented to enhance the performance of makeup detection. Secondly, we study dual attributes from makeup and non-makeup faces separately to reflect facial appearance changes caused by makeup in a semantic level. Cross-makeup attribute classification and accuracy change analysis is operated to divide dual attributes into four categories according to different makeup effects. To develop a face recognition system that is robust to facial makeup, PLS method is proposed on features extracted from local patches. We also propose a dual-attributes based method for face verification. Shared dual attributes can be used to measure facial similarity, rather than a direct matching with low-level features. Experimental results demonstrate the feasibility to eliminate the influence of makeup on face recognition.;In summary, contributions of this dissertation center in developing facial image analysis systems to deal with newly emerged topics effectively, i.e., BMI prediction, makeup detection, and the rcognition of face identity with makeup and BMI changes. In particular,to the best of our knowledge, BMI related topics, i.e., BMI prediction; the influence of BMI changes on face recognition; and face recognition robust to BMI changes are first explorations to the biometrics society
Automatic age and gender classification using supervised appearance model
YesAge and gender classification are two important problems that recently gained popularity in the
research community, due to their wide range of applications. Research has shown that both age and gender
information are encoded in the face shape and texture, hence the active appearance model (AAM), a statistical
model that captures shape and texture variations, has been one of the most widely used feature extraction
techniques for the aforementioned problems. However, AAM suffers from some drawbacks, especially when
used for classification. This is primarily because principal component analysis (PCA), which is at the core of
the model, works in an unsupervised manner, i.e., PCA dimensionality reduction does not take into account
how the predictor variables relate to the response (class labels). Rather, it explores only the underlying structure
of the predictor variables, thus, it is no surprise if PCA discards valuable parts of the data that represent discriminatory
features. Toward this end, we propose a supervised appearance model (sAM) that improves on AAM
by replacing PCA with partial least-squares regression. This feature extraction technique is then used for the
problems of age and gender classification. Our experiments show that sAM has better predictive power than the
conventional AAM
- …