1,009 research outputs found
A statistical multiresolution approach for face recognition using structural hidden Markov models
This paper introduces a novel methodology that combines the multiresolution feature of the discrete wavelet transform (DWT) with the local interactions of the facial structures expressed through the structural hidden Markov model (SHMM). A range of wavelet filters such as Haar, biorthogonal 9/7, and Coiflet, as well as Gabor, have been implemented in order to search for the best performance. SHMMs perform a thorough probabilistic analysis of any sequential pattern by revealing both its inner and outer structures simultaneously. Unlike traditional HMMs, the SHMMs do not perform the state conditional independence of the visible observation sequence assumption. This is achieved via the concept of local structures introduced by the SHMMs. Therefore, the long-range dependency problem inherent to traditional HMMs has been drastically reduced. SHMMs have not previously been applied to the problem of face identification. The results reported in this application have shown that SHMM outperforms the traditional hidden Markov model with a 73% increase in accuracy
Hubungan gaya pembelajaran dengan pencapaian akademik pelajar aliran vokasional
Analisis keputusan Sijil Pelajaran Malaysia (SPM) 2011 menunjukkan penurunan
pencapaian bagi Sekolah Menengah Vokasional. Oleh itu, kajian ini dilaksanakan
bertujuan untuk mengkaji hubungan di antara gaya pembelajaran dengan pencapaian
akademik pelajar. Kajian ini juga ingin mengenalpasti gaya pembelajaran paling
dominan yang diamalkan oleh pelajar serta melihat perbezaan gaya pembelajaran
dengan jantina pelajar. Seramai 131 orang Pelajar Tingkatan Empat Kursus
Vokasional Di Sekolah Menengah Vokasional Segamat di Johor telah terlibat dalam
kajian ini. Soal selidik Index of Learning Style (ILS) yang dibangunkan oleh Felder
dan Silverman (1991) yang mengandungi 44 soalan telah digunakan untukh
menjalankan kajian ini. Gaya pembelajaran pelajar dapat dilihat melalui empat
dimensi gaya pembelajaran yang terdiri dari dua sub-skala yang bertentangan iaitu
dimensi pelajar Aktif dan Reflektif, dimensi pelajar Konkrit dan Intuitif, dimensi
pelajar Verbal dan Visual, serta dimensi pelajar Tersusun dan Global. Data yang
diperolehi dianalisis dengan menggunakan perisian Statistical Package for Social
Science for WINDOW release 20.0 (SPSS.20.0). Ujian Korelasi Pearson digunakan
untuk menganalisis data dalam mengkaji hubungan gaya pembelajaran dengan
pencapaian akademik pelajar. Nilai pekali p yang diperolehi di antara gaya
pembelajaran dengan pencapaian pelajar adalah (p=0.1 hingga 0.4). Ini menunjukkan
tidak terdapat hubungan yang signifikan di antara dua pembolehubah tersebut. Kajian
ini juga mendapati bahawa gaya pembelajaran yang menjadi amalan pelajar ialah
gaya pembelajaran Tersusun. Hasil kajian juga mendapati bahawa tidak terdapat
perbezaan yang signifikan di antara gaya pembelajaran dengan jantina pelajar
A dynamic texture based approach to recognition of facial actions and their temporal models
In this work, we propose a dynamic texture-based approach to the recognition of facial Action Units (AUs, atomic facial gestures) and their temporal models (i.e., sequences of temporal segments: neutral, onset, apex, and offset) in near-frontal-view face videos. Two approaches to modeling the dynamics and the appearance in the face region of an input video are compared: an extended version of Motion History Images and a novel method based on Nonrigid Registration using Free-Form Deformations (FFDs). The extracted motion representation is used to derive motion orientation histogram descriptors in both the spatial and temporal domain. Per AU, a combination of discriminative, frame-based GentleBoost ensemble learners and dynamic, generative Hidden Markov Models detects the presence of the AU in question and its temporal segments in an input image sequence. When tested for recognition of all 27 lower and upper face AUs, occurring alone or in combination in 264 sequences from the MMI facial expression database, the proposed method achieved an average event recognition accuracy of 89.2 percent for the MHI method and 94.3 percent for the FFD method. The generalization performance of the FFD method has been tested using the Cohn-Kanade database. Finally, we also explored the performance on spontaneous expressions in the Sensitive Artificial Listener data set
Recommended from our members
Evaluation and analysis of hybrid intelligent pattern recognition techniques for speaker identification
This thesis was submitted for the degree of Doctor of Philosophy and awarded by Brunel University.The rapid momentum of the technology progress in the recent years has led to a tremendous rise in the use of biometric authentication systems. The objective of this research is to investigate the problem
of identifying a speaker from its voice regardless of the content (i.e.
text-independent), and to design efficient methods of combining face and voice in producing a robust authentication system.
A novel approach towards speaker identification is developed using
wavelet analysis, and multiple neural networks including Probabilistic
Neural Network (PNN), General Regressive Neural Network (GRNN)and Radial Basis Function-Neural Network (RBF NN) with the AND
voting scheme. This approach is tested on GRID and VidTIMIT cor-pora and comprehensive test results have been validated with state-
of-the-art approaches. The system was found to be competitive and it improved the recognition rate by 15% as compared to the classical Mel-frequency Cepstral Coe±cients (MFCC), and reduced the recognition time by 40% compared to Back Propagation Neural Network (BPNN), Gaussian Mixture Models (GMM) and Principal Component Analysis (PCA).
Another novel approach using vowel formant analysis is implemented using Linear Discriminant Analysis (LDA). Vowel formant based speaker identification is best suitable for real-time implementation and requires only a few bytes of information to be stored for each speaker, making it both storage and time efficient. Tested on GRID and Vid-TIMIT, the proposed scheme was found to be 85.05% accurate when Linear Predictive Coding (LPC) is used to extract the vowel formants, which is much higher than the accuracy of BPNN and GMM. Since the proposed scheme does not require any training time other than creating a small database of vowel formants, it is faster as well. Furthermore, an increasing number of speakers makes it di±cult for BPNN and GMM to sustain their accuracy, but the proposed score-based methodology stays almost linear.
Finally, a novel audio-visual fusion based identification system is implemented using GMM and MFCC for speaker identi¯cation and PCA for face recognition. The results of speaker identification and face recognition are fused at different levels, namely the feature, score and decision levels. Both the score-level and decision-level (with OR voting) fusions were shown to outperform the feature-level fusion in terms of accuracy and error resilience. The result is in line with the distinct nature of the two modalities which lose themselves when combined at the feature-level. The GRID and VidTIMIT test results validate that
the proposed scheme is one of the best candidates for the fusion of
face and voice due to its low computational time and high recognition accuracy
A new approach to face recognition using Curvelet Transform
Multiresolution tools have been profusely employed in face recognition. Wavelet Transform is the best known among these multiresolution tools and is widely used for identification of human faces. Of late, following the success of wavelets a number of new multiresolution tools have been developed. Curvelet Transform is a recent addition to that list. It has better directional ability and effective curved edge representation capability. These two properties make curvelet transform a powerful weapon for extracting edge information from facial images. Our work aims at exploring the possibilities of curvelet transform for feature extraction from human faces in order to introduce a new alternative approach towards face recognition
An Analysis of Facial Expression Recognition Techniques
In present era of technology , we need applications which could be easy to use and are user-friendly , that even people with specific disabilities use them easily. Facial Expression Recognition has vital role and challenges in communities of computer vision, pattern recognition which provide much more attention due to potential application in many areas such as human machine interaction, surveillance , robotics , driver safety, non- verbal communication, entertainment, health- care and psychology study. Facial Expression Recognition has major importance ration in face recognition for significant image applications understanding and analysis. There are many algorithms have been implemented on different static (uniform background, identical poses, similar illuminations ) and dynamic (position variation, partial occlusion orientation, varying lighting )conditions. In general way face expression recognition consist of three main steps first is face detection then feature Extraction and at last classification. In this survey paper we discussed different types of facial expression recognition techniques and various methods which is used by them and their performance measures
LOMo: Latent Ordinal Model for Facial Analysis in Videos
We study the problem of facial analysis in videos. We propose a novel weakly
supervised learning method that models the video event (expression, pain etc.)
as a sequence of automatically mined, discriminative sub-events (eg. onset and
offset phase for smile, brow lower and cheek raise for pain). The proposed
model is inspired by the recent works on Multiple Instance Learning and latent
SVM/HCRF- it extends such frameworks to model the ordinal or temporal aspect in
the videos, approximately. We obtain consistent improvements over relevant
competitive baselines on four challenging and publicly available video based
facial analysis datasets for prediction of expression, clinical pain and intent
in dyadic conversations. In combination with complimentary features, we report
state-of-the-art results on these datasets.Comment: 2016 IEEE Conference on Computer Vision and Pattern Recognition
(CVPR
Recommended from our members
Information theory and the iriscode
Iris recognition has legendary resistance to False
Matches, and the tools of information theory can help to explain
why. The concept of entropy is fundamental to understanding
biometric collision avoidance. This paper analyses the bit sequences
of IrisCodes computed both from real iris images and
from synthetic “white noise” iris images whose pixel values are
random and uncorrelated. The capacity of the IrisCode as a
channel is found to be 0.566 bits per bit encoded, of which
0.469 bits of entropy per bit is encoded from natural iris images.
The difference between these two rates reflects the existence of
anatomical correlations within a natural iris, and the remaining
gap from one full bit of entropy per bit encoded reflects the
correlations in both phase and amplitude introduced by the
Gabor wavelets underlying the IrisCode. A simple two-state
Hidden Markov Model is shown to emulate exactly the statistics
of bit sequences generated both from natural and white noise
iris images, including their “imposter” distributions, and may be
useful for generating large synthetic IrisCode databases.This is the author accepted manuscript. The final version is available from IEEE via http://dx.doi.org/10.1109/TIFS.2015.250019
- …