Search CORE

1,009 research outputs found

A statistical multiresolution approach for face recognition using structural hidden Markov models

Author: Amira A
Bouchaffra D
Nicholl P
Perrott R H
Publication venue: 'Hindawi Limited'
Publication date: 01/01/2007
Field of study

This paper introduces a novel methodology that combines the multiresolution feature of the discrete wavelet transform (DWT) with the local interactions of the facial structures expressed through the structural hidden Markov model (SHMM). A range of wavelet filters such as Haar, biorthogonal 9/7, and Coiflet, as well as Gabor, have been implemented in order to search for the best performance. SHMMs perform a thorough probabilistic analysis of any sequential pattern by revealing both its inner and outer structures simultaneously. Unlike traditional HMMs, the SHMMs do not perform the state conditional independence of the visible observation sequence assumption. This is achieved via the concept of local structures introduced by the SHMMs. Therefore, the long-range dependency problem inherent to traditional HMMs has been drastically reduced. SHMMs have not previously been applied to the problem of face identification. The results reported in this application have shown that SHMM outperforms the traditional hidden Markov model with a 73% increase in accuracy

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

Brunel University Research Archive

Hubungan gaya pembelajaran dengan pencapaian akademik pelajar aliran vokasional

Author: Anwar Nurul Farahanaa
Publication venue
Publication date: 01/01/2013
Field of study

Analisis keputusan Sijil Pelajaran Malaysia (SPM) 2011 menunjukkan penurunan pencapaian bagi Sekolah Menengah Vokasional. Oleh itu, kajian ini dilaksanakan bertujuan untuk mengkaji hubungan di antara gaya pembelajaran dengan pencapaian akademik pelajar. Kajian ini juga ingin mengenalpasti gaya pembelajaran paling dominan yang diamalkan oleh pelajar serta melihat perbezaan gaya pembelajaran dengan jantina pelajar. Seramai 131 orang Pelajar Tingkatan Empat Kursus Vokasional Di Sekolah Menengah Vokasional Segamat di Johor telah terlibat dalam kajian ini. Soal selidik Index of Learning Style (ILS) yang dibangunkan oleh Felder dan Silverman (1991) yang mengandungi 44 soalan telah digunakan untukh menjalankan kajian ini. Gaya pembelajaran pelajar dapat dilihat melalui empat dimensi gaya pembelajaran yang terdiri dari dua sub-skala yang bertentangan iaitu dimensi pelajar Aktif dan Reflektif, dimensi pelajar Konkrit dan Intuitif, dimensi pelajar Verbal dan Visual, serta dimensi pelajar Tersusun dan Global. Data yang diperolehi dianalisis dengan menggunakan perisian Statistical Package for Social Science for WINDOW release 20.0 (SPSS.20.0). Ujian Korelasi Pearson digunakan untuk menganalisis data dalam mengkaji hubungan gaya pembelajaran dengan pencapaian akademik pelajar. Nilai pekali p yang diperolehi di antara gaya pembelajaran dengan pencapaian pelajar adalah (p=0.1 hingga 0.4). Ini menunjukkan tidak terdapat hubungan yang signifikan di antara dua pembolehubah tersebut. Kajian ini juga mendapati bahawa gaya pembelajaran yang menjadi amalan pelajar ialah gaya pembelajaran Tersusun. Hasil kajian juga mendapati bahawa tidak terdapat perbezaan yang signifikan di antara gaya pembelajaran dengan jantina pelajar

UTHM Institutional Repository

A dynamic texture based approach to recognition of facial actions and their temporal models

Author: Koelstra Sander
Pantic Maja
Patras Ioannis (Yannis)
Publication venue: IEEE
Publication date: 01/01/2010
Field of study

In this work, we propose a dynamic texture-based approach to the recognition of facial Action Units (AUs, atomic facial gestures) and their temporal models (i.e., sequences of temporal segments: neutral, onset, apex, and offset) in near-frontal-view face videos. Two approaches to modeling the dynamics and the appearance in the face region of an input video are compared: an extended version of Motion History Images and a novel method based on Nonrigid Registration using Free-Form Deformations (FFDs). The extracted motion representation is used to derive motion orientation histogram descriptors in both the spatial and temporal domain. Per AU, a combination of discriminative, frame-based GentleBoost ensemble learners and dynamic, generative Hidden Markov Models detects the presence of the AU in question and its temporal segments in an input image sequence. When tested for recognition of all 27 lower and upper face AUs, occurring alone or in combination in 264 sequences from the MMI facial expression database, the proposed method achieved an average event recognition accuracy of 89.2 percent for the MHI method and 94.3 percent for the FFD method. The generalization performance of the FFD method has been tested using the Cohn-Kanade database. Finally, we also explored the performance on spontaneous expressions in the Sensitive Artificial Listener data set

CiteSeerX

Spiral - Imperial College Digital Repository

University of Twente Research Information

Recommended from our members

Evaluation and analysis of hybrid intelligent pattern recognition techniques for speaker identification

Author: Almaadeed Noor
Publication venue: Brunel University School of Engineering and Design PhD Theses
Publication date: 01/01/2014
Field of study

This thesis was submitted for the degree of Doctor of Philosophy and awarded by Brunel University.The rapid momentum of the technology progress in the recent years has led to a tremendous rise in the use of biometric authentication systems. The objective of this research is to investigate the problem of identifying a speaker from its voice regardless of the content (i.e. text-independent), and to design efficient methods of combining face and voice in producing a robust authentication system. A novel approach towards speaker identification is developed using wavelet analysis, and multiple neural networks including Probabilistic Neural Network (PNN), General Regressive Neural Network (GRNN)and Radial Basis Function-Neural Network (RBF NN) with the AND voting scheme. This approach is tested on GRID and VidTIMIT cor-pora and comprehensive test results have been validated with state- of-the-art approaches. The system was found to be competitive and it improved the recognition rate by 15% as compared to the classical Mel-frequency Cepstral Coe±cients (MFCC), and reduced the recognition time by 40% compared to Back Propagation Neural Network (BPNN), Gaussian Mixture Models (GMM) and Principal Component Analysis (PCA). Another novel approach using vowel formant analysis is implemented using Linear Discriminant Analysis (LDA). Vowel formant based speaker identification is best suitable for real-time implementation and requires only a few bytes of information to be stored for each speaker, making it both storage and time efficient. Tested on GRID and Vid-TIMIT, the proposed scheme was found to be 85.05% accurate when Linear Predictive Coding (LPC) is used to extract the vowel formants, which is much higher than the accuracy of BPNN and GMM. Since the proposed scheme does not require any training time other than creating a small database of vowel formants, it is faster as well. Furthermore, an increasing number of speakers makes it di±cult for BPNN and GMM to sustain their accuracy, but the proposed score-based methodology stays almost linear. Finally, a novel audio-visual fusion based identification system is implemented using GMM and MFCC for speaker identi¯cation and PCA for face recognition. The results of speaker identification and face recognition are fused at different levels, namely the feature, score and decision levels. Both the score-level and decision-level (with OR voting) fusions were shown to outperform the feature-level fusion in terms of accuracy and error resilience. The result is in line with the distinct nature of the two modalities which lose themselves when combined at the feature-level. The GRID and VidTIMIT test results validate that the proposed scheme is one of the best candidates for the fusion of face and voice due to its low computational time and high recognition accuracy

Brunel University Research Archive

Automatic Face Recognition System for Hidden Markov Model Techniques

Author: Corcoran Peter M.
Iancu Claudia
Publication venue: 'IntechOpen'
Publication date: 01/08/2011
Field of study

IntechOpen

A new approach to face recognition using Curvelet Transform

Author: Mandal Tanaya
Publication venue: 'University of Windsor Leddy Library'
Publication date: 01/01/2008
Field of study

Multiresolution tools have been profusely employed in face recognition. Wavelet Transform is the best known among these multiresolution tools and is widely used for identification of human faces. Of late, following the success of wavelets a number of new multiresolution tools have been developed. Curvelet Transform is a recent addition to that list. It has better directional ability and effective curved edge representation capability. These two properties make curvelet transform a powerful weapon for extracting edge information from facial images. Our work aims at exploring the possibilities of curvelet transform for feature extraction from human faces in order to introduce a new alternative approach towards face recognition

Scholarship at UWindsor

An Analysis of Facial Expression Recognition Techniques

Author: Vandana Patidar, Dr. Devang Pandya, Asst. Prof. Dharmesh Tank
Publication venue: Auricle Global Society of Education and Research
Publication date: 30/11/2017
Field of study

In present era of technology , we need applications which could be easy to use and are user-friendly , that even people with specific disabilities use them easily. Facial Expression Recognition has vital role and challenges in communities of computer vision, pattern recognition which provide much more attention due to potential application in many areas such as human machine interaction, surveillance , robotics , driver safety, non- verbal communication, entertainment, health- care and psychology study. Facial Expression Recognition has major importance ration in face recognition for significant image applications understanding and analysis. There are many algorithms have been implemented on different static (uniform background, identical poses, similar illuminations ) and dynamic (position variation, partial occlusion orientation, varying lighting )conditions. In general way face expression recognition consist of three main steps first is face detection then feature Extraction and at last classification. In this survey paper we discussed different types of facial expression recognition techniques and various methods which is used by them and their performance measures

International Journal on Future Revolution in Computer Science & Communication Engineering

LOMo: Latent Ordinal Model for Facial Analysis in Videos

Author: Bartlett Marian
Sharma Gaurav
Sikka Karan
Publication venue
Publication date: 01/01/2016
Field of study

We study the problem of facial analysis in videos. We propose a novel weakly supervised learning method that models the video event (expression, pain etc.) as a sequence of automatically mined, discriminative sub-events (eg. onset and offset phase for smile, brow lower and cheek raise for pain). The proposed model is inspired by the recent works on Multiple Instance Learning and latent SVM/HCRF- it extends such frameworks to model the ordinal or temporal aspect in the videos, approximately. We obtain consistent improvements over relevant competitive baselines on four challenging and publicly available video based facial analysis datasets for prediction of expression, clinical pain and intent in dyadic conversations. In combination with complimentary features, we report state-of-the-art results on these datasets.Comment: 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR

arXiv.org e-Print Archive

MPG.PuRe

Recommended from our members

Information theory and the iriscode

Author: Daugman J
Publication venue: IEEE Transactions on Information Forensics and Security
Publication date: 12/11/2015
Field of study

Iris recognition has legendary resistance to False Matches, and the tools of information theory can help to explain why. The concept of entropy is fundamental to understanding biometric collision avoidance. This paper analyses the bit sequences of IrisCodes computed both from real iris images and from synthetic “white noise” iris images whose pixel values are random and uncorrelated. The capacity of the IrisCode as a channel is found to be 0.566 bits per bit encoded, of which 0.469 bits of entropy per bit is encoded from natural iris images. The difference between these two rates reflects the existence of anatomical correlations within a natural iris, and the remaining gap from one full bit of entropy per bit encoded reflects the correlations in both phase and amplitude introduced by the Gabor wavelets underlying the IrisCode. A simple two-state Hidden Markov Model is shown to emulate exactly the statistics of bit sequences generated both from natural and white noise iris images, including their “imposter” distributions, and may be useful for generating large synthetic IrisCode databases.This is the author accepted manuscript. The final version is available from IEEE via http://dx.doi.org/10.1109/TIFS.2015.250019

Apollo (Cambridge)