Search CORE

1,032 research outputs found

A statistical multiresolution approach for face recognition using structural hidden Markov models

Author: Amira A
Bouchaffra D
Nicholl P
Perrott R H
Publication venue: 'Hindawi Limited'
Publication date: 01/01/2007
Field of study

This paper introduces a novel methodology that combines the multiresolution feature of the discrete wavelet transform (DWT) with the local interactions of the facial structures expressed through the structural hidden Markov model (SHMM). A range of wavelet filters such as Haar, biorthogonal 9/7, and Coiflet, as well as Gabor, have been implemented in order to search for the best performance. SHMMs perform a thorough probabilistic analysis of any sequential pattern by revealing both its inner and outer structures simultaneously. Unlike traditional HMMs, the SHMMs do not perform the state conditional independence of the visible observation sequence assumption. This is achieved via the concept of local structures introduced by the SHMMs. Therefore, the long-range dependency problem inherent to traditional HMMs has been drastically reduced. SHMMs have not previously been applied to the problem of face identification. The results reported in this application have shown that SHMM outperforms the traditional hidden Markov model with a 73% increase in accuracy

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

Brunel University Research Archive

Hubungan gaya pembelajaran dengan pencapaian akademik pelajar aliran vokasional

Author: Anwar Nurul Farahanaa
Publication venue
Publication date: 01/01/2013
Field of study

Analisis keputusan Sijil Pelajaran Malaysia (SPM) 2011 menunjukkan penurunan pencapaian bagi Sekolah Menengah Vokasional. Oleh itu, kajian ini dilaksanakan bertujuan untuk mengkaji hubungan di antara gaya pembelajaran dengan pencapaian akademik pelajar. Kajian ini juga ingin mengenalpasti gaya pembelajaran paling dominan yang diamalkan oleh pelajar serta melihat perbezaan gaya pembelajaran dengan jantina pelajar. Seramai 131 orang Pelajar Tingkatan Empat Kursus Vokasional Di Sekolah Menengah Vokasional Segamat di Johor telah terlibat dalam kajian ini. Soal selidik Index of Learning Style (ILS) yang dibangunkan oleh Felder dan Silverman (1991) yang mengandungi 44 soalan telah digunakan untukh menjalankan kajian ini. Gaya pembelajaran pelajar dapat dilihat melalui empat dimensi gaya pembelajaran yang terdiri dari dua sub-skala yang bertentangan iaitu dimensi pelajar Aktif dan Reflektif, dimensi pelajar Konkrit dan Intuitif, dimensi pelajar Verbal dan Visual, serta dimensi pelajar Tersusun dan Global. Data yang diperolehi dianalisis dengan menggunakan perisian Statistical Package for Social Science for WINDOW release 20.0 (SPSS.20.0). Ujian Korelasi Pearson digunakan untuk menganalisis data dalam mengkaji hubungan gaya pembelajaran dengan pencapaian akademik pelajar. Nilai pekali p yang diperolehi di antara gaya pembelajaran dengan pencapaian pelajar adalah (p=0.1 hingga 0.4). Ini menunjukkan tidak terdapat hubungan yang signifikan di antara dua pembolehubah tersebut. Kajian ini juga mendapati bahawa gaya pembelajaran yang menjadi amalan pelajar ialah gaya pembelajaran Tersusun. Hasil kajian juga mendapati bahawa tidak terdapat perbezaan yang signifikan di antara gaya pembelajaran dengan jantina pelajar

UTHM Institutional Repository

Speeding up Convolutional Neural Networks with Low Rank Expansions

Author: Jaderberg Max
Vedaldi Andrea
Zisserman Andrew
Publication venue
Publication date: 01/01/2014
Field of study

The focus of this paper is speeding up the evaluation of convolutional neural networks. While delivering impressive results across a range of computer vision and machine learning tasks, these networks are computationally demanding, limiting their deployability. Convolutional layers generally consume the bulk of the processing time, and so in this work we present two simple schemes for drastically speeding up these layers. This is achieved by exploiting cross-channel or filter redundancy to construct a low rank basis of filters that are rank-1 in the spatial domain. Our methods are architecture agnostic, and can be easily applied to existing CPU and GPU convolutional frameworks for tuneable speedup performance. We demonstrate this with a real world network designed for scene text character recognition, showing a possible 2.5x speedup with no loss in accuracy, and 4.5x speedup with less than 1% drop in accuracy, still achieving state-of-the-art on standard benchmarks

arXiv.org e-Print Archive

Crossref

Oxford University Research Archive

Hidden Markov Models in Automatic Face Recognition - A Review

Author: Iancu Claudia
M. Corcoran Peter
Publication venue: 'IntechOpen'
Publication date: 27/07/2011
Field of study

IntechOpen

A Multi-Stage Classifier for Face Recognition Undertaken by Coarse-to-fine Strategy

Author: Chen-Hui Kuo
Jiann-Der Lee
Publication venue: 'IntechOpen'
Publication date: 01/01/2009
Field of study

Face recognition has been a very active research area for past two decades due to its widely applications such as identity authentication, airport security and access control, surveillance, and video retrieval systems, etc. Numerous approaches have been proposed for face recognition and considerable successes have been reported [1]. A successful face recognitio

IntechOpen

CiteSeerX

Review of constraints on vision-based gesture recognition for human–computer interaction

Author: Bhuyan M. K.
Chakraborty Biplab Ketan
MacDorman Karl F.
Sarma Debajit
Publication venue: 'Institution of Engineering and Technology (IET)'
Publication date: 01/01/2018
Field of study

The ability of computers to recognise hand gestures visually is essential for progress in human-computer interaction. Gesture recognition has applications ranging from sign language to medical assistance to virtual reality. However, gesture recognition is extremely challenging not only because of its diverse contexts, multiple interpretations, and spatio-temporal variations but also because of the complex non-rigid properties of the hand. This study surveys major constraints on vision-based gesture recognition occurring in detection and pre-processing, representation and feature extraction, and recognition. Current challenges are explored in detail

Crossref

IUPUIScholarWorks

Directory of Open Access Journals

Recommended from our members

3D multiresolution statistical approaches for accelerated medical image and volume segmentation

Author: Alzubi S
Publication venue: Brunel University School of Engineering and Design PhD Theses
Publication date: 01/01/2011
Field of study

This thesis was submitted for the degree of Doctor of Philosophy and awarded by Brunel University.Medical volume segmentation got the attraction of many researchers; therefore, many techniques have been implemented in terms of medical imaging including segmentations and other imaging processes. This research focuses on an implementation of segmentation system which uses several techniques together or on their own to segment medical volumes, the system takes a stack of 2D slices or a full 3D volumes acquired from medical scanners as a data input. Two main approaches have been implemented in this research for segmenting medical volume which are multi-resolution analysis and statistical modeling. Multi-resolution analysis has been mainly employed in this research for extracting the features. Higher dimensions of discontinuity (line or curve singularity) have been extracted in medical images using a modified multi-resolution analysis transforms such as ridgelet and curvelet transforms. The second implemented approach in this thesis is the use of statistical modeling in medical image segmentation; Hidden Markov models have been enhanced here to segment medical slices automatically, accurately, reliably and with lossless results. But the problem with using Markov models here is the computational time which is too long. This has been addressed by using feature reduction techniques which has also been implemented in this thesis. Some feature reduction and dimensionality reduction techniques have been used to accelerate the slowest block in the proposed system. This includes Principle Components Analysis, Gaussian Pyramids and other methods. The feature reduction techniques have been employed efficiently with the 3D volume segmentation techniques such as 3D wavelet and 3D Hidden Markov models. The system has been tested and validated using several procedures starting at a comparison with the predefined results, crossing the specialists’ validations, and ending by validating the system using a survey filled by the end users explaining the techniques and the results. This concludes that Markovian models segmentation results has overcome all other techniques in most patients’ cases. Curvelet transform has been also proved promising segmentation results; the end users rate it better than Markovian models due to the long time required with Hidden Markov models

Brunel University Research Archive

Visual speech recognition and utterance segmentation based on mouth movement

Author: Kumar D
Weghorn H
Yau W
Publication venue: IEEE (Piscataway, USA)
Publication date: 01/01/2007
Field of study

This paper presents a vision-based approach to recognize speech without evaluating the acoustic signals. The proposed technique combines motion features and support vector machines (SVMs) to classify utterances. Segmentation of utterances is important in a visual speech recognition system. This research proposes a video segmentation method to detect the start and end frames of isolated utterances from an image sequence. Frames that correspond to `speaking' and `silence' phases are identified based on mouth movement information. The experimental results demonstrate that the proposed visual speech recognition technique yields high accuracy in a phoneme classification task. Potential applications of such a system are, e.g., human computer interface (HCI) for mobility-impaired users, lip-reading mobile phones, in-vehicle systems, and improvement of speech-based computer control in noisy environments

RMIT Research Repository

Application of orthogonal neighborhood preserving projections and two dimensional hidden Markov model for the degradation evaluation of rolling elements bearings

Author: Shengjie Wang
Shuai Zhang
Yongxiang Zhang
Yujie Xiao
Publication venue: 'JVE International Ltd.'
Publication date: 30/06/2017
Field of study

An effective degradation indicator created from the general features is still a hotspot for the condition monitoring of bearing. To cover the shortage of the general features based indicator, some new indicators are built using multiple general features extracted from the original vibration signal without considering the internal relevancy among the features. To address that problem, a new indicator is proposed using the Orthogonal Neighborhood Preserving Projections (ONPP) and 2-Dimensional Hidden Markov Model (2-D HMM). With the ability of keeping the local structure of data set, Orthogonal Neighborhood Preserving Projections is used to obtain the low dimensional features with the main information remained. Unlike 1-Dimensional data-processing algorithm that commonly converts the multiple features into a vector to deal with the high-dimensional data with the integral property of the multiple features considered only, 2-Dimensional Hidden Markov Model not only take the relevance between the individuals of fault features into consideration but also capture the global characteristics of the multiple features. Then a likelihood probability based health assessment indication can be constructed by combing 2-D HMM with the data pre-processed by ONPP. The experiment results indicate that the proposed indicator show great abilities to make degradation performance of the bearing and is sensitive to incipient defects

Journal of Vibroengineering

Robust density modelling using the student's t-distribution for human action recognition

Author: Moghaddam Z
Piccardi M
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/12/2011
Field of study

The extraction of human features from videos is often inaccurate and prone to outliers. Such outliers can severely affect density modelling when the Gaussian distribution is used as the model since it is highly sensitive to outliers. The Gaussian distribution is also often used as base component of graphical models for recognising human actions in the videos (hidden Markov model and others) and the presence of outliers can significantly affect the recognition accuracy. In contrast, the Student's t-distribution is more robust to outliers and can be exploited to improve the recognition rate in the presence of abnormal data. In this paper, we present an HMM which uses mixtures of t-distributions as observation probabilities and show how experiments over two well-known datasets (Weizmann, MuHAVi) reported a remarkable improvement in classification accuracy. © 2011 IEEE

OPUS - University of Technology Sydney