49 research outputs found

    Analysis of tongue shape and motion in speech production using statistical modeling

    Get PDF
    The mechanisms of speech production are complex and have been raising attention from researchers of both medical and computer vision fields. In the speech production mechanism, the articulator’s study is a complex issue, since they have a high level of freedom along this process, namely the tongue, which instigates a problem in its control and observation. In this work it is automatically characterized the tongues shape during the articulation of the oral vowels of Portuguese European by using statistical modeling on MR-images. A point distribution model is built from a set of images collected during artificially sustained articulations of Portuguese European sounds, which can extract the main characteristics of the motion of the tongue. The model built in this work allows under standing more clearly the dynamic speech events involved during sustained articulations. The tongue shape model built can also be useful for speech rehabilitation purposes, specifically to recognize the compensatory movements of the articulators during speech production

    Паралельно-ієрархічне перетворення як системна модель для розпізнавання зображень

    Get PDF
    У статті розглянуто метод паралельно-ієрархічного перетворення на основі популяційного кодування та його застосування для задач розпізнавання зображень. Паралельно-ієрархічне перетворення описується як системна модель для розпізнавання зображень. Наведені теоретичні відомості, експериментальні дослідження і програмна реалізація.В статье рассмотрен метод параллельно-иерархического преобразования на основе популяционного кодирования и его применение для задач распознавания изображений. Параллельно-иерархическое преобразование описывается как системная модель для распознавания изображений. Приведены теоретические сведения, экспериментальные исследования и программная реализация.In this article researched parallel-hierarchical transformation method based on popular coding and its using for images recognition tasks. Parallel-hierarchical transformation is descript as the system model for images recognition. Leading up theoretical information, experimental researching and program realization

    A Dynamic Approach to Pose Invariant Face Identification Using Cellular Simultaneous Recurrent Networks

    Get PDF
    Face recognition is a widely covered and desirable research field that produced multiple techniques and different approaches. Most of them have severe limitations with pose variations or face rotation. The immediate goal of this thesis is to deal with pose variations by implementing a face recognition system using a Cellular Simultaneous Recurrent Network (CSRN). The CSRN is a novel bio-inspired recurrent neural network that mimics reinforcement learning in the brain. The recognition task is defined as an identification problem on image sequences. The goal is to correctly match a set of unknown pose distorted probe face sequences with a set of known gallery sequences. This system comprises of a pre-processing stage for face and feature extraction and a recognition stage to perform the identification. The face detection algorithm is based on the scale-space method combined with facial structural knowledge. These steps include extraction of key landmark points and motion unit vectors that describe movement of face sequqnces. The identification process applies Eigenface and PCA and reduces each image to a pattern vector used as input for the CSRN. In the training phase the CSRN learns the temporal information contained in image sequences. In the testing phase the network predicts the output pattern and finds similarity with a test input pattern indicating a match or mismatch.Previous applications of a CSRN system in face recognition have shown promise. The first objective of this research is to evaluate those prior implementations of CSRN-based pose invariant face recognition in video images with large scale databases. The publicly available VidTIMIT Audio-Video face dataset provides all the sequences needed for this study. The second objective is to modify a few well know standard face recognition algorithms to handle pose invariant face recognition for appropriate benchmarking with the CSRN. The final objective is to further improve CSRN face recognition by introducing motion units which can be used to capture the direction and intensity of movement of feature points in a rotating fac

    Facial Expression Recognition Using Uniform Local Binary Pattern with Improved Firefly Feature Selection

    Get PDF
    Facial expressions are essential communication tools in our daily life. In this paper, the uniform local binary pattern is employed to extract features from the face. However, this feature representation is very high in dimensionality. The high dimensionality would not only affect the recognition accuracy but also can impose computational constraints. Hence, to reduce the dimensionality of the feature vector, the firefly algorithm is used to select the optimal subset that leads to better classification accuracy. However, the standard firefly algorithm suffers from the risk of being trapped in local optima after a certain number of generations. Hence, this limitation has been addressed by proposing an improved version of the firefly where the great deluge algorithm (GDA) has been integrated. The great deluge is a local search algorithm that helps to enhance the exploitation ability of the firefly algorithm, thus preventing being trapped in local optima. The improved firefly algorithm has been employed in a facial expression system. Experimental results using the Japanese female facial expression database show that the proposed approach yielded good classification accuracy compared to state-of-the-art methods. The best classification accuracy obtained by the proposed method is 96.7% with 1230 selected features, whereas, Gabor-SRC method achieved 97.6% with 2560 features

    Face Detection and Identification

    Get PDF
    Práce se zabývá problematikou detekce a identifikace obličeje na fotografích. V úvodu jsou rozebrány nejznámější metody se stručnými popisy jejich pricipů. Některé z nich v rámci praktické části implementujeme a otestujeme na volně dostupných databázích. V závěru práce jsou zhodnoceny výsledky metod a uzavření celé práce.This work is focused on the problematic of face detection and identification in photography. The introduction is devoted to the most popular methods with briefly descriptions of their principles and rules. Within the practical part of this work we implement and test on free available databases the several of these methods. In the conclusion we evaluate the results and addition of this whole work.

    Analysis of tongue shape and motion in speech production using statistical modeling

    Get PDF
    The mechanisms of speech production are complex and have been raising attention from researchers of both medical and computer vision fields. In the speech production mechanism, the articulators study is a complex issue, since they have a high level of freedom along this process, namely the tongue, which instigates a problem in its control and observation. In this work it is automatically characterized the tongues shape during the articulation of the oral vowels of Portuguese European by using statistical modeling on MR-images. A point distribution model is built from a set of images collected during artificially sustained articulations of Portuguese European sounds, which can extract the main characteristics of the motion of the tongue. The model built in this work allows understanding more clearly the dynamic speech events involved during sustained articulations. The tongue shape model built can also be useful for speech rehabilitation purposes, specifically to recognize the compensatory movements of the articulators during speech production

    Recognition of unfamiliar faces

    Get PDF
    People are excellent at identifying faces familiar to them, even from very low quality images, but are bad at recognising, or even matching, faces that are unfamiliar. In this review we shall consider some of the factors which affect our abilities to match unfamiliar faces. Major differences in orientation (e.g. inversion) or greyscale information (e.g. negation) affect face processing dramatically, and such effects are informative about the nature of the representations derived from unfamiliar faces, suggesting that these are based on relatively low-level image descriptions. Consistent with this, even relatively minor differences in lighting and viewpoint create problems for human face matching, leading to potentially important problems over the use of images from security video images. The relationships between different parts of the face (its "configuration") are as important to the impression created of an upright face as local features themselves, suggesting further constraints on the representations derived from faces. The review then turns to consider what computer face recognition systems may contribute to understanding both the theory and the practical problems of face identification. Computer systems can be used as an aid to person identification, but also in an attempt to model human perceptual processes. There are many approaches to computer recognition of faces, including ones based on low-level image analysis of whole face images, which have potential as models of human performance. Some systems show significant correlations with human perceptions of the same faces, for example recognising distinctive faces more easily. In some circumstances, some systems may exceed human abilities on unfamiliar faces. Finally, we look to the future of work in this area, that will incorporate motion and three-dimensional shape information
    corecore