8 research outputs found

    Low latency and tight resources viseme recognition from speech using an artificial neural network

    We present a speech-driven, real-time viseme recognition system based on an Artificial Neural Network (ANN). A Multi-Layer Perceptron (MLP) is used to provide a light and responsive framework adapted to the final application (i.e., animating the lips of an avatar on a multitasking platform such as a set-top box, with embedded resource and latency constraints). Several improvements to this system are studied, such as training data selection, network size, training set size, and the choice of the best acoustic unit to recognize. All variants are compared to a baseline system, and the combined improvements achieve a recognition rate of 64.3% for a set of 18 visemes and 70.8% for 9 visemes. We then propose a system that trades off recognition performance, resource requirements, and latency constraints. A scalable variant is also described.

    On the Recognition of Emotion from Physiological Data

    This work encompasses several objectives, but is primarily concerned with an experiment in which 33 participants were shown 32 slides in order to create 'weakly induced emotions'. Recordings of the participants' physiological state were taken, as well as a self-report of their emotional state. We then used an assortment of classifiers to predict emotional state from the recorded physiological signals, a process known as Physiological Pattern Recognition (PPR). We investigated techniques for recording, processing and extracting features from six different physiological signals: Electrocardiogram (ECG), Blood Volume Pulse (BVP), Galvanic Skin Response (GSR), Electromyography (EMG) of the corrugator muscle, skin temperature of the finger, and respiratory rate. The state of PPR emotion detection was improved by allowing 9 different weakly induced emotional states to be detected at nearly 65% accuracy, an increase in the number of states readily detectable. The work presents many investigations into numerical feature extraction from physiological signals and has a chapter dedicated to collating and trialing facial electromyography techniques. We also created a hardware device to collect participants' self-reported emotional states, which brought several improvements to experimental procedure.

    Proceedings of the Eighth Italian Conference on Computational Linguistics CliC-it 2021

    The eighth edition of the Italian Conference on Computational Linguistics (CLiC-it 2021) was held at Università degli Studi di Milano-Bicocca from 26th to 28th January 2022. After the 2020 edition, which was held in fully virtual mode due to the health emergency related to Covid-19, CLiC-it 2021 was the first opportunity for the Italian Computational Linguistics research community to meet in person after more than one year of full or partial lockdown.

    Human-Computer Interaction

    In this book the reader will find a collection of 31 papers presenting different facets of Human-Computer Interaction, the result of research projects and experiments as well as new approaches to designing user interfaces. The book is organized according to the following main topics, in sequential order: new interaction paradigms, multimodality, usability studies on several interaction mechanisms, human factors, universal design, and development methodologies and tools.

    Machine Medical Ethics

    In medical settings, machines are in close proximity with human beings: with patients who are in vulnerable states of health, who have disabilities of various kinds, with the very young or very old, and with medical professionals. Machines in these contexts are undertaking important medical tasks that require emotional sensitivity, knowledge of medical codes, human dignity, and privacy. As machine technology advances, ethical concerns become more urgent: should medical machines be programmed to follow a code of medical ethics? What theory or theories should constrain medical machine conduct? What design features are required? Should machines share responsibility with humans for the ethical consequences of medical actions? How ought clinical relationships involving machines to be modeled? Is a capacity for empathy and emotion detection necessary? What about consciousness? The essays in this collection by researchers from both humanities and science describe various theoretical and experimental approaches to adding medical ethics to a machine, what design features are necessary in order to achieve this, philosophical and practical questions concerning justice, rights, decision-making and responsibility, and accurately modeling essential physician-machine-patient relationships. This collection is the first book to address these 21st-century concerns

    An integrative computational modelling of music structure apprehension

    Pilot study for subgroup classification for autism spectrum disorder based on dysmorphology and physical measurements in Chinese children

    Poster Sessions: 157 - Comorbid Medical Conditions: abstract 157.058 58. BACKGROUND: Autism Spectrum Disorder (ASD) is a complex neurodevelopmental disorder affecting individuals along a continuum of severity in communication, social interaction and behaviour. The impact of ASD varies significantly amongst individuals, and its causes range broadly across genetic and environmental factors. OBJECTIVES: Previous ASD research indicates that early identification combined with a targeted treatment plan involving behavioural interventions and multidisciplinary therapies can provide substantial improvement for ASD patients. Currently there is no cure for ASD, and the clinical variability and uncertainty of the disorder remain. Hence, the search to unravel heterogeneity within ASD by subgroup classification may give clinicians a better understanding of ASD and help them work towards a more definitive course of action. METHODS: In this study, a norm of physical measurements including height, weight, head circumference, ear length, outer and inner canthi, interpupillary distance, philtrum, hand and foot length was collected from 658 Typical Developing (TD) Chinese children aged 1 to 7 years (mean age of 4.19 years). The norm collected was compared against 80 ASD Chinese children aged 1 to 12 years (mean age of 4.36 years). We then further attempted to find subgroups within ASD based on identifying physical abnormalities; individuals were classified as (non) dysmorphic with the Autism Dysmorphology Measure (ADM) from physical examinations of 12 body regions. RESULTS: Our results show that there were significant differences between ASD and TD children for measurements in: head circumference (p=0.009), outer (p=0.021) and inner (p=0.021) canthus, philtrum length (p=0.003), right (p=0.023) and left (p=0.20) foot length. Within the 80 ASD patients, 37 (46%) were classified as dysmorphic (p=0.00). CONCLUSIONS: This study attempts to identify subgroups within ASD based on physical measurements and dysmorphology examinations. The information from this study seeks to benefit the ASD community by identifying possible subtypes of ASD in the Chinese population, in search of a more definitive diagnosis, referral and treatment plan.