257 research outputs found

Emotion and mental state recognition from speech

Author: Cowie Roddy
Epps Julien
Narayanan Shrikanth
Schuller Björn
Tao Jianhua
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2012

OPUS Augsburg

Crossref

Springer - Publisher Connector

A Factored Language Model for Prosody Dependent Speech Recognition

Author: Jennifer S. Cole
Ken Chen
Mark A. Hasegawa-Johnson
Publication venue: 'IntechOpen'
Publication date: 01/06/2007

IntechOpen

Speech Recognition

Publication venue: 'IntechOpen'
Publication date: 20/04/2021

Chapters in the first part of the book cover the essential speech processing techniques for building robust automatic speech recognition systems: the representation of speech signals and methods for speech-feature extraction, acoustic and language modeling, efficient algorithms for searching the hypothesis space, and multimodal approaches to speech recognition. The last part of the book is devoted to other speech processing applications that can use information from automatic speech recognition: speaker identification and tracking, prosody modeling in emotion-detection systems, and applications that operate in real-world environments such as mobile communication services and smart homes.

Directory of Open Access Books (DOAB)

USER-AWARENESS AND ADAPTATION IN CONVERSATIONAL AGENTS

Author: Bojanić Milana
Delić Vlado
Gnjatović Milan
Jakovljević Nikša
Jokić Ivan
Popović Branislav
Publication venue: Published by the University of Niš, Serbia
Publication date: 13/06/2014

This paper considers the research question of developing user-aware and adaptive conversational agents. A conversational agent is user-aware to the extent that it recognizes the user's identity and his/her emotional states that are relevant in a given interaction domain. It is user-adaptive to the extent that it dynamically adapts its dialogue behavior to the user and his/her emotional state. The paper summarizes some aspects of our previous work and presents work in progress in the field of speech-based human-machine interaction. It focuses particularly on the development of speech recognition modules in cooperation with the modules for emotion recognition and speaker recognition, as well as the dialogue management module. Finally, it proposes an architecture for a conversational agent that integrates those modules and improves each of them by exploiting synergies among them.

University of Niš: Facta Universitatis (E-Journals) / Универзитет у Нишу

PEAKS – A system for the automatic evaluation of voice and speech disorders

Author: A. Batliner
A. Maier
E. Nöth
F. Rosanowski
M. Schuster
T. Haderlein
U. Eysholdt
Publication venue: 'Elsevier BV'

Crossref

Multi-Sensory Emotion Recognition with Speech and Facial Expression

Author: Yao Qingmei
Publication venue: The Aquila Digital Community
Publication date: 01/08/2016

Emotion plays an important role in people’s daily lives. Understanding emotions and recognizing how to react to others’ feelings are fundamental to successful social interaction. Emotion recognition is currently not only significant in daily life but also a hot topic in academic research, as new techniques such as emotion recognition from speech context show how emotions relate to the content we utter. The demand for and importance of emotion recognition have increased greatly in recent years in applications such as video games, human-computer interaction, cognitive computing, and affective computing. Emotion recognition can draw on many sources, including text, speech, hand and body gestures, and facial expression. At present, most emotion recognition methods use only one of these sources. Human emotion changes every second, and processing a single source may not reflect it correctly. This research is motivated by the desire to understand and evaluate human emotion from multiple sources, such as speech and facial expressions. In this dissertation, multi-sensory emotion recognition is explored. The proposed framework can recognize emotion from speech, from facial expression, or from both. The system design has three important parts: the facial emotion recognizer, the speech emotion recognizer, and the information fusion. The information fusion part takes the results of speech emotion recognition and facial emotion recognition, integrates them with a novel weighted method, and gives a final decision on the emotion. The experiments show that weighted fusion improves accuracy by an average of 3.66% compared with fusion without weights. The recognition rate improves by up to 18.27% and 5.66% compared with speech emotion recognition and facial expression recognition alone, respectively. By improving emotion recognition accuracy, the proposed multi-sensory emotion recognition system can help improve the naturalness of human-computer interaction.
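
The abstract above describes combining the outputs of a speech recognizer and a facial recognizer with weights before making a final decision. It does not specify the weighting scheme, so the following is only a minimal sketch of generic decision-level weighted fusion, not the dissertation's method. The function name `fuse_weighted`, the emotion labels, the scores, and the weights are all hypothetical and chosen for illustration.

```python
from typing import Dict, Tuple


def fuse_weighted(
    speech_scores: Dict[str, float],
    face_scores: Dict[str, float],
    w_speech: float = 0.5,
    w_face: float = 0.5,
) -> Tuple[str, Dict[str, float]]:
    """Combine per-emotion scores from two recognizers with a weighted sum.

    Hypothetical helper: the weights here are fixed by the caller, which is
    an assumption; the source does not say how its weights are obtained.
    """
    if w_speech < 0 or w_face < 0 or w_speech + w_face == 0:
        raise ValueError("weights must be non-negative and not both zero")
    total = w_speech + w_face
    # A label missing from one recognizer contributes a score of 0.0 there.
    labels = set(speech_scores) | set(face_scores)
    fused = {
        label: (
            w_speech * speech_scores.get(label, 0.0)
            + w_face * face_scores.get(label, 0.0)
        ) / total
        for label in labels
    }
    # Final decision: the label with the highest fused score.
    best = max(fused, key=fused.get)
    return best, fused


# Made-up scores for one utterance/frame pair (illustrative only).
speech = {"happy": 0.5, "angry": 0.3, "sad": 0.2}
face = {"happy": 0.2, "angry": 0.7, "sad": 0.1}

label_equal, _ = fuse_weighted(speech, face)  # equal weights
label_weighted, scores = fuse_weighted(speech, face, w_speech=0.8, w_face=0.2)
```

With these made-up inputs, equal weights let the facial recognizer's strong "angry" score dominate, while weighting speech more heavily shifts the decision to "happy". This shows only how the choice of weights can change the fused decision; it says nothing about which weights the reported experiments used.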

Aquila Digital Community (University of Southern Mississippi, USM)