Search CORE

1,559 research outputs found

Recognizing emotion from Turkish speech using acoustic features

Author: Caglar Oflazoglu
Serdar Yildirim
Publication venue: Springer Nature
Publication date: 01/01/2013
Field of study

Springer - Publisher Connector

Recognizing emotion from Turkish speech using acoustic features

Author: A Batliner
A Batliner
AJ Smola
B Schuller
B Schuller
B Schuller
B Schuller
B Schuller
BS Schuller
C Busso
C Clavel
C Clavel
C Oflazoglu
Caglar Oflazoglu
CC Chang
CC Lee
CM Lee
E Douglas-Cowie
E Douglas-Cowie
E Douglas-Cowie
EM Albornoz
F Burkhardt
F Eyben
G McKeown
IS Engberg
J Ang
J Fleiss
JHL Hansen
KR Scherer
M Bradley
M Grimm
M Grimm
M Grimm
M Hall
M Hall
M Liberman
M Shami
P Ekman
R Bouckaert
S Arunachalam
S Steidl
S Yildirim
Serdar Yildirim
T Banziger
T Polzehl
TL Nwe
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Machine Understanding of Human Behavior

Author: Huang Thomas
Nijholt Anton
Pantic Maja
Pentland Alex
Publication venue: University of Twente, Centre for Telematics and Information Technology (CTIT)
Publication date: 01/01/2007
Field of study

A widely accepted prediction is that computing will move to the background, weaving itself into the fabric of our everyday living spaces and projecting the human user into the foreground. If this prediction is to come true, then next generation computing, which we will call human computing, should be about anticipatory user interfaces that should be human-centered, built for humans based on human models. They should transcend the traditional keyboard and mouse to include natural, human-like interactive functions including understanding and emulating certain human behaviors such as affective and social signaling. This article discusses a number of components of human behavior, how they might be integrated into computers, and how far we are from realizing the front end of human computing, that is, how far are we from enabling computers to understand human behavior

University of Twente Research Information

Emotion Estimation in Speech Using a 3D Emotion Space Concept

Author: Kristian Kroschel
Michael Grimm
Publication venue: 'IntechOpen'
Publication date: 01/06/2007
Field of study

IntechOpen

Continuous Analysis of Affect from Voice and Face

Author: Gunes Hatice
Nicolaou Mihalis A.
Pantic Maja
Publication venue: Springer
Publication date: 01/01/2011
Field of study

Human affective behavior is multimodal, continuous and complex. Despite major advances within the affective computing research field, modeling, analyzing, interpreting and responding to human affective behavior still remains a challenge for automated systems as affect and emotions are complex constructs, with fuzzy boundaries and with substantial individual differences in expression and experience [7]. Therefore, affective and behavioral computing researchers have recently invested increased effort in exploring how to best model, analyze and interpret the subtlety, complexity and continuity (represented along a continuum e.g., from −1 to +1) of affective behavior in terms of latent dimensions (e.g., arousal, power and valence) and appraisals, rather than in terms of a small number of discrete emotion categories (e.g., happiness and sadness). This chapter aims to (i) give a brief overview of the existing efforts and the major accomplishments in modeling and analysis of emotional expressions in dimensional and continuous space while focusing on open issues and new challenges in the field, and (ii) introduce a representative approach for multimodal continuous analysis of affect from voice and face, and provide experimental results using the audiovisual Sensitive Artificial Listener (SAL) Database of natural interactions. The chapter concludes by posing a number of questions that highlight the significant issues in the field, and by extracting potential answers to these questions from the relevant literature. The chapter is organized as follows. Section 10.2 describes theories of emotion, Sect. 10.3 provides details on the affect dimensions employed in the literature as well as how emotions are perceived from visual, audio and physiological modalities. Section 10.4 summarizes how current technology has been developed, in terms of data acquisition and annotation, and automatic analysis of affect in continuous space by bringing forth a number of issues that need to be taken into account when applying a dimensional approach to emotion recognition, namely, determining the duration of emotions for automatic analysis, modeling the intensity of emotions, determining the baseline, dealing with high inter-subject expression variation, defining optimal strategies for fusion of multiple cues and modalities, and identifying appropriate machine learning techniques and evaluation measures. Section 10.5 presents our representative system that fuses vocal and facial expression cues for dimensional and continuous prediction of emotions in valence and arousal space by employing the bidirectional Long Short-Term Memory neural networks (BLSTM-NN), and introduces an output-associative fusion framework that incorporates correlations between the emotion dimensions to further improve continuous affect prediction. Section 10.6 concludes the chapter

University of Twente Research Information

Using Crowdsourcing for Labelling Emotional Speech Assets

Author: Cullen Charlie
Delany Sarah Jane
Tarasov Alexey
Publication venue: Technological University Dublin
Publication date: 01/01/2010
Field of study

The success of supervised learning approaches for the classification of emotion in speech depends highly on the quality of the training data. The manual annotation of emotion speech assets is the primary way of gathering training data for emotional speech recognition. This position paper proposes the use of crowdsourcing for the rating of emotion speech assets. Recent developments in learning from crowdsourcing offer opportunities to determine accurate ratings for assets which have been annotated by large numbers of non-expert individuals. The challenges involved include identifying good annotators, determining consensus ratings and learning the bias of annotators

Arrow@TUDublin