In this paper, we describe emotion recognition experiments carried out for spontaneous affective speech with the aim to compare the added value of annotation of felt emotion versus annotation of perceived emotion. Using speech material available in the TNO-GAMING corpus (a corpus containing audiovisual recordings of people playing videogames), speech-based affect recognizers were developed that can predict Arousal and Valence scalar values. Two types of recognizers were developed in parallel: one trained with felt emotion annotations (generated by the gamers themselves) and one trained with perceived/observed emotion annotations (generated by a group of observers). The experiments showed that, in speech, with the methods and features currently used, observed emotions are easier to predict than felt emotions. The results suggest that recognition performance strongly depends on how and by whom the emotion annotations are carried out.

Jong, Franciska M.G. de

Leeuwen, David A. van

Neerincx, Mark A.

Truong, Khiet P.

English

NARCIS 

Arousal and valence prediction in spontaneous emotional speech: felt versus perceived emotion

Contains fulltext: 91351.pdf (author's version) (Open Access)

Interspeec

Radboud Repository

University of Twente Research Information

Index Terms: emotion, emotional speech database, emotion recognitio

http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.153.6619
