3,785 research outputs found
Emotion Recognition from Acted and Spontaneous Speech
This doctoral thesis deals with emotion recognition from speech signals. The thesis is divided into two main parts; the first part describes proposed approaches for emotion recognition using two different multilingual databases of acted emotional speech. The main contributions of this part are a detailed analysis of a large set of acoustic features, new classification schemes for vocal emotion recognition such as "emotion coupling", and a new method for mapping discrete emotions into a two-dimensional space.
The second part of this thesis is devoted to emotion recognition using a database of spontaneous emotional speech, which is based on telephone recordings obtained from real call centers. The knowledge gained from experiments with emotion recognition from acted speech was exploited to design a new approach for classifying seven spontaneous emotional states. The core of the proposed approach is a complex classification architecture based on the fusion of different systems. The thesis also examines the influence of the speaker's emotional state on gender recognition performance and proposes a system for automatic identification of successful phone calls in call centers by means of dialogue features.
An exploration of sarcasm detection in children with Attention Deficit Hyperactivity Disorder
This document is the Accepted Manuscript version of the following article: Amanda K. Ludlow, Eleanor Chadwick, Alice Morey, Rebecca Edwards, and Roberto Gutierrez, "An exploration of sarcasm detection in children with Attention Deficit Hyperactivity Disorder", Journal of Communication Disorders, Vol. 70: 25-34, November 2017. Under embargo. Embargo end date: 31 October 2019. The Version of Record is available at doi: https://doi.org/10.1016/j.jcomdis.2017.10.003. The present research explored the ability of children with ADHD to distinguish between sarcasm and sincerity. Twenty-two children with a clinical diagnosis of ADHD were compared with 22 age- and verbal-IQ-matched typically developing children using the Social Inference–Minimal Test from The Awareness of Social Inference Test (TASIT; McDonald, Flanagan, & Rollins, 2002). This test assesses an individual's ability to interpret naturalistic social interactions containing sincerity, simple sarcasm and paradoxical sarcasm. Children with ADHD demonstrated specific deficits in comprehending paradoxical sarcasm, performing significantly less accurately than the typically developing children. While there were no significant differences between the children with ADHD and the typically developing children in their ability to comprehend sarcasm based on the speaker's intentions and beliefs, the children with ADHD were significantly less accurate when basing their decision on the feelings of the speaker, and also on what the speaker had said. Results are discussed in light of difficulties in understanding the complex cues of social interactions, and of non-literal language, being symptomatic of children with a clinical diagnosis of ADHD. The importance of pragmatic language skills in the ability to detect social and emotional information is highlighted. Peer reviewed.
Machine Understanding of Human Behavior
A widely accepted prediction is that computing will move to the background, weaving itself into the fabric of our everyday living spaces and projecting the human user into the foreground. If this prediction is to come true, then next-generation computing, which we will call human computing, should be about anticipatory user interfaces that are human-centered, built for humans based on human models. They should transcend the traditional keyboard and mouse to include natural, human-like interactive functions, including understanding and emulating certain human behaviors such as affective and social signaling. This article discusses a number of components of human behavior, how they might be integrated into computers, and how far we are from realizing the front end of human computing, that is, how far we are from enabling computers to understand human behavior.
Influence of Voice Intonation on Understanding Irony by Polish-Speaking Preschool Children
The main aim of the presented study was to investigate the influence of voice intonation on the comprehension of ironic utterances in 4- to 6-year-old Polish-speaking children. Eighty-three preschool children were tested with the Irony Comprehension Task (Banasik & Bokus, 2012). In the Irony Comprehension Task, children are presented with stories in which the ironic utterances had been prerecorded by professional speakers using an ironic intonation. Half of the subjects performed the regular Irony Comprehension Task, while the other half were given a modified version in which the ironic content was uttered with a non-ironic intonation. Results indicate that children from the ironic-intonation group scored higher on the Irony Comprehension Task than children who heard the ironic statements uttered in a neutral voice. Ironic voice intonation appeared to be a helpful cue to irony comprehension.
What's in a voice? Prosody as a test case for the Theory of Mind account of autism
The human voice conveys a variety of information about people's feelings, emotions and mental states. Some of these cues rely on sophisticated Theory of Mind (ToM) skills, whilst others are simpler and do not require ToM. This variety provides an interesting test case for the ToM account of autism, which would predict greater impairment as ToM requirements increase. In this paper, we draw on psychological and pragmatic theories to classify vocal cues according to the amount of mindreading required to identify them. Children with high-functioning Autism Spectrum Disorder and matched controls were tested in three experiments in which the speaker's state had to be extracted from their vocalizations. Although our results confirm that people with autism have subtle difficulties dealing with vocal cues, they show a pattern of performance that is inconsistent with the view that atypical recognition of vocal cues is caused by impaired ToM.
Speech with pauses sounds deceptive to listeners with and without hearing impairment
Purpose: Communication is as much persuasion as it is the transfer of information.
This creates a tension between the interests of the speaker and those of the listener
as dishonest speakers naturally attempt to hide deceptive speech, and listeners are
faced with the challenge of sorting truths from lies. Hearing impaired listeners in
particular may have differing levels of access to the acoustical cues that give away
deceptive speech. A greater tendency towards speech pauses has been hypothesised
to result from the cognitive demands of lying convincingly. Higher vocal pitch has also
been hypothesised to mark the increased anxiety of a dishonest speaker.//
Method: listeners with or without hearing impairments heard short utterances from
natural conversations some of which had been digitally manipulated to contain either
increased pausing or raised vocal pitch. Listeners were asked to guess whether each
statement was a lie in a two alternative forced choice task. Participants were also
asked explicitly which cues they believed had influenced their decisions.//
Results: Statements were more likely to be perceived as a lie when they contained
pauses, but not when vocal pitch was raised. This pattern held regardless of hearing
ability. In contrast, both groups of listeners self-reported using vocal pitch cues to
identify deceptive statements, though at lower rates than pauses.//
Conclusions: Listeners may have only partial awareness of the cues that influence
their impression of dishonesty. Hearing impaired listeners may place greater weight on
acoustical cues according to the differing degrees of access provided by hearing aids./