How to improve TTS systems for emotional expressivity

Ferreira Rebordao, Antonio; Hirose, Keikichi; Minematsu, Nobuaki; Shaikh, Mostafa Al Masum


Authors: Antonio Ferreira Rebordao
Keikichi Hirose
Nobuaki Minematsu
Mostafa Al Masum Shaikh
Publication date: 1 January 2009
Publisher: International Speech Communication Association (ISCA)

Abstract

Several experiments have revealed weaknesses in the emotional expressivity of current Text-To-Speech (TTS) systems. Although some TTS systems allow XML-based representations of prosodic and/or phonetic variables, few publications have considered using intelligent text processing as a pre-processing stage to detect affective information that can be used to tailor the parameters needed for emotional expressivity. This paper describes a technique for automatic prosodic parameterization based on affective clues. The technique recognizes the affective information conveyed in a text and, according to its emotional connotation, assigns appropriate pitch accents and other prosodic parameters by XML tagging. This pre-processing helps the TTS system generate synthesized speech that contains emotional clues. The experimental results are encouraging and suggest that suitable emotional expressivity in speech synthesis is possible.
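
As an illustration of the kind of pre-processing the abstract describes, the sketch below detects an emotion label per sentence and wraps the sentence in an XML prosody tag. It is a minimal sketch under stated assumptions, not the authors' method: the abstract does not name the markup, so W3C SSML `<prosody>` is assumed here, and the keyword lexicon, the emotion labels, and every pitch, rate, and volume value are hypothetical placeholders.

```python
from xml.sax.saxutils import escape

# Hypothetical affect lexicon (word -> emotion label), for illustration only.
AFFECT_LEXICON = {
    "happy": "joy", "wonderful": "joy",
    "sad": "sadness", "lost": "sadness", "died": "sadness",
    "furious": "anger", "hate": "anger",
}

# Hypothetical prosody settings per emotion, written as SSML <prosody> attributes.
PROSODY = {
    "joy": {"pitch": "+15%", "rate": "fast"},
    "sadness": {"pitch": "-10%", "rate": "slow"},
    "anger": {"pitch": "+5%", "rate": "fast", "volume": "loud"},
}


def detect_emotion(sentence):
    """Return the most frequent emotion among lexicon hits, or None if there are none."""
    counts = {}
    for token in sentence.lower().split():
        label = AFFECT_LEXICON.get(token.strip(".,!?;:\"'"))
        if label:
            counts[label] = counts.get(label, 0) + 1
    if not counts:
        return None
    return max(counts, key=counts.get)


def tag_sentence(sentence):
    """Wrap a sentence in a <prosody> element when an emotion is detected."""
    emotion = detect_emotion(sentence)
    text = escape(sentence)  # escape &, <, > so the output stays well-formed XML
    if emotion is None:
        return text
    attrs = " ".join(f'{name}="{value}"' for name, value in PROSODY[emotion].items())
    return f"<prosody {attrs}>{text}</prosody>"


def to_ssml(sentences):
    """Build a complete SSML document from a list of sentences."""
    body = " ".join(f"<s>{tag_sentence(s)}</s>" for s in sentences)
    return (
        '<speak version="1.0" xmlns="http://www.w3.org/2001/10/synthesis" '
        f'xml:lang="en-US">{body}</speak>'
    )


if __name__ == "__main__":
    print(to_ssml(["I am so happy today!", "The meeting is at noon."]))
```

A keyword lookup is far cruder than the intelligent text processing the abstract refers to; it stands in here only to show where affect detection feeds the tagging step. Sentences with no lexicon hit pass through untagged, so the TTS system would fall back to its default prosody for them.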


Available Versions

Ghent University Academic Bibliography

oai:archive.ugent.be:817056

Last time updated on 12/11/2016