Utilising Spontaneous Conversational Speech in HMM-Based Speech Synthesis

Andersson, Sebastian; Clark, Robert; Yamagishi, Junichi

Utilising Spontaneous Conversational Speech in HMM-Based Speech Synthesis

Authors: Sebastian Andersson
Robert Clark
Junichi Yamagishi
Publication date: 1 September 2010
Publisher

Abstract

Spontaneous conversational speech has many characteristics that are currently not well modelled in unit selection and HMM-based speech synthesis. But in order to build synthetic voices more suitable for interaction we need data that exhibits more conversational characteristics than the generally used read aloud sentences. In this paper we will show how carefully selected utterances from a spontaneous conversation was instrumental for building an HMM-based synthetic voices with more natural sounding conversational characteristics than a voice based on carefully read aloud sentences. We also investigated a style blending technique as a solution to the inherent problem of phonetic coverage in spontaneous speech data. But the lack of an appropriate representation of spontaneous speech phenomena probably contributed to results showing that we could not yet compete with the speech quality achieved for grammatical sentences

Similar works

Full text

Open in the Core reader

Download PDF

Available Versions

Edinburgh Research Archive

oai:era.ed.ac.uk:1842/4540

Last time updated on 07/06/2021

Edinburgh Research Explorer

oai:pure.ed.ac.uk:publications...

Last time updated on 08/02/2015