HMM-based synthesis of child speech

Berkling, Kay; King, Simon; Watts, Oliver; Yamagishi, Junichi

HMM-based synthesis of child speech

Authors: Kay Berkling
Simon King
Oliver Watts
Junichi Yamagishi
Publication date: 1 January 2008
Publisher

Abstract

The synthesis of child speech presents challenges both in the collection of data and in the building of a synthesiser from that data. Because only limited data can be collected, and the domain of that data is constrained, it is difficult to obtain the type of phonetically-balanced corpus usually used in speech synthesis. As a consequence, building a synthesiser from this data is difficult. Concatenative synthesisers are not robust to corpora with many missing units (as is likely when the corpus content is not carefully designed), so we chose to build a statistical parametric synthesiser using the HMM-based system HTS. This technique has previously been shown to perform well for limited amounts of data, and for data collected under imperfect conditions. We compared 6 different configurations of the synthesiser, using both speaker-dependent and speaker-adaptive modelling techniques, and using varying amounts of data. The output from these systems was evaluated alongside natural and vocoded speech, in a Blizzard-style listening test

Similar works

Full text

Open in the Core reader

Download PDF

Available Versions

Edinburgh Research Explorer

oai:pure.ed.ac.uk:publications...

Last time updated on 08/02/2015

Edinburgh Research Archive

oai:era.ed.ac.uk:1842/3817

Last time updated on 07/06/2021