Mage - Reactive articulatory feature control of HMM-based parametric speech synthesis

Astrinaki, Maria; Dutoit, Thierry; King, Simon; Ling, Zhen-Hua; Moinet, Alexis; Richmond, Korin; Yamagishi, Junichi

research

Mage - Reactive articulatory feature control of HMM-based parametric speech synthesis

Authors: Maria Astrinaki
Thierry Dutoit
Simon King
Zhen-Hua Ling
Alexis Moinet
Korin Richmond
Junichi Yamagishi
Publication date: 1 January 2013
Publisher

Abstract

In this paper, we present the integration of articulatory control into MAGE, a framework for realtime and interactive (reactive) parametric speech synthesis using hidden Markov models (HMMs). MAGE is based on the speech synthesis engine from HTS and uses acoustic features (spectrum and f0) to model and synthesize speech. In this work, we replace the standard acoustic models with models combining acoustic and articulatory features, such as tongue, lips and jaw positions. We then use feature-space-switched articulatory-to-acoustic regression matrices to enable us to control the spectral acoustic features by manipulating the articulatory features. Combining this synthesis model with MAGE allows us to interactively and intuitively modify phones synthesized in real time, for example transforming one phone into another, by controlling the configuration of the articulators in a visual display. Index Terms: speech synthesis, reactive, articulators 1

Similar works

Full text

Open in the Core reader

Download PDF

Available Versions

ORBi UMONS

oai:orbi.umons.ac.be:20.500.12...

Last time updated on 28/10/2024

Edinburgh Research Explorer

oai:pure.ed.ac.uk:publications...

Last time updated on 08/02/2015