A Stochastic Model Of Intonation For French Text-To-Speech Synthesis

Benoît Lagrue; Fabienne Courtois; Jean Veronis; Philippe Di Cristo


Abstract

This paper presents a stochastic model of French intonation contours for use in text-to-speech synthesis. The model has two modules: a linguistic module that generates abstract prosodic labels from text, and a phonetic module that generates an F0 curve from those labels. It differs from previous work in the abstract prosodic labels it uses, which can be derived automatically from the training corpus. This makes it possible to use large corpora, or several corpora of different speech styles, and makes the model easy to adapt to new languages. The present paper focuses on the linguistic module, which does not require a full syntactic analysis of the text and relies only on part-of-speech tagging. The results were validated by a perception test, which showed that listeners did not perceive a significant difference in quality between sentences synthesized with the original F0 curve (from a recording) and those synthesized with the..

Available Versions

CiteSeerX

oai:CiteSeerX.psu:10.1.1.56.47...

Last time updated on 22/10/2014