Location of Repository

On the generation of synthetic disfluent speech: Local prosodic modifications caused by the insertion of editing terms

By Jordi Adell, Antonio Bonafonte and David Escudero-mancebo

Abstract

Disfluent speech synthesis is necessary in some applications such as automatic film dubbing or spoken translation. This paper presents a model for the generation of synthetic disfluent speech based on inserting each element of a disfluency in a context where they can be considered fluent. Prosody obtained by the application of standard techniques on these new sentences is used for the synthesis of the disfluent sentence. In addition, local modifications are applied to segmental units adjacent to disfluency elements. Experiments evidence that duration follows this behavior, what supports the feasibility of the model. Index Terms: speech synthesis, disfluent speech, prosody, disfluencies. 1

Year: 2010
OAI identifier: oai:CiteSeerX.psu:10.1.1.161.392
Provided by: CiteSeerX
Download PDF:
Sorry, we are unable to provide the full text but you may find it at the following location(s):
  • http://citeseerx.ist.psu.edu/v... (external link)
  • http://gps-tsc.upc.es/veu/rese... (external link)
  • Suggested articles


    To submit an update or takedown request for this paper, please submit an Update/Correction/Removal Request.