
Annotating Speech Data for Pronunciation Variation

By Per-Anders Jande

Abstract

This paper describes methods for annotating recorded speech with information hypothesised to be important for the pronunciation of words in discourse context. The annotation is structured into six hierarchically ordered tiers, each corresponding to a segmentally defined linguistic unit. Automatic methods are used to segment and annotate each tier. Decision tree models trained on annotation from elicited monologue showed a phoneme error rate of 9.91%, corresponding to a 55.25% error reduction compared with using a canonical pronunciation representation from a lexicon to estimate the phonetic realisation.
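
The two figures in the abstract can be related with a little arithmetic. The sketch below assumes that "error reduction" means relative reduction, (baseline - model) / baseline, which the abstract does not state explicitly; the function names are illustrative and not taken from the paper. Under that assumption, the canonical lexicon baseline works out to a phoneme error rate of roughly 22%.

```python
def relative_error_reduction(baseline_error: float, model_error: float) -> float:
    """Fraction of the baseline error that the model removes."""
    return (baseline_error - model_error) / baseline_error


def implied_baseline(model_error: float, reduction: float) -> float:
    """Baseline error implied by a model error and a relative reduction."""
    return model_error / (1.0 - reduction)


# Figures quoted in the abstract.
model_per = 9.91     # phoneme error rate of the decision tree models, in percent
reduction = 0.5525   # 55.25% error reduction

# Assumption: the reduction is relative to the canonical lexicon baseline.
baseline_per = implied_baseline(model_per, reduction)
print(f"implied baseline phoneme error rate: {baseline_per:.2f}%")
```

Feeding the implied baseline back into `relative_error_reduction` recovers the quoted 55.25%, which serves as a consistency check on the assumed definition rather than as a result from the paper.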

Year: 2013
OAI identifier: oai:CiteSeerX.psu:10.1.1.373.1561
Provided by: CiteSeerX