Modelling Pronunciation Variation Using Multi-Path HMMs for Syllables

Authors: Bosch L.F.M. ten
Boves L.W.J.
Hämäläinen K.A.
Publication venue: Honolulu : New York, Institute of Electrical and Electronics Engineers
Publication date: 01/01/2007

Contains fulltext: 44459.pdf (publisher's version) (Closed access)

Radboud Repository

Modelling Pronunciation Variation Using Multi-Path HMMs for Syllables

Authors: Annika Hämäläinen
Lou Boves
Louis ten Bosch

Recent research suggests that it is more appropriate to model pronunciation variation with syllable-length acoustic models than with triphones. Due to the large number of factors contributing to pronunciation variation at the syllable level, the creation of multi-path model topologies appears necessary. In this paper, we construct multi-path models using phonetic knowledge to initialise the parallel paths, and a data-driven solution for their reestimation. When applied to 94 frequent syllables in a Dutch read speech recognition task, the approach leads to improved recognition performance when compared with a much more complex triphone recogniser. A detailed analysis of the pronunciation variation captured by the parallel paths pinpoints the deficiencies of the approach, and provides insights into how these may be overcome.

Index Terms: speech recognition, hidden Markov models

CiteSeerX

Crossref