
    Automatically Learning Speaker-independent Acoustic Subword Units

    We investigate methods for unsupervised learning of sub-word acoustic units of a language directly from speech. We demonstrate that states of a hidden Markov model "grown" using a novel modification of the maximum likelihood successive state splitting algorithm correspond very well with the phones of the language. In particular, the correspondence between the Viterbi state sequence for unseen speech from the training speaker and the phone transcription of the speech is over 85%, and generalizes to a large extent (~63%) to speech from a different speaker. Furthermore, we are able to bridge more than half the gap between the speaker-dependent and cross-speaker correspondence of the automatically learned units to phones (~75% accuracy) by unsupervised adaptation via MLLR.
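
    The abstract does not spell out how the state-to-phone correspondence is scored, but a plausible reading is a frame-level agreement measure: each automatically learned state is mapped to the reference phone it most often co-occurs with, and accuracy is the fraction of frames whose mapped phone matches the transcription. The sketch below illustrates that idea; the function names, data layout, and the majority-vote mapping are assumptions for illustration, not details taken from the paper.

    ```python
    # Hypothetical sketch of a frame-level correspondence metric between learned
    # HMM states and reference phone labels. Assumes each utterance is given as
    # two frame-aligned sequences of equal length: Viterbi state IDs and phones.
    from collections import Counter, defaultdict

    def learn_state_to_phone_map(state_seqs, phone_seqs):
        """Map each learned state to the reference phone it most often overlaps."""
        counts = defaultdict(Counter)
        for states, phones in zip(state_seqs, phone_seqs):
            for s, p in zip(states, phones):   # frame-aligned pairs
                counts[s][p] += 1
        return {s: c.most_common(1)[0][0] for s, c in counts.items()}

    def correspondence(state_seqs, phone_seqs, state_to_phone):
        """Fraction of frames whose mapped state agrees with the transcription."""
        correct = total = 0
        for states, phones in zip(state_seqs, phone_seqs):
            for s, p in zip(states, phones):
                correct += (state_to_phone.get(s) == p)
                total += 1
        return correct / total if total else 0.0
    ```

    Under this reading, the mapping would be estimated on speech from the training speaker and then applied unchanged to held-out speech (same speaker, or a different speaker) to obtain the reported percentages.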