Search CORE

2 research outputs found

Component-based discriminative classification for hidden Markov models

Author: Amari
Baum
Baum
Bellman
Bengio
Bennett
Bhattacharyya
Bicego
Bicego
Chen
Cortes
David M.J. Tax
Duda
Durbin
Elżbieta Pe¸kalska
Everitt
Fine
Fukanaga
Ghahramani
Gough
Hastie
He
Jelinek
Kaiser
Krogh
Kudo
Lai
Layton
Lodhi
Manuele Bicego
Molina
Mollineda
Rabiner
Robert P.W. Duin
Schölkopf
Ting
Tsuda
Vapnik
Wolpert
Woodland
Publication venue: 'Elsevier BV'
Publication date
Field of study

Training Data Selection for Discriminative Training of Acoustic Models

Author: Fang-Hui Chu
朱芳輝
Publication venue
Publication date
Field of study

[[abstract]]This thesis aims to investigate various training data selection approaches for improving the minimum phone error (MPE) based discriminative training of acoustic models for Mandarin large vocabulary continuous speech recognition (LVCSR). First, inspired by the concept of the AdaBoost algorithm that lays more emphasis on the training samples misclassified by the already-trained classifier, the accumulated statistics of the training utterances prone to be incorrectly recognized are properly adjusted during the MPE training. Meanwhile, multiple speech recognition systems with their acoustic models respectively trained using various training data selection criteria are combined together at different recognition stages for improving the recognition accuracy. On the other hand, a novel data selection approach conducted on the expected phone accuracy domain of the word lattices of training utterances is explored as well. It is able to select more discriminative training instances, in terms of either utterances or phone arcs, for better model discrimination. Moreover, this approach is further integrated with a previously proposed frame-level data selection approach, namely the normalized entropy based frame-level data selection, and a frame-level phone accuracy function for improving the MPE training. All experiments were performed on the Mandarin broadcast news corpus (MATBN), and the associated results initially demonstrated the feasibility of our proposed training data selection approaches.

National Taiwan Normal University Repository