Student-teacher training with diverse decision tree ensembles

Gales, MJF; Wong, JHM

Student-teacher training with diverse decision tree ensembles

Authors: MJF Gales
JHM Wong
Publication date: 20 August 2017
Publisher: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
Doi

Abstract

Student-teacher training allows a large teacher model or ensemble of teachers to be compressed into a single student model, for the purpose of efficient decoding. However, current approaches in automatic speech recognition assume that the state clusters, often defined by Phonetic Decision Trees (PDT), are the same across all models. This limits the diversity that can be captured within the ensemble, and also the flexibility when selecting the complexity of the student model output. This paper examines an extension to student-teacher training that allows for the possibility of having different PDTs between teachers, and also for the student to have a different PDT from the teacher. The proposal is to train the student to emulate the logical context dependent state posteriors of the teacher, instead of the frame posteriors. This leads to a method of mapping frame posteriors from one PDT to another. This approach is evaluated on three speech recognition tasks: the Tok Pisin and Javanese low resource conversational telephone speech tasks from the IARPA Babel programme, and the HUB4 English broadcast news task

Similar works

Full text

Open in the Core reader

Download PDF

Available Versions

Sustaining member

Apollo (Cambridge)

oai:www.repository.cam.ac.uk:1...

Last time updated on 04/09/2019

Crossref

info:doi/10.21437%2Finterspeec...

Last time updated on 06/08/2021