2 research outputs found
Robust topic inference for latent semantic language model adaptation
We perform topic-based, unsupervised language model adap-tation under an N-best rescoring framework by using previous-pass system hypotheses to infer a topic mixture which is used to select topic-dependent LMs for interpolation with a topic-independent LM. Our primary focus is on techniques for im-proving the robustness of topic inference for a given utterance with respect to recognition errors, including the use of ASR confidence and contextual information from surrounding ut-terances. We describe a novel application of metadata-based pseudo-story segmentation to language model adaptation, and present good improvements to character error rate on multi-genre GALE Project data in Mandarin Chinese. Index Terms β language model adaptation, topic model-ing, unsupervised adaptation, speech recognition, story seg-mentation 1