Handling disfluencies in spontaneous language models

Demuynck, Kris; Duchateau, Jacques; Laureys, Tom; Wambacq, Patrick

Handling disfluencies in spontaneous language models

Authors: Kris Demuynck
Jacques Duchateau
Tom Laureys
Patrick Wambacq
Publication date: 1 January 2003
Publisher: Editions Rodopi B.V. Amsterdam/New York

Abstract

In automatic speech recognition, a stochastic language model (LM) predicts the probability of the next word on the basis of previously recognized words. For the recognition of dictated speech this method works reasonably well since sentences are typically well-formed and reliable estimation of the probabilities is possible on the basis of large amounts of written text material. However, for spontaneous speech the situation is quite different: disfluencies distort the normal flow of sentences and written transcripts of spontaneous speech are too scarce to train good stochastic LMs. Both factors contribute to the poor performance of automatic speech recognizers on spontaneous input. In this paper we investigate how one specific approach to disfluencies in spontaneous language modeling influences recognition performance.Duchateau J., Laureys T., Demuynck K., Wambacq P., ''Handling disfluencies in spontaneous language models'', Computational linguistics in The Netherlands 2002 - selected papers from the thirteenth CLIN meeting. Series : language and computers - studies in practical linguistics, vol. 47, pp. 39-50, Gaustad T. ed., 2003, Editions Rodopi B.V., Amsterdam/New York (13th computational linguistics in The Netherlands meeting - CLIN2002, November 29, 2002, Groningen, The Netherlands).status: publishe

Similar works

Full text

Available Versions

Lirias

oai:lirias2repo.kuleuven.be:12...

Last time updated on 10/12/2019