Handling disfluencies in spontaneous language models

Abstract

In automatic speech recognition, a stochastic language model (LM) predicts the probability of the next word on the basis of previously recognized words. For the recognition of dictated speech, this method works reasonably well, since sentences are typically well-formed and the probabilities can be estimated reliably from large amounts of written text material. For spontaneous speech, however, the situation is quite different: disfluencies distort the normal flow of sentences, and written transcripts of spontaneous speech are too scarce to train good stochastic LMs. Both factors contribute to the poor performance of automatic speech recognizers on spontaneous input. In this paper we investigate how one specific approach to disfluencies in spontaneous language modeling influences recognition performance.

Duchateau J., Laureys T., Demuynck K., Wambacq P., "Handling disfluencies in spontaneous language models", Computational linguistics in The Netherlands 2002 - selected papers from the thirteenth CLIN meeting. Series: Language and computers - studies in practical linguistics, vol. 47, pp. 39-50, Gaustad T. ed., 2003, Editions Rodopi B.V., Amsterdam/New York (13th computational linguistics in The Netherlands meeting - CLIN2002, November 29, 2002, Groningen, The Netherlands).

Status: published
