Time and Memory Efficient Viterbi Decoding for LVCSR Using a Precompiled Search Network

Authors: Daniel Willegg, Erik McDermott, Shigeru Katagiri, Yasuhiro Minami
Publication date: 21/11/2007

In this paper, we present our recently developed time-synchronous speech recognition decoder, which represents the search space of Large Vocabulary Continuous Speech Recognition (LVCSR) in a single precompiled network. In particular, we outline our approaches for time- and memory-efficient Viterbi decoding in this scenario. These include reducing fast-memory requirements by keeping the search network on disk and loading only the required parts on demand. Evaluations are carried out on a difficult Japanese LVCSR task that involves a back-off trigram language model and full cross-word dependent triphone acoustic models. This time and memory efficiency enables real-time Viterbi decoding of entire lecture speeches in a single time-synchronous pass with a search error of less than 1%.
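The two ideas named in the abstract, time-synchronous Viterbi decoding and loading parts of a disk-resident network on demand, can be illustrated with a small sketch. This is not the authors' decoder: the abstract does not describe their data structures, so the toy network, the JSON-lines file layout, the byte-offset index, the LRU cache size, and all function names below are assumptions made for illustration only. Discrete symbols stand in for acoustic model scores.

```python
import json
import math
import os
import tempfile
from functools import lru_cache

# Hypothetical toy network: state -> list of (next_state, symbol, log_prob).
NETWORK = {
    0: [(1, "a", math.log(0.6)), (2, "b", math.log(0.4))],
    1: [(1, "a", math.log(0.5)), (3, "b", math.log(0.5))],
    2: [(3, "a", math.log(1.0))],
    3: [],
}


def write_network(path, network):
    """Write one JSON record per state; return a state -> byte offset index."""
    index = {}
    with open(path, "wb") as f:
        for state, arcs in network.items():
            index[state] = f.tell()
            f.write((json.dumps(arcs) + "\n").encode("utf-8"))
    return index


def make_loader(path, index, cache_size=2):
    """Return a function that reads a state's arcs from disk on demand.

    Only the small index stays in memory; an LRU cache keeps the most
    recently used states (cache_size is an arbitrary illustrative value).
    """
    @lru_cache(maxsize=cache_size)
    def load_arcs(state):
        with open(path, "rb") as f:
            f.seek(index[state])
            return tuple(tuple(arc) for arc in json.loads(f.readline()))
    return load_arcs


def viterbi(observations, load_arcs, start=0, beam=10.0):
    """Advance all active hypotheses one observation at a time (beam-pruned)."""
    active = {start: (0.0, [start])}  # state -> (best log score, best path)
    for symbol in observations:
        next_active = {}
        for state, (score, path) in active.items():
            for nxt, emitted, logp in load_arcs(state):
                if emitted != symbol:
                    continue
                cand = score + logp
                # Viterbi recombination: keep only the best path per state.
                if nxt not in next_active or cand > next_active[nxt][0]:
                    next_active[nxt] = (cand, path + [nxt])
        if not next_active:
            return None  # no hypothesis survives this observation
        best = max(s for s, _ in next_active.values())
        active = {st: v for st, v in next_active.items() if v[0] >= best - beam}
    return max(active.values(), key=lambda v: v[0])


def demo():
    with tempfile.TemporaryDirectory() as tmp:
        path = os.path.join(tmp, "network.jsonl")
        index = write_network(path, NETWORK)
        return viterbi(["a", "b"], make_loader(path, index))
```

Calling `demo()` decodes the observation sequence `["a", "b"]` and returns the best log score together with the state path. In this sketch only the states reached by active hypotheses are ever read from the file, which is the general property the abstract attributes to on-demand loading.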

CiteSeerX