Search CORE

3 research outputs found

Progress in example based automatic speech recognition

Author: Demuynck Kris
Seppi Dino
Van Compernolle Dirk
Van hamme Hugo
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2011
Field of study

In this paper we present a number of improvements that were recently made to the template based speech recognition system developed at ESAT. Combining these improvements resulted in a decrease in word error rate from 9.6% to 8.2% on the Nov92, 20k trigram, Wall Street Journal task. The improvements are along different lines. Apart from the time warping already applied within the DTW, it was found beneficial to apply additional length compensation on the template score. The single best score was replaced by a weighted k-NN average, while maintaining natural successor information as an ensemble cost. The local geometry of the acoustic space is now taken into account by assigning a diagonal covariance matrix to each input frame. Context sensitivity of short templates is increased by taking cross boundary scores into account for sorting the N best templates. Furthermore boundaries on the template segmentations may be relaxed. Finally context dependent word templates are now being used for short words. Several other variants that were not retained in the final system are discussed as well. © 2011 IEEE.Demuynck K., Seppi D., Van hamme H., Van Compernolle D., ''Progress in example based automatic speech recognition'', 36th international conference on acoustics, speech and signal processing - ICASSP’2011, pp. 4692-4695, May 22-27, 2011, Prague, Czech Republic.status: publishe

Lirias

Progress in example based automatic speech recognition

Author
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date
Field of study

Crossref