3 research outputs found

    Progress in example based automatic speech recognition

    No full text
    In this paper we present a number of improvements that were recently made to the template based speech recognition system developed at ESAT. Combining these improvements resulted in a decrease in word error rate from 9.6% to 8.2% on the Nov92, 20k trigram, Wall Street Journal task. The improvements are along different lines. Apart from the time warping already applied within the DTW, it was found beneficial to apply additional length compensation on the template score. The single best score was replaced by a weighted k-NN average, while maintaining natural successor information as an ensemble cost. The local geometry of the acoustic space is now taken into account by assigning a diagonal covariance matrix to each input frame. Context sensitivity of short templates is increased by taking cross boundary scores into account for sorting the N best templates. Furthermore boundaries on the template segmentations may be relaxed. Finally context dependent word templates are now being used for short words. Several other variants that were not retained in the final system are discussed as well. © 2011 IEEE.Demuynck K., Seppi D., Van hamme H., Van Compernolle D., ''Progress in example based automatic speech recognition'', 36th international conference on acoustics, speech and signal processing - ICASSP’2011, pp. 4692-4695, May 22-27, 2011, Prague, Czech Republic.status: publishe

    Progress in example based automatic speech recognition

    No full text
    corecore