Search CORE

1 research outputs found

Recent progress on the discriminative region-dependent transform for speech feature extraction

Author: Bing Zhang
Richard Schwartz
Spyros Matsoukas
Publication venue
Publication date
Field of study

The region-dependent transform (RDT) is a feature extraction method for speech recognition that employs the Minimum Phoneme Error (MPE) criterion to optimize a set of feature transforms, each concentrating on a region of the acoustic space. Previous results have shown that RDT gives significant recognitionerror reduction in a large vocabulary speaker-independent (SI) system. As a follow-up investigation, this paper presents the recent progress of applying RDT in speaker-adaptive training (SAT). Similar to previous SI results, the integration of RDT with SAT yields 7 % relative improvement in word error rate (WER). Also, theoretical comparisons are made between RDT and other discriminative feature extraction methods, including the improved version of the feature-space MPE (fMPE) that uses the “mean-offsets ” as additional input features. Index Terms: speech recognition, discriminative training, feature extraction, region-dependent transform

CiteSeerX