Speech Feature Smoothing for Robust ASR

Bilmes, Jeff; Chen, Chia-Ping; Ellis, Daniel P. W.

Speech Feature Smoothing for Robust ASR

Authors: Jeff Bilmes
Chia-Ping Chen
Daniel P. W. Ellis
Publication date: 1 January 2005
Publisher: 'Columbia University Libraries/Information Services'
Doi

Abstract

In this paper, we evaluate smoothing within the context of the MVA (mean subtraction, variance normalization, and ARMA filtering) post-processing scheme for noise-robust automatic speech recognition. MVA has shown great success in the past on the Aurora 2.0 and 3.0 corpora even though it is computationally inexpensive. Herein, MVA is applied to many acoustic feature extraction methods, and is evaluated using Aurora 2.0. We evaluate MVA post-processing on MFCCs, LPCs, PLPs, RASTA, Tandem, Modulation-filtered Spectrogram, and Modulation Cross- CorreloGram features. We conclude that while effectiveness does depend on the extraction method, the majority of features benefit significantly from MVA, and the smoothing ARMA filter is an important component. It appears that the effectiveness of normalization and smoothing depends on the domain in which it is applied, being most fruitfully applied just before being scored by a probabilistic model. Moreover, since it is both effective and simple, our ARMA filter should be considered a candidate method in most noise-robust speech recognition tasks

Similar works

Full text

Open in the Core reader

Download PDF

Available Versions

Sustaining member

Columbia University Academic Commons

oai:academiccommons.columbia.e...

Last time updated on 02/10/2018