156 research outputs found

Recognition of Noisy Speech: A Comparative Survey of Robust Model Architecture and Feature Enhancement

Publication venue: Springer
Publication date: 24/05/2009

A Bayesian Alternative to Gain Adaptation in Autoregressive Hidden Markov Models

Author: Barber David; Mesot Bertrand
Publication venue: IDIAP
Publication date: 11/02/2010

Models that operate directly on the raw acoustic speech signal are an alternative to conventional feature-based HMMs. A popular way to model the raw speech signal is by means of an autoregressive (AR) process. Because the AR process is too simple to cope with the nonlinearity of the speech signal, it is generally embedded into a more elaborate model, such as the switching autoregressive HMM (SAR-HMM). A fundamental issue faced by models based on AR processes is that they are very sensitive to variations in the amplitude of the signal. One way to overcome this limitation is to use Gain Adaptation, which adjusts the amplitude by maximising the likelihood of the observed signal. However, adjusting model parameters by maximising test likelihoods is fundamentally outside the framework of standard statistical approaches to machine learning, since it may lead to overfitting when the models are sufficiently flexible. We propose a statistically principled alternative based on an exact Bayesian procedure in which priors are explicitly defined on the parameters of the AR process. Specifically, we present the Bayesian SAR-HMM and compare its performance against the standard Gain-Adapted SAR-HMM on a single digit recognition task, showing the effectiveness of the approach and thereby suggesting a principled and straightforward solution to the issue of Gain Adaptation.

Infoscience - École polytechnique fédérale de Lausanne

Linear and nonlinear adaptive filtering and their applications to speech intelligibility enhancement

Author: Gu Y.H.
Publication venue: Technische Universiteit Eindhoven
Publication date: 01/01/1992

Pure OAI Repository

Single-Channel Online Enhancement of Speech Corrupted by Reverberation and Noise

Author: Betts Dave; Brookes Mike; Dmour Mohammad A.; Doire Clement Samuel Joseph; Hicks Christopher M.; Jensen Soren Holdt; Naylor Patrick A.
Publication venue: Institute of Electrical and Electronics Engineers (IEEE)
Publication date: 01/03/2017

VBN

Adaptive Hidden Markov Noise Modelling for Speech Enhancement

Author: Bai Jiongjun
Publication venue: Electrical and Electronic Engineering, Imperial College London
Publication date: 01/05/2013

A robust and reliable noise estimation algorithm is required in many speech enhancement systems. The aim of this thesis is to propose and evaluate a robust noise estimation algorithm for highly non-stationary noisy environments. We model the non-stationary noise using a set of discrete states, with each state representing a distinct noise power spectrum. The state sequence over time is conveniently represented by a Hidden Markov Model (HMM). We first present an online HMM re-estimation framework that models time-varying noise with an HMM and tracks changes in the noise characteristics through a sequential model update procedure applied during the absence of speech. In addition, the algorithm creates new model states when necessary to represent novel noise spectra and merges existing states that have similar characteristics. We then extend this work to robust noise estimation during speech activity by incorporating a speech model into the existing noise model. The noise characteristics within each state are updated based on a speech presence probability, which is derived from a modified minima-controlled recursive averaging method. We demonstrate the effectiveness of our noise HMM in tracking both stationary and highly non-stationary noise, and show that it gives improved performance over other conventional noise estimation methods when it is incorporated into a standard speech enhancement algorithm.

Spiral - Imperial College Digital Repository

Speech enhancement using voice source models

Author: Yasmin Anisa
Publication venue: University of Waterloo
Publication date: 01/01/1999

University of Waterloo's Institutional Repository

Relaxed statistical model for speech enhancement and a priori SNR estimation

Author: I. Cohen
Publication venue: Institute of Electrical and Electronics Engineers (IEEE)

Crossref