Search CORE

1,144 research outputs found

Wavelet-based techniques for speech recognition

Author: Omar Farooq (7204418)
Publication venue
Publication date: 01/01/2002
Field of study

In this thesis, new wavelet-based techniques have been developed for the extraction of features from speech signals for the purpose of automatic speech recognition (ASR). One of the advantages of the wavelet transform over the short time Fourier transform (STFT) is its capability to process non-stationary signals. Since speech signals are not strictly stationary the wavelet transform is a better choice for time-frequency transformation of these signals. In addition it has compactly supported basis functions, thereby reducing the amount of computation as opposed to STFT where an overlapping window is needed. [Continues.

Machine Analysis of Facial Expressions

Author: Bartlett M.S.
Pantic M.
Publication venue: I-Tech Education and Publishing
Publication date: 01/01/2007
Field of study

No abstract

CiteSeerX

University of Twente Research Information

Wavelet speech enhancement based on time-scale adaptation

Author: Bahoura
Bahoura
Chen
Cohen
Deller
Donoho
Donoho
Donoho
Donoho
Ephraim
Ephraim
Gulzow
Jabloun
Jean Rouat
Johnstone
Mahmoudi
Mahmoudi
Mallat
Mohammed Bahoura
Pan
Sarikaya
Seok
Sika
Vidakovic
Xu
Zhang
Publication venue: 'Elsevier BV'
Publication date: 01/01/2006
Field of study

Abstract : We propose a new speech enhancement method based on time and scale adaptation of wavelet thresholds. The time dependency is introduced by approximating the Teager Energy of the wavelet coefficients, while the scale dependency is introduced by extending the principle of level dependent threshold to Wavelet Packet Thresholding. This technique does not require an explicit estimation of the noise level or of the apriori knowledge of the SNR, as is usually needed in most of the popular enhancement methods. Performance of the proposed method is evaluated on speech recorded in real conditions (plane, sawmill, tank, subway, babble, car, exhibition hall, restaurant, street, airport, and train station) and artificially added noise. MELscale decomposition based on wavelet packets is also compared to the common wavelet packet scale. Comparison in terms of Signal-to-Noise Ratio (SNR) is reported for time adaptation and time-scale adaptation thresholding of the wavelet coefficients thresholding. Visual inspection of spectrograms and listening experiments are also used to support the results. Hidden Markov Models Speech recognition experiments are conducted on the AURORA–2 database and show that the proposed method improves the speech recognition rates for low SNRs

Savoirs UdeS

Temporal and three-dimensional classification of objects in infra-red images: an investigation of Bayesian and FST approaches

Author: Connelly Andrew Peter
Publication venue: The University of Edinburgh
Publication date: 01/01/1998
Field of study