Search CORE

57 research outputs found

Recommended from our members

Efficient alternatives to the Ephraim and Malah suppression rule for audio signal enhancement

Author: Godsill SJ
Wolfe PJ
Publication venue: EURASIP Journal on Applied Signal Processing
Publication date: 12/12/2011
Field of study

Audio signal enhancement often involves the application of a time-varying filter, or suppression rule, to the frequency-domain transform of a corrupted signal. Here we address suppression rules derived under a Gaussian model and interpret them as spectral estimators in a Bayesian statistical framework. With regard to the optimal spectral amplitude estimator of Ephraim and Malah, we show that under the same modelling assumptions, alternative methods of Bayesian estimation lead to much simpler suppression rules exhibiting similarly effective behaviour. We derive three of such rules and demonstrate that, in addition to permitting a more straightforward implementation, they yield a more intuitive interpretation of the Ephraim and Malah solution

Apollo (Cambridge)

Multi-channel Feature Enhancement for Robust Speech Recognition

Author: Emanuele Principi
Francesco Piazza
Rudy Rotili
Simone Cifani
Stefano Squartini
Publication venue: 'IntechOpen'
Publication date: 01/01/2011
Field of study

IntechOpen

IRIS UniversitÃ Politecnica delle Marche

Model-based analysis of noisy musical recordings with application to audio restoration

Author: Esquef Paulo A. A.
Publication venue: Teknillinen korkeakoulu
Publication date: 02/04/2004
Field of study

This thesis proposes digital signal processing algorithms for noise reduction and enhancement of audio signals. Approximately half of the work concerns signal modeling techniques for suppression of localized disturbances in audio signals, such as impulsive noise and low-frequency pulses. In this regard, novel algorithms and modifications to previous propositions are introduced with the aim of achieving a better balance between computational complexity and qualitative performance, in comparison with other schemes presented in the literature. The main contributions related to this set of articles are: an efficient algorithm for suppression of low-frequency pulses in audio signals; a scheme for impulsive noise detection that uses frequency-warped linear prediction; and two methods for reconstruction of audio signals within long gaps of missing samples. The remaining part of the work discusses applications of sound source modeling (SSM) techniques to audio restoration. It comprises application examples, such as a method for bandwidth extension of guitar tones, and discusses the challenge of model calibration based on noisy recorded sources. Regarding this matter, a frequency-selective spectral analysis technique called frequency-zooming ARMA (FZ-ARMA) modeling is proposed as an effective way to estimate the frequency and decay time of resonance modes associated with the partials of a given tone, despite the presence of corrupting noise in the observable signal.reviewe

Aaltodoc Publication Archive

Spectral masking and filtering

Author: Benaroya
Berouti
Boll
Brandt
Breithaupt
Breithaupt
Breithaupt
Cappé
Cauchi
Cohen
Ephraim
Ephraim
Erkelens
Esch
Gerkmann
Gerkmann
Gerkmann
Gradshteyn
Jensen
Lebart
Li
Lotter
Madhu
Martin
Martin
Martin
Martin
Porter
Roweis
Schreier
Vincent
Vincent
Vincent
Wang
Wolfe
You
Publication venue: 'Royal College of Obstetricians & Gynaecologists (RCOG)'
Publication date: 03/08/2018
Field of study

International audienc

Crossref

INRIA a CCSD electronic archive server

Speech enhancement Algorithm based on super-Gaussian modeling and orthogonal polynomials

Author: Abdulhussain Sadiq H.
Al-Obeidat Feras
Baker Thar
Jassim Wissam A.
Mahmmod Basheera M.
Ramli Abd Rahman
Publication venue: ZU Scholars
Publication date: 01/01/2019
Field of study

© 2020 Lippincott Williams and Wilkins. All rights reserved. Different types of noise from the surrounding always interfere with speech and produce annoying signals for the human auditory system. To exchange speech information in a noisy environment, speech quality and intelligibility must be maintained, which is a challenging task. In most speech enhancement algorithms, the speech signal is characterized by Gaussian or super-Gaussian models, and noise is characterized by a Gaussian prior. However, these assumptions do not always hold in real-life situations, thereby negatively affecting the estimation, and eventually, the performance of the enhancement algorithm. Accordingly, this paper focuses on deriving an optimum low-distortion estimator with models that fit well with speech and noise data signals. This estimator provides minimum levels of speech distortion and residual noise with additional improvements in speech perceptual aspects via four key steps. First, a recent transform based on an orthogonal polynomial is used to transform the observation signal into a transform domain. Second, the noise classification based on feature extraction is adopted to find accurate and mutable models for noise signals. Third, two stages of nonlinear and linear estimators based on the minimum mean square error (MMSE) and new models for speech and noise are derived to estimate a clean speech signal. Finally, the estimated speech signal in the time domain is determined by considering the inverse of the orthogonal transform. The results show that the average classification accuracy of the proposed approach is 99.43%. In addition, the proposed algorithm significantly outperforms existing speech estimators in terms of quality and intelligibility measures

ZU Scholars (Zayed University)

University of Brighton Research Portal

Model-Based Speech Enhancement in the Modulation Domain

Author: Brookes DM
Wang Y
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 18/12/2017
Field of study

This paper presents an algorithm for modulationdomain speech enhancement using a Kalman filter. The proposed estimator jointly models the estimated dynamics of the spectral amplitudes of speech and noise to obtain an MMSE estimation of the speech amplitude spectrum with the assumption that the speech and noise are additive in the complex domain. In order to include the dynamics of noise amplitudes with those of speech amplitudes, we propose a statistical “Gaussring” model that comprises a mixture of Gaussians whose centres lie in a circle on the complex plane. The performance of the proposed algorithm is evaluated using the perceptual evaluation of speech quality (PESQ) measure, segmental SNR (segSNR) measure and shorttime objective intelligibility (STOI) measure. For speech quality measures, the proposed algorithm is shown to give a consistent improvement over a wide range of SNRs when compared to competitive algorithms. Speech recognition experiments also show that the Gaussring model based algorithm performs well for two types of noise

Spiral - Imperial College Digital Repository