Reconstruction-based speech enhancement from robust acoustic features

Ahmadi; Ben Milner; Boll; Cappe; Carmona; Chen; Cohen; Darch; de Cheveigné; Ephraim; Ephraim; Gales; Gauvain; Gerkmann; Gonzalez; Hu; Hu; Hu; Jensen; Kawahara; Leggetter; Loizou; Makhoul; Martin; Martin; McAulay; Milner; Milner; Mohammadiha; Oppenheim; Paliwal; Philip Harding; Rangachari; Reynolds; Stylianou; Syrdal; Varga; Xiao; Yan; Zen

research

Reconstruction-based speech enhancement from robust acoustic features

Authors: Ahmadi
Ben Milner
Boll
Cappe
Carmona
Chen
Cohen
Darch
de Cheveigné
Ephraim
Ephraim
Gales
Gauvain
Gerkmann
Gonzalez
Hu
Hu
Hu
Jensen
Kawahara
Leggetter
Loizou
Makhoul
Martin
Martin
McAulay
Milner
Milner
Mohammadiha
Oppenheim
Paliwal
Philip Harding
Rangachari
Reynolds
Stylianou
Syrdal
Varga
Xiao
Yan
Zen
Publication date: 17 October 2015
Publisher: 'Elsevier BV'
Doi

Abstract

This paper proposes a method of speech enhancement where a clean speech signal is reconstructed from a sinusoidal model of speech production and a set of acoustic speech features. The acoustic features are estimated from noisy speech and comprise, for each frame, a voicing classification (voiced, unvoiced or non-speech), fundamental frequency (for voiced frames) and spectral envelope. Rather than using different algorithms to estimate each parameter, a single statistical model is developed. This comprises a set of acoustic models and has similarity to the acoustic modelling used in speech recognition. This allows noise and speaker adaptation to be applied to acoustic feature estimation to improve robustness. Objective and subjective tests compare reconstruction-based enhancement with other methods of enhancement and show the proposed method to be highly effective at removing noise

Similar works

Full text

Open in the Core reader

Download PDF

Available Versions

University of East Anglia digital repository

oai:ueaeprints.uea.ac.uk:55856

Last time updated on 28/06/2016

Crossref

info:doi/10.1016%2Fj.specom.20...

Last time updated on 05/06/2019