Search CORE

10,549 research outputs found

Deep Learning for Environmentally Robust Speech Recognition: An Overview of Recent Developments

Author: Geiger Jürgen
Jin Wenyu
Mousa Amr El-Desoky
Pohjalainen Jouni
Schuller Björn
Zhang Zixing
Publication venue
Publication date: 01/01/2018
Field of study

Eliminating the negative effect of non-stationary environmental noise is a long-standing research topic for automatic speech recognition that stills remains an important challenge. Data-driven supervised approaches, including ones based on deep neural networks, have recently emerged as potential alternatives to traditional unsupervised approaches and with sufficient training, can alleviate the shortcomings of the unsupervised methods in various real-life acoustic environments. In this light, we review recently developed, representative deep learning approaches for tackling non-stationary additive and convolutional degradation of speech with the aim of providing guidelines for those involved in the development of environmentally robust speech recognition systems. We separately discuss single- and multi-channel techniques developed for the front-end and back-end of speech recognition systems, as well as joint front-end and back-end training frameworks

arXiv.org e-Print Archive

OPUS Augsburg

Speech Recognition Under Noise Conditions: Compensation Methods

Author: Angel de la Torre
Antonio J. Rubio
Carmen Benitez
Javier Ramirez Luz Garcia
Jose C. Segura
Publication venue: 'IntechOpen'
Publication date: 01/06/2007
Field of study

IntechOpen

Speech Recognition in Unknown Noisy Conditions

Author: Baochun Hou
Ji Ming
Publication venue: 'IntechOpen'
Publication date: 01/06/2007
Field of study

IntechOpen

A Family of Stereo-Based Stochastic Mapping Algorithms for Noisy Speech Recognition

Author: Mohamed Afify
Xiaodong Cui
Yuqing Gao
Publication venue: 'IntechOpen'
Publication date: 01/11/2008
Field of study

IntechOpen

Crossref

Autocorrelation-based Methods for Noise-Robust Speech Recognition

Author: Gholamreza Farahani
Mohammad Ahadi
Mohammad Mehdi Homayounpour
Publication venue: 'IntechOpen'
Publication date: 01/06/2007
Field of study

IntechOpen

Histogram equalization for robust text-independent speaker verification in telephone environments

Author: Skosan Marshalleno
Publication venue: Department of Electrical Engineering
Publication date: 01/01/2005
Field of study

Word processed copy. Includes bibliographical references

Cape Town University OpenUCT

Reconstruction-based speech enhancement from robust acoustic features

Author: Ahmadi
Ben Milner
Boll
Cappe
Carmona
Chen
Cohen
Darch
de Cheveigné
Ephraim
Ephraim
Gales
Gauvain
Gerkmann
Gonzalez
Hu
Hu
Hu
Jensen
Kawahara
Leggetter
Loizou
Makhoul
Martin
Martin
McAulay
Milner
Milner
Mohammadiha
Oppenheim
Paliwal
Philip Harding
Rangachari
Reynolds
Stylianou
Syrdal
Varga
Xiao
Yan
Zen
Publication venue: 'Elsevier BV'
Publication date: 17/10/2015
Field of study

This paper proposes a method of speech enhancement where a clean speech signal is reconstructed from a sinusoidal model of speech production and a set of acoustic speech features. The acoustic features are estimated from noisy speech and comprise, for each frame, a voicing classification (voiced, unvoiced or non-speech), fundamental frequency (for voiced frames) and spectral envelope. Rather than using different algorithms to estimate each parameter, a single statistical model is developed. This comprises a set of acoustic models and has similarity to the acoustic modelling used in speech recognition. This allows noise and speaker adaptation to be applied to acoustic feature estimation to improve robustness. Objective and subjective tests compare reconstruction-based enhancement with other methods of enhancement and show the proposed method to be highly effective at removing noise

Crossref

University of East Anglia digital repository