Search CORE

83 research outputs found

A summary of the REVERB challenge: state-of-the-art and remaining challenges in reverberant speech processing research

Author
Publication venue: Springer
Publication date: 18/01/2016
Field of study

Feature enhancement of reverberant speech by distribution matching and non-negative matrix factorization

Author: A Sehr
B Kingsbury
CH Knapp
G Hinton
H Xu
JF Gemmeke
KJ Palomäki
S Thomas
SM Pizer
T Virtanen
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2015
Field of study

This paper describes a novel two-stage dereverberation feature enhancement method for noise-robust automatic speech recognition. In the first stage, an estimate of the dereverberated speech is generated by matching the distribution of the observed reverberant speech to that of clean speech, in a decorrelated transformation domain that has a long temporal context in order to address the effects of reverberation. The second stage uses this dereverberated signal as an initial estimate within a non-negative matrix factorization framework, which jointly estimates a sparse representation of the clean speech signal and an estimate of the convolutional distortion. The proposed feature enhancement method, when used in conjunction with automatic speech recognizer back-end processing, is shown to improve the recognition performance compared to three other state-of-the-art techniques

Crossref

Springer - Publisher Connector

Aaltodoc Publication Archive

White Rose Research Online

The PASCAL CHiME Speech Separation and Recognition Challenge

Author: Barker Jon
Christensen Heidi
Green Phil
Ma Ning
Vincent Emmanuel
Publication venue: 'Elsevier BV'
Publication date: 01/02/2013
Field of study

International audienceDistant microphone speech recognition systems that operate with humanlike robustness remain a distant goal. The key difficulty is that operating in everyday listening conditions entails processing a speech signal that is reverberantly mixed into a noise background composed of multiple competing sound sources. This paper describes a recent speech recognition evaluation that was designed to bring together researchers from multiple communities in order to foster novel approaches to this problem. The task was to identify keywords from sentences reverberantly mixed into audio backgrounds binaurally-recorded in a busy domestic environment. The challenge was designed to model the essential difficulties of multisource environment problem while remaining on a scale that would make it accessible to a wide audience. Compared to previous ASR evaluation a particular novelty of the task is that the utterances to be recognised were provided in a continuous audio background rather than as pre-segmented utterances thus allowing a range of background modelling techniques to be employed. The challenge attracted thirteen submissions. This paper describes the challenge problem, provides an overview of the systems that were entered and provides a comparison alongside both a baseline recognition system and human performance. The paper discusses insights gained from the challenge and lessons learnt for the design of future such evaluations

HAL-CentraleSupelec

INRIA a CCSD electronic archive server

HAL-Rennes 1