Multi-microphone speech recognition in everyday environments

Authors: Jon Barker, Ricard Marxer, Emmanuel Vincent, Shinji Watanabe
Publication venue: 'Elsevier BV'
Publication date: 22/02/2017

Multi-microphone signal processing techniques have the potential to greatly improve the robustness of automatic speech recognition (ASR) in distant microphone settings. However, in everyday environments, typified by complex non-stationary noise backgrounds, designing effective multi-microphone speech recognition systems is non-trivial. In particular, optimal performance requires the tight integration of the front-end signal processing and the back-end statistical speech and noise source modelling. The best way to achieve this in a modern deep learning speech recognition framework remains unclear. Further, variability in microphone array design (and the consequent lack of real training data for any particular configuration) may mean that systems have to be able to generalise from audio captured using mismatched microphone geometries or produced using simulation

Crossref

INRIA a CCSD electronic archive server

White Rose Research Online

HAL-Rennes 1