Foreground-Background Ambient Sound Scene Separation
Ambient sound scenes typically comprise multiple short events occurring on
top of a somewhat stationary background. We consider the task of separating
these events from the background, which we call foreground-background ambient
sound scene separation. We propose a deep learning-based separation framework
with a suitable feature normalization scheme and an optional auxiliary network
capturing the background statistics, and we investigate its ability to handle
the great variety of sound classes encountered in ambient sound scenes, which
have often not been seen in training. To do so, we create single-channel
foreground-background mixtures using isolated sounds from the DESED and
AudioSet datasets, and we conduct extensive experiments with mixtures of seen
or unseen sound classes at various signal-to-noise ratios. Our experimental
findings demonstrate the generalization ability of the proposed approach.
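The mixture-creation step described above amounts to scaling the background so that the sum reaches a target signal-to-noise ratio. A minimal sketch with synthetic signals (the function name `mix_at_snr` and the scaling convention are illustrative assumptions, not taken from the paper):

```python
import numpy as np

def mix_at_snr(foreground, background, snr_db):
    """Scale the background so the mixture has the requested SNR in dB.

    Illustrative helper, not the paper's code: the foreground is kept
    at its original level and only the background is rescaled.
    """
    fg_power = np.mean(foreground ** 2)
    bg_power = np.mean(background ** 2)
    # Gain applied to the background to reach the target SNR.
    gain = np.sqrt(fg_power / (bg_power * 10 ** (snr_db / 10)))
    return foreground + gain * background

rng = np.random.default_rng(0)
fg = rng.standard_normal(16000)   # stand-in for an isolated event
bg = rng.standard_normal(16000)   # stand-in for a stationary background
mix = mix_at_snr(fg, bg, snr_db=6.0)
```

By construction the achieved SNR, `10 * log10(mean(fg**2) / mean((mix - fg)**2))`, equals the requested value exactly.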
Single-channel online enhancement of speech corrupted by reverberation and noise
This paper proposes an online single-channel speech enhancement method designed to improve the quality of speech degraded by reverberation and noise. Based on an autoregressive model for the reverberation power and on a hidden Markov model for clean speech production, a Bayesian filtering formulation of the problem is derived, and online joint estimation of the acoustic parameters and mean speech, reverberation, and noise powers is obtained in mel-frequency bands. From these estimates, a real-valued spectral gain is derived and spectral enhancement is applied in the short-time Fourier transform (STFT) domain. The method yields state-of-the-art performance and greatly reduces the effects of reverberation and noise while improving speech quality and preserving speech intelligibility in challenging acoustic environments.
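The final step, applying a real-valued spectral gain in the STFT domain, can be illustrated on a single frame with a generic Wiener-style gain. This is a toy sketch, not the paper's method: here the per-bin noise power estimate is simply taken as an input, whereas the paper obtains its estimates by online Bayesian filtering in mel-frequency bands.

```python
import numpy as np

def wiener_gain_frame(noisy_frame, noise_psd):
    """Apply a real-valued Wiener-style gain to one STFT frame.

    noise_psd: per-bin noise power estimate (assumed given here; in the
    paper such quantities are estimated online in mel bands).
    """
    X = np.fft.rfft(noisy_frame)
    noisy_psd = np.abs(X) ** 2
    # A priori SNR estimate via power subtraction, floored at zero.
    snr = np.maximum(noisy_psd / noise_psd - 1.0, 0.0)
    gain = snr / (snr + 1.0)  # real-valued gain in [0, 1)
    # A real gain scales magnitudes while leaving phases untouched.
    return np.fft.irfft(gain * X, len(noisy_frame))
```

Bins where the observed power barely exceeds the noise estimate are attenuated toward zero, while high-SNR bins pass through nearly unchanged.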