Search CORE

2 research outputs found

Maximum likelihood convolutional beamformer for simultaneous denoising and dereverberation

Author: Kinoshita Keisuke
Nakatani Tomohiro
Publication venue
Publication date: 06/08/2019
Field of study

This article describes a probabilistic formulation of a Weighted Power minimization Distortionless response convolutional beamformer (WPD). The WPD unifies a weighted prediction error based dereverberation method (WPE) and a minimum power distortionless response beamformer (MPDR) into a single convolutional beamformer, and achieves simultaneous dereverberation and denoising in an optimal way. However, the optimization criterion is obtained simply by combining existing criteria without any clear theoretical justification. This article presents a generative model and a probabilistic formulation of a WPD, and derives an optimization algorithm based on a maximum likelihood estimation. We also describe a method for estimating the steering vector of the desired signal by utilizing WPE within the WPD framework to provide an effective and efficient beamformer for denoising and dereverberation.Comment: Accepted for EUSIPCO 2019. arXiv admin note: text overlap with arXiv:1812.0840

arXiv.org e-Print Archive

Single-channel Speech Dereverberation via Generative Adversarial Training

Author: Li Chenxing
Wang Tieqiang
Xu Bo
Xu Shuang
Publication venue
Publication date: 25/06/2018
Field of study

In this paper, we propose a single-channel speech dereverberation system (DeReGAT) based on convolutional, bidirectional long short-term memory and deep feed-forward neural network (CBLDNN) with generative adversarial training (GAT). In order to obtain better speech quality instead of only minimizing a mean square error (MSE), GAT is employed to make the dereverberated speech indistinguishable form the clean samples. Besides, our system can deal with wide range reverberation and be well adapted to variant environments. The experimental results show that the proposed model outperforms weighted prediction error (WPE) and deep neural network-based systems. In addition, DeReGAT is extended to an online speech dereverberation scenario, which reports comparable performance with the offline case.Comment: 5 pages. Accepted by Interspeech 201

arXiv.org e-Print Archive