2 research outputs found
Maximum likelihood convolutional beamformer for simultaneous denoising and dereverberation
This article describes a probabilistic formulation of a Weighted Power
minimization Distortionless response convolutional beamformer (WPD). The WPD
unifies a weighted prediction error based dereverberation method (WPE) and a
minimum power distortionless response beamformer (MPDR) into a single
convolutional beamformer, and achieves simultaneous dereverberation and
denoising in an optimal way. However, the optimization criterion is obtained
simply by combining existing criteria without any clear theoretical
justification. This article presents a generative model and a probabilistic
formulation of a WPD, and derives an optimization algorithm based on a maximum
likelihood estimation. We also describe a method for estimating the steering
vector of the desired signal by utilizing WPE within the WPD framework to
provide an effective and efficient beamformer for denoising and
dereverberation.Comment: Accepted for EUSIPCO 2019. arXiv admin note: text overlap with
arXiv:1812.0840
Single-channel Speech Dereverberation via Generative Adversarial Training
In this paper, we propose a single-channel speech dereverberation system
(DeReGAT) based on convolutional, bidirectional long short-term memory and deep
feed-forward neural network (CBLDNN) with generative adversarial training
(GAT). In order to obtain better speech quality instead of only minimizing a
mean square error (MSE), GAT is employed to make the dereverberated speech
indistinguishable form the clean samples. Besides, our system can deal with
wide range reverberation and be well adapted to variant environments. The
experimental results show that the proposed model outperforms weighted
prediction error (WPE) and deep neural network-based systems. In addition,
DeReGAT is extended to an online speech dereverberation scenario, which reports
comparable performance with the offline case.Comment: 5 pages. Accepted by Interspeech 201