2 research outputs found
Gray Jedi MVDR Post-filtering
Spatial filters can exploit deep-learning-based speech enhancement models to
increase their reliability in scenarios with multiple speech sources scenarios.
To further improve speech quality, it is common to perform postfiltering on the
estimated target speech obtained with spatial filtering. In this work, Minimum
Variance Distortionless Response (MVDR) is employed to provide the interference
estimation, along with the estimation of the target speech, to be later used
for postfiltering. This improves the enhancement performance over a
single-input baseline in a far more significant way than by increasing the
model's complexity. Results suggest that less computing resources are required
for postfiltering when provided with both target and interference signals,
which is a step forward in developing an online speech enhancement system for
multi-speech scenarios.Comment: \c{opyright} 2023 IEEE. Personal use of this material is permitted.
Permission from IEEE must be obtained for all other uses, in any current or
future media, including reprinting/republishing this material for advertising
or promotional purposes, creating new collective works, for resale or
redistribution to servers or lists, or reuse of any copyrighted component of
this work in other work