Near end listening enhancement in realistic environments

Abstract

Speech playback is harder to understand in noise. Near End Listening Enhancement algorithms try to overcomethe problem by enhancing the speech signal before it is played by a device. Different strategies have beentried, achieving variable degrees of success in specific noise conditions. Such technologies, however, are oftentested in artificial settings - with controlled noise sources and no reverberation. The purpose of this study isto compare a set of state-of-the-art algorithms based on different approaches (adaptive vs non-adaptive, withor without a compensation for reverberation) in simulated real- life scenarios. Binaural noise recordings andimpulse responses of real environments have been used to create two representative scenarios in which speechplayback may occur, namely a domestic and a public space. A preliminary study with N=24 subjects revealedthe need for higher SNRs in the realistic settings (in comparison to controlled noise conditions) in order to obtainthe same levels of intelligibility for plain speech. The goal of the main study is to assess the impact of noiseadaptivity and reverberation awareness in realistic scenarios, in order to better understand how to make speechplayback more robust to noise in real-life situations

    Similar works