
Preliminary intelligibility tests of a monaural speech segregation system

Authors: Barbara G. Shinn-Cunningham
Dan Ellis
Deliang Wang
Ke Hu
Pierre Divenyi
Zhaozhang Jin
Publication date: 02/04/2012

Human listeners are able to understand speech in the presence of a noisy background. How to simulate this perceptual ability remains a great challenge. This paper describes a preliminary evaluation of the intelligibility of the output of a monaural speech segregation system. The system performs speech segregation in two stages. The first stage segregates voiced speech using supervised learning of harmonic features, and the second stage segregates unvoiced speech by subtracting noise energy that is estimated from voiced intervals and onset/offset-based segmentation. Objective evaluation in terms of the match to ideal binary time-frequency masks shows substantial improvements. Tests with human subjects indicate that the system improves intelligibility for young listeners when the input SNR is very low, but does not aid elderly listeners. This preliminary evaluation identifies aspects of the system that should be improved in order to produce consistent improvement in intelligibility in noisy environments.

Index Terms: speech segregation, computational auditory scene analysis, ideal binary mask, supervised learning, onset/offse
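The abstract's objective evaluation compares the system's output against ideal binary time-frequency masks. As a rough illustration only, the sketch below builds an ideal binary mask from premixed speech and noise energies and scores an estimated mask against it. The function names, the 0 dB local criterion, and the hit-minus-false-alarm score are assumptions chosen for the example; the abstract does not state which match measure or threshold the authors used.

```python
import numpy as np


def ideal_binary_mask(speech_energy, noise_energy, lc_db=0.0):
    """Label a time-frequency unit 1 when its local SNR exceeds lc_db, else 0.

    speech_energy and noise_energy are arrays of per-unit energies taken
    from the premixed signals. lc_db is an assumed local criterion.
    """
    speech_energy = np.asarray(speech_energy, dtype=float)
    noise_energy = np.asarray(noise_energy, dtype=float)
    eps = np.finfo(float).eps  # avoids division by zero and log of zero
    local_snr_db = 10.0 * np.log10((speech_energy + eps) / (noise_energy + eps))
    return (local_snr_db > lc_db).astype(int)


def hit_minus_false_alarm(estimated_mask, ideal_mask):
    """Score an estimated mask: hit rate minus false-alarm rate.

    This is one possible match measure, used here as an assumption.
    """
    est = np.asarray(estimated_mask, dtype=bool)
    ideal = np.asarray(ideal_mask, dtype=bool)
    hits = np.logical_and(est, ideal).sum()
    false_alarms = np.logical_and(est, ~ideal).sum()
    hit_rate = hits / max(ideal.sum(), 1)        # share of target units kept
    fa_rate = false_alarms / max((~ideal).sum(), 1)  # share of noise units kept
    return float(hit_rate - fa_rate)


# Toy 2x2 example (made-up energies, not data from the paper).
speech = [[4.0, 1.0], [1.0, 9.0]]
noise = [[1.0, 4.0], [2.0, 1.0]]
ibm = ideal_binary_mask(speech, noise)
score = hit_minus_false_alarm([[1, 1], [0, 1]], ibm)
```

In this toy case the estimated mask keeps both speech-dominant units and wrongly keeps one of the two noise-dominant units, so the score is lower than the perfect value of 1.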

CiteSeerX