VOICE ACTIVITY DETECTION USING A SLIDING-WINDOW, MAXIMUM MARGIN CLUSTERING APPROACH

Abstract

ABSTRACT Recently, an unsupervised, data clustering algorithm based on maximum margin, i.e. support vector machine (SVM) was reported. The maximum margin clustering (MMC) algorithm was later applied to the problem of voice activity detection, however, the application did not allow for real-time detection which is important in speech processing applications. In this paper, we propose a voice activity detector (VAD) based on a sliding window, MMC algorithm which allows for real-time detection. Our system requires a separate initialization stage which imposes an initial detection delay, however, once initialized the system can operate in real-time. Using TIMIT speech under several NOISEX-92 noise backgrounds at various SNRs, we show that our average speech and non-speech hit rates are better than state-of-the-art VADs

    Similar works