1 research outputs found

    INTERSPEECH 2007 Using Direction of Arrival Estimate and Acoustic Feature Information in Speaker Diarization

    No full text
    This paper describes the I 2 R/NTU system submitted for the NIST Rich Transcription 2007 (RT-07) Meeting Recognition evaluation Multiple Distant Microphone (MDM) task. In our implementation, the Direction of Arrival (DOA) information is specifically used to perform speaker turn detection and clustering. Cluster purification is then carried out by performing GMM modeling on acoustic features. Finally, non-speech & silence removal is effected to remove unwanted segments. The system achieved an overall DER of 31.02 % on the NIST Rich Transcription Spring 2006 evaluation tasks. Index Terms: speaker diarization, direction of arrival, GMM clusterin
    corecore