A speaker rediarization scheme for improving diarization in large two-speaker telephone datasets

Dean, David; Ghaemmaghami, Houman; Sridharan, Sridha

research

A speaker rediarization scheme for improving diarization in large two-speaker telephone datasets

Authors: David Dean
Houman Ghaemmaghami
Sridha Sridharan
Publication date: 1 January 2014
Publisher: European Association for Signal Processing
Doi

Abstract

In this paper we propose a novel scheme for carrying out speaker diarization in an iterative manner. We aim to show that the information obtained through the first pass of speaker diarization can be reused to refine and improve the original diarization results. We call this technique speaker rediarization and demonstrate the practical application of our rediarization algorithm using a large archive of two-speaker telephone conversation recordings. We use the NIST 2008 SRE summed telephone corpora for evaluating our speaker rediarization system. This corpus contains recurring speaker identities across independent recording sessions that need to be linked across the entire corpus. We show that our speaker rediarization scheme can take advantage of inter-session speaker information, linked in the initial diarization pass, to achieve a 30% relative improvement over the original diarization error rate (DER) after only two iterations of rediarization