4 research outputs found

On the use of i-vector posterior distributions in Probabilistic Linear Discriminant Analysis

Author: Cumani Sandro
Laface Pietro
Plchot O.
Publication venue: IEEE - INST ELECTRICAL ELECTRONICS ENGINEERS INC
Publication date: 01/01/2014

The i-vector extraction process is affected by several factors such as the noise level, the acoustic content of the observed features, the channel mismatch between the training conditions and the test data, and the duration of the analyzed speech segment. These factors influence both the i-vector estimate and its uncertainty, represented by the i-vector posterior covariance. This paper presents a new PLDA model that, unlike the standard one, exploits the intrinsic i-vector uncertainty. Since the recognition accuracy is known to decrease for short speech segments, and their length is one of the main factors affecting the i-vector covariance, we designed a set of experiments comparing the standard and the new PLDA models on short speech cuts of variable duration, randomly extracted from the conversations included in the NIST SRE 2010 extended dataset, both from interviews and telephone conversations. Our results on NIST SRE 2010 evaluation data show that, across different conditions, the new model outperforms the standard PLDA by more than 10% relative when tested on short segments with duration mismatches, and maintains the accuracy of the standard model for sufficiently long speaker segments. This technique has also been successfully tested in the NIST SRE 2012 evaluation.

Crossref

PORTO@iris (Publications Open Repository TOrino - Politecnico di Torino)

From single to multiple enrollment i-vectors: Practical PLDA scoring variants for speaker verification

Author: Anton Afanasyev
Padmanabhan Rajan
Tomi Kinnunen
Ville Hautamäki
Publication venue: Elsevier BV
Publication date

Crossref

Detecting autism, emotions and social signals using AdaBoost

Author: Busa-Fekete Róbert
Gosztolya Gábor
Tóth László
Publication venue: Interspeech
Publication date: 01/01/2013

SZTE Publicatio Repozitórium - SZTE - Repository of Publications