Glottal Source Cepstrum Coefficients Applied to NIST SRE 2010

Gómez Vilda, Pedro; Martínez Olalla, Rafael; Mazaira Fernández, Luis Miguel; Muñoz, Cristina; Álvarez Marquina, Agustin

research

Glottal Source Cepstrum Coefficients Applied to NIST SRE 2010

Authors: Pedro Gómez Vilda
Rafael Martínez Olalla
Luis Miguel Mazaira Fernández
Cristina Muñoz
Agustin Álvarez Marquina
Publication date: 1 January 2010
Publisher: Facultad de Informática (UPM)

Abstract

Through the present paper, a novel feature set for speaker recognition based on glottal estimate information is presented. An iterative algorithm is used to derive the vocal tract and glottal source estimations from speech signal. In order to test the importance of glottal source information in speaker characterization, the novel feature set has been tested in the 2010 NIST Speaker Recognition Evaluation (NIST SRE10). The proposed system uses glottal estimate parameter templates and classical cepstral information to build a model for each speaker involved in the recognition process. ALIZE [1] open-source software has been used to create the GMM models for both background and target speakers. Compared to using mel-frequency cepstrum coefficients (MFCC), the misclassification rate for the NIST SRE 2010 reduced from 29.43% to 27.15% when glottal source features are use

Similar works

Full text

Open in the Core reader

Download PDF

Available Versions

Servicio de Coordinación de Bibliotecas de la Universidad Politécnica de Madrid

oai:oa.upm.es:7905

Last time updated on 10/02/2018

Archivo Digital UPM

oai:oa.upm.es:7905

Last time updated on 17/07/2013