A Hybrid Parameterization Technique for Speaker Identification

Fernández-Baillo Gallego de la Sacristana, Roberto; Gómez Vilda, Pedro; Martínez Olalla, Rafael; Mazaira Fernández, Luis Miguel; Muñoz, Cristina; Nieto Lluis, Victor; Rodellar Biarge, M. Victoria; Álvarez Marquina, Agustin

research

A Hybrid Parameterization Technique for Speaker Identification

Authors: Roberto Fernández-Baillo Gallego de la Sacristana
Pedro Gómez Vilda
Rafael Martínez Olalla
Luis Miguel Mazaira Fernández
Cristina Muñoz
Victor Nieto Lluis
M. Victoria Rodellar Biarge
Agustin Álvarez Marquina
Publication date: 1 January 2008
Publisher: Facultad de Informática (UPM)
Doi

Abstract

Classical parameterization techniques for Speaker Identification use the codification of the power spectral density of raw speech, not discriminating between articulatory features produced by vocal tract dynamics (acoustic-phonetics) from glottal source biometry. Through the present paper a study is conducted to separate voicing fragments of speech into vocal and glottal components, dominated respectively by the vocal tract transfer function estimated adaptively to track the acoustic-phonetic sequence of the message, and by the glottal characteristics of the speaker and the phonation gesture. The separation methodology is based in Joint Process Estimation under the un-correlation hypothesis between vocal and glottal spectral distributions. Its application on voiced speech is presented in the time and frequency domains. The parameterization methodology is also described. Speaker Identification experiments conducted on 245 speakers are shown comparing different parameterization strategies. The results confirm the better performance of decoupled parameterization compared against approaches based on plain speech parameterization