On the use of high-level information in speaker and language recognition

González Domínguez, Javier; González-Rodríguez, Joaquín; López Moreno, Ignacio; Montero-Asenjo, Alberto; Ramos, Daniel; Toledano, Doroteo T.

research

On the use of high-level information in speaker and language recognition

Authors: Javier González Domínguez
Joaquín González-Rodríguez
Ignacio López Moreno
Alberto Montero-Asenjo
Daniel Ramos
Doroteo T. Toledano
Publication date: 1 January 2006
Publisher

Abstract

Actas de las IV Jornadas de Tecnología del Habla (JTH 2006)Automatic Speaker Recognition systems have been largely dominated by acoustic-spectral based systems, relying in proper modelling of the short-term vocal tract of speakers. However, there is scientific and intuitive evidence that speaker specific information is embedded in the speech signal in multiple short- and long-term characteristics. In this work, a multilevel speaker recognition system combining acoustic, phonotactic and prosodic subsystems is presented and assessed using NIST 2005 Speaker Recognition Evaluation data. For language recognition systems, the NIST 2005 Language Recognition Evaluation was selected to measure performance of a high-level language recognition systems

Similar works

Full text

Available Versions

Biblos-e Archivo

oai:repositorio.uam.es:10486/6...

Last time updated on 17/11/2016