Low-resource language recognition using a fusion of phoneme posteriorgram counts, acoustic and glottal-based i-vectors

Caraballo Morcillo, Miguel Ángel; Córdoba Herralde, Ricardo de; D'haro Enríquez, Luis Fernando; Pardo Muñoz, José Manuel

research

Low-resource language recognition using a fusion of phoneme posteriorgram counts, acoustic and glottal-based i-vectors

Authors: Miguel Ángel Caraballo Morcillo
Ricardo de Córdoba Herralde
Luis Fernando D'haro Enríquez
José Manuel Pardo Muñoz
Publication date: 1 January 2013
Publisher: E.T.S.I. Telecomunicación (UPM)

Abstract

This paper presents a description of our system for the Albayzin 2012 LRE competition. One of the main characteristics of this evaluation was the reduced number of available files for training the system, especially for the empty condition where no training data set was provided but only a development set. In addition, the whole database was created from online videos and around one third of the training data was labeled as noisy files. Our primary system was the fusion of three different i-vector based systems: one acoustic system based on MFCCs, a phonotactic system using trigrams of phone-posteriorgram counts, and another acoustic system based on RPLPs that improved robustness against noise. A contrastive system that included new features based on the glottal source was also presented. Official and postevaluation results for all the conditions using the proposed metrics for the evaluation and the Cavg metric are presented in the paper

Similar works

Full text

Open in the Core reader

Download PDF

Available Versions

Archivo Digital UPM

oai:oa.upm.es:26034

Last time updated on 26/05/2014

Servicio de Coordinación de Bibliotecas de la Universidad Politécnica de Madrid

oai:oa.upm.es:26034

Last time updated on 10/02/2018