Search CORE

1 research outputs found

Using cross-decoder phone coocurrences in phonotactic language recognition

Author: Amparo Varona
Germán Bordel
Luis Javier Rodríguez-fuentes
Mikel Penagarikano
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2010
Field of study

Phonotactic language recognizers are based on the ability of phone decoders to produce phone sequences containing acoustic, phonetic and phonological information, which is partially dependent on the language. Input utterances are de-coded and then scored by means of models for the target lan-guages. Commonly, various decoders are applied in parallel and fused at the score level. A kind of complementarity ef-fect is expected when fusing scores, since each decoder is assumed to extract different (and complementary) informa-tion from the input utterance. This assumption is supported by the performance improvements attained when fusing sys-tems. However, decodings are processed in a fully uncou-pled way, their time alignment (and the information that may be extracted from it) being completely lost. In this paper, a simple approach is proposed, which takes into account time alignment information, by considering cross-decoder phone coocurrences at the frame level. To evaluate the approach, a choice of open software (BUT front-end and phone decoders, SRI-LM toolkit, libSVM, FoCal) is used, and experiments are carried out on the NIST LRE2007 database. Adding phone coocurrences to the baseline phonotactic systems pro-vides slight performance improvements, revealing the poten-tial benefit of using cross-decoder dependencies for language modeling

CiteSeerX

Crossref