Integrating meta-information into exemplar-based speech recognition with segmental conditional random fields

Demuynck, Kris; Nguyen, Patrick; Seppi, Dino; Van Compernolle, Dirk; Zweig, Geoffrey

Authors: Kris Demuynck
Patrick Nguyen
Dino Seppi
Dirk Van Compernolle
Geoffrey Zweig
Publication date: 1 January 2011
Publisher: 'Institute of Electrical and Electronics Engineers (IEEE)'

Abstract

Exemplar-based recognition systems do not abstract large amounts of data into compact models. Instead, they store the observed data, enriched with some annotations, and infer on the fly by finding the exemplars that best resemble the input speech. One advantage of exemplar-based systems is that, besides deriving the current phone or word, one can easily derive a wealth of meta-information about the chunk of audio under investigation. In this work we harvest, from the set of best-matching exemplars, meta-information that is thought to be relevant for recognition, such as word boundary predictions and speaker entropy. Integrating this meta-information into the recognition framework using segmental conditional random fields reduced the word error rate (WER) of the exemplar-based system on the WSJ Nov92 20k task from 8.2% to 7.6%. Adding the HMM score and multiple HMM phone detectors as features further reduced the WER to 6.6%. © 2011 IEEE.

Demuynck K., Seppi D., Van Compernolle D., Nguyen P., Zweig G., "Integrating meta-information into exemplar-based speech recognition with segmental conditional random fields", 36th international conference on acoustics, speech and signal processing - ICASSP’2011, pp. 5048-5051, May 22-27, 2011, Prague, Czech Republic.

Status: published
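As a rough illustration of two quantities the abstract mentions, the sketch below computes a speaker entropy over the best-matching exemplars and the relative size of the reported WER reductions. The entropy definition used here (Shannon entropy, in bits, over the speaker labels of the top-matching exemplars) is an assumption; the abstract does not say how the authors compute it. The function names `speaker_entropy` and `relative_reduction` and the example speaker labels are hypothetical. Only the WER figures (8.2%, 7.6%, 6.6%) come from the abstract.

```python
import math
from collections import Counter


def speaker_entropy(speaker_ids):
    """Shannon entropy (bits) of the speaker labels of the best-matching exemplars.

    Assumed definition, not taken from the paper: low entropy means the
    matches come from few speakers, high entropy means they are spread
    over many speakers.
    """
    counts = Counter(speaker_ids)
    total = sum(counts.values())
    if total == 0:
        raise ValueError("need at least one exemplar")
    # sum of p * log2(1/p) over the observed speakers
    return sum((c / total) * math.log2(total / c) for c in counts.values())


def relative_reduction(baseline, improved):
    """Relative error reduction, as a fraction of the baseline error."""
    return (baseline - improved) / baseline


# Hypothetical speaker labels for four best-matching exemplars.
print(speaker_entropy(["spk1", "spk1", "spk2", "spk2"]))  # 1.0 bit: two equally likely speakers

# WER figures reported in the abstract.
print(f"{relative_reduction(8.2, 7.6):.1%}")  # 7.3% relative reduction from meta-information
print(f"{relative_reduction(8.2, 6.6):.1%}")  # 19.5% relative reduction with HMM features added
```

In a segmental CRF setting, a value such as this entropy would be one of several segment-level features. How the authors actually define and weight their features is not stated in the abstract.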

Available Versions

Lirias

oai:lirias2repo.kuleuven.be:12...

Last time updated on 10/12/2019