Investigating techniques for low resource conversational speech recognition

Fraga-Silva, Thiago; Gauvain, Jean-Luc; Lamel, Lori; Laurent, Antoine

Investigating techniques for low resource conversational speech recognition

Authors: Thiago Fraga-Silva
Jean-Luc Gauvain
Lori Lamel
Antoine Laurent
Publication date: 20 March 2016
Publisher: 'Institute of Electrical and Electronics Engineers (IEEE)'
Doi

Abstract

International audienceIn this paper we investigate various techniques in order to build effective speech to text (STT) and keyword search (KWS) systems for low resource conversational speech. Sub-word decoding and graphemic mappings were assessed in order to detect out-of-vocabulary keywords. To deal with the limited amount of transcribed data, semi-supervised training and data selection methods were investigated. Robust acoustic features produced via data augmentation were evaluated for acoustic modeling. For language modeling, automatically retrieved conversational-like Webdata was used, as well as neural network based models. We report STT improvements with all the techniques, but interestingly only some improve KWS performance. Results are reported for the Swahili language in the context of the 2015 OpenKWS Evaluation

Similar works

Full text

Open in the Core reader

Download PDF

Available Versions

Archive Ouverte en Sciences de l'Information et de la Communication

oai:HAL:hal-01515254v1

Last time updated on 17/08/2017