Integration of Language Identification into a Recognition System for Spoken Conversations Containing Code-Switches

Dau-Cheng Lyu; Dominic Telaar; Eng-Siong Chng; Florian Metze; Haizhou Li; Jochen Weiner; Ngoc Thang Vu; Tanja Schultz

Publication date: 1 January 2012

Abstract

This paper describes the integration of language identification (LID) into a multilingual automatic speech recognition (ASR) system for spoken conversations containing code-switches between Mandarin and English. We apply a multistream approach that combines, at the frame level, the acoustic model score with language information provided by an LID component. Furthermore, we extend this multistream approach with a new method called "Language Lookahead", in which the language information of subsequent frames is used to improve accuracy. Both methods are evaluated using a set of controlled LID results with varying frame accuracies. Our results show that both approaches improve ASR performance by at least 4% relative if the LID achieves a minimum frame accuracy of 85%.
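
The abstract does not state the exact combination rule, so the following is only a minimal sketch of how a frame-level multistream combination with lookahead might look. It assumes a log-linear weighting of the acoustic log score and the LID posterior, and it assumes that lookahead is done by averaging LID posteriors over the current and a few subsequent frames. The function names, the `weight` parameter, the `n_ahead` window size, and the language labels are all hypothetical and are not taken from the paper.

```python
import math


def combine_streams(am_logp, lid_post, state_lang, weight=0.3):
    """Combine an acoustic log score with an LID posterior (assumed log-linear rule).

    am_logp    -- acoustic model log score of one state for the current frame
    lid_post   -- dict mapping language label to LID posterior for the frame
    state_lang -- language the acoustic state belongs to
    weight     -- hypothetical stream weight given to the language information
    """
    # Floor the posterior so the logarithm stays finite.
    lang_logp = math.log(max(lid_post[state_lang], 1e-10))
    return (1.0 - weight) * am_logp + weight * lang_logp


def lookahead_posteriors(lid_frames, t, n_ahead=2):
    """Average LID posteriors over frame t and up to n_ahead subsequent frames.

    This averaging is an assumption made for illustration only.
    """
    window = lid_frames[t : t + n_ahead + 1]
    return {
        lang: sum(frame[lang] for frame in window) / len(window)
        for lang in window[0]
    }


# Toy LID output for three frames around a Mandarin-to-English switch.
lid_frames = [
    {"man": 0.9, "eng": 0.1},
    {"man": 0.2, "eng": 0.8},
    {"man": 0.1, "eng": 0.9},
]
smoothed = lookahead_posteriors(lid_frames, t=0, n_ahead=2)
score_eng = combine_streams(-5.0, smoothed, "eng")
score_man = combine_streams(-5.0, smoothed, "man")
```

In this toy example the smoothed posterior at the first frame already leans towards English because the following frames do, so an English state with the same acoustic score receives the higher combined score.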

