Multilingual representations for low resource speech recognition and keyword search
Authors
K Audhkhasi
J Cui
X Cui
MJF Gales
P Golik
B Kingsbury
E Kislal
KM Knill
L Mangu
H Ney
M Nussbaum-Thom
M Picheny
A Ragni
B Ramabhadran
R Schluter
A Sethy
Z Tüske
H Wang
P Woodland
Publication date
1 January 2015
Publisher
2015 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2015 - Proceedings
Abstract
© 2015 IEEE. This paper examines the impact of multilingual (ML) acoustic representations on Automatic Speech Recognition (ASR) and keyword search (KWS) for low resource languages in the context of the OpenKWS15 evaluation of the IARPA Babel program. The task is to develop Swahili ASR and KWS systems within two weeks using as little as 3 hours of transcribed data. Multilingual acoustic representations proved to be crucial for building these systems under strict time constraints. The paper discusses several key insights on how these representations are derived and used. First, we present a data sampling strategy that can speed up the training of multilingual representations without appreciable loss in ASR performance. Second, we show that fusion of diverse multilingual representations developed at different LORELEI sites yields substantial ASR and KWS gains. Speaker adaptation and data augmentation of these representations improve both ASR and KWS performance (up to 8.7% relative). Third, incorporating un-transcribed data through semi-supervised learning improves WER and KWS performance. Finally, we show that these multilingual representations significantly improve ASR and KWS performance (relative 9% for WER and 5% for MTWV) even when forty hours of transcribed audio in the target language is available. Multilingual representations significantly contributed to the LORELEI KWS systems winning the OpenKWS15 evaluation.
Available Versions
White Rose Research Online
oai:eprints.whiterose.ac.uk:15...
Last time updated on 19/11/2019
Apollo (Cambridge)
oai:www.repository.cam.ac.uk:1...
Last time updated on 12/01/2019
RWTH Publications
oai:publications.rwth-aachen.d...
Last time updated on 18/04/2020
White Rose Research Online
oai:eprints.whiterose.ac.uk:15...
Last time updated on 02/02/2021
CUED - Cambridge University Engineering Department
oai:generic.eprints.org:764838...
Last time updated on 15/07/2020
Publikationsserver der RWTH Aachen University
oai:publications.rwth-aachen.d...
Last time updated on 04/11/2017