Search CORE

6 research outputs found

Acoustic Modelling for Under-Resourced Languages

Author: Stüker Sebastian
Publication venue: KIT-Bibliothek, Karlsruhe
Publication date: 01/01/2009
Field of study

Automatic speech recognition systems have so far been developed only for very few languages out of the 4,000-7,000 existing ones. In this thesis we examine methods to rapidly create acoustic models in new, possibly under-resourced languages, in a time and cost effective manner. For this we examine the use of multilingual models, the application of articulatory features across languages, and the automatic discovery of word-like units in unwritten languages

KITopen

Multilingual Adaptation of RNN Based ASR Systems

Author: Müller Markus
Stüker Sebastian
Waibel Alex
Publication venue
Publication date: 27/02/2018
Field of study

In this work, we focus on multilingual systems based on recurrent neural networks (RNNs), trained using the Connectionist Temporal Classification (CTC) loss function. Using a multilingual set of acoustic units poses difficulties. To address this issue, we proposed Language Feature Vectors (LFVs) to train language adaptive multilingual systems. Language adaptation, in contrast to speaker adaptation, needs to be applied not only on the feature level, but also to deeper layers of the network. In this work, we therefore extended our previous approach by introducing a novel technique which we call "modulation". Based on this method, we modulated the hidden layers of RNNs using LFVs. We evaluated this approach in both full and low resource conditions, as well as for grapheme and phone based systems. Lower error rates throughout the different conditions could be achieved by the use of the modulation.Comment: 5 pages, 1 figure, to appear in 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2018

arXiv.org e-Print Archive

Crossref

Veröffentlichungen und Vorträge 2009 der Mitglieder der Fakultät für Informatik

Author: Karlsruher Institut für Technologie
Publication venue: Karlsruher Institut für Technologie
Publication date: 01/01/2012
Field of study

KITopen

Jahresbericht 2009 der Fakultät für Informatik

Author: Karlsruher Institut für Technologie
Publication venue: Karlsruher Institut für Technologie
Publication date: 01/01/2012
Field of study

KITopen

Speech recognition for under-resourced languages: Data sharing in hidden Markov model systems

Author: de Wet Febe
Kleynhans Neil
Sahraeian Reza
van Compernolle Dirk
Publication venue: 'Academy of Science of South Africa'
Publication date: 30/01/2017
Field of study

For purposes of automated speech recognition in under-resourced environments, techniques used to share acoustic data between closely related or similar languages become important. Donor languages with abundant resources can potentially be used to increase the recognition accuracy of speech systems developed in the resource poor target language. The assumption is that adding more data will increase the robustness of the statistical estimations captured by the acoustic models. In this study we investigated data sharing between Afrikaans and Flemish – an under-resourced and well-resourced language, respectively. Our approach was focused on the exploration of model adaptation and refinement techniques associated with hidden Markov model based speech recognition systems to improve the benefit of sharing data. Specifically, we focused on the use of currently available techniques, some possible combinations and the exact utilisation of the techniques during the acoustic model development process. Our findings show that simply using normal approaches to adaptation and refinement does not result in any benefits when adding Flemish data to the Afrikaans training pool. The only observed improvement was achieved when developing acoustic models on all available data but estimating model refinements and adaptations on the target data only. Significance:  Acoustic modelling for under-resourced languages Automatic speech recognition for Afrikaans Data sharing between Flemish and Afrikaans to improve acoustic modelling for Afrikaan

Academy of Science of South Africa (ASSAf): Open Journal Systems

Directory of Open Access Journals