ZHAW-InIT at GermEval 2020 task 4 : low-resource speech-to-text

Benites de Azevedo e Souza, Fernando; Büchi, Matthias; Cieliebak, Mark; Hürlimann, Manuela; Ulasik, Malgorzata Anna; von Däniken, Pius

ZHAW-InIT at GermEval 2020 task 4 : low-resource speech-to-text

Authors: Fernando Benites de Azevedo e Souza
Matthias Büchi
Mark Cieliebak
Manuela Hürlimann
Malgorzata Anna Ulasik
Pius von Däniken
Publication date: 1 June 2020
Publisher: CEUR Workshop Proceedings
Doi

Abstract

This paper presents the contribution of ZHAW-InIT to Task 4 ”Low-Resource STT” at GermEval 2020. The goal of the task is to develop a system for translating Swiss German dialect speech into Standard German text in the domain of parliamentary debates. Our approach is based on Jasper, a CNN Acoustic Model, which we fine-tune on the task data. We enhance the base system with an extended Language Model containing in-domain data and speed perturbation and run further experiments with post-processing. Our submission achieved first place with a final Word Error Rate of 40.29%

Similar works

Full text

Open in the Core reader

Download PDF

Available Versions

ZHAW digitalcollection

oai:digitalcollection.zhaw.ch:...

Last time updated on 18/03/2021

ZHAW digitalcollection

oai:digitalcollection.zhaw.ch:...

Last time updated on 14/02/2021