Neural Machine Translation for English–Kazakh with Morphological Segmentation and Synthetic Data
This paper presents the systems submitted by the University of Groningen to the English-Kazakh language pair (both translation directions) for the WMT 2019 news translation task. We explore the potential benefits of (i) morphological segmentation (both unsupervised and rule-based), given the agglutinative nature of Kazakh, (ii) data from two additional languages (Turkish and Russian), given the scarcity of English-Kazakh data, and (iii) synthetic data, both for the source and for the target language. Our best submissions ranked second for Kazakh-English and third for English-Kazakh in terms of the BLEU automatic evaluation metric.
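The unsupervised segmentation route mentioned above is typically realized with a data-driven subword learner such as byte-pair encoding (BPE). The sketch below is a minimal, illustrative BPE learner, not the authors' actual pipeline; the toy corpus and merge count are assumptions chosen to show how a frequent agglutinative suffix surfaces as a subword unit.

```python
from collections import Counter

def learn_bpe(words, num_merges):
    """Learn BPE merge operations from a {word: frequency} dict."""
    # Represent each word as a tuple of symbols (initially characters).
    vocab = {tuple(w): f for w, f in words.items()}
    merges = []
    for _ in range(num_merges):
        # Count all adjacent symbol pairs, weighted by word frequency.
        pairs = Counter()
        for symbols, freq in vocab.items():
            for a, b in zip(symbols, symbols[1:]):
                pairs[(a, b)] += freq
        if not pairs:
            break
        best = max(pairs, key=pairs.get)
        merges.append(best)
        # Apply the best merge everywhere it occurs.
        new_vocab = {}
        for symbols, freq in vocab.items():
            out, i = [], 0
            while i < len(symbols):
                if i + 1 < len(symbols) and (symbols[i], symbols[i + 1]) == best:
                    out.append(symbols[i] + symbols[i + 1])
                    i += 2
                else:
                    out.append(symbols[i])
                    i += 1
            new_vocab[tuple(out)] = freq
        vocab = new_vocab
    return merges, vocab

# Toy corpus with a frequent Turkic-style plural suffix "lar".
corpus = {"kitaplar": 5, "dostlar": 4, "kitap": 3, "dost": 2}
merges, vocab = learn_bpe(corpus, 10)
```

Because "la" and "ar" are the most frequent adjacent pairs in this toy corpus, the learner's first merges begin assembling the shared suffix, which is the behavior that makes such methods attractive for agglutinative languages like Kazakh.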
Findings of the 2019 Conference on Machine Translation (WMT19)
This paper presents the results of the premier shared task organized alongside the Conference on Machine Translation (WMT) 2019. Participants were asked to build machine translation systems for any of 18 language pairs, to be evaluated on a test set of news stories. The main metric for this task is human judgment of translation quality. The task was also opened up to additional test suites to probe specific aspects of translation.
On understanding character-level models for representing morphology
Morphology is the study of how words are composed of smaller units of meaning (morphemes). It allows humans to create, memorize, and understand words in their language. To process and understand human languages, we expect our computational models to also learn morphology. Recent advances in neural network models provide us with models that compose word representations from smaller units like word segments, character n-grams, or characters. These so-called subword unit models do not explicitly model morphology, yet they achieve impressive performance across many multilingual NLP tasks, especially on languages with complex morphological processes. This thesis aims to shed light on the following questions: (1) What do subword unit models learn about morphology? (2) Do we still need prior knowledge about morphology? (3) How do subword unit models interact with morphological typology?
First, we systematically compare various subword unit models and study their performance across language typologies. We show that models based on characters are particularly effective because they learn orthographic regularities which are consistent with morphology. To understand which aspects of morphology are not captured by these models, we compare them with an oracle with access to explicit morphological analysis. We show that in the case of dependency parsing, character-level models are still poor at representing words with ambiguous analyses. We then demonstrate how explicit modeling of morphology is helpful in such cases. Finally, we study how character-level models perform in low-resource, cross-lingual NLP scenarios, and whether they can facilitate cross-linguistic transfer of morphology across related languages. While we show that cross-lingual character-level models can improve low-resource NLP performance, our analysis suggests that this is mostly because of the structural similarities between languages, and we do not yet find any strong evidence of cross-linguistic transfer of morphology. This thesis presents a careful, in-depth study and analysis of character-level models and their relation to morphology, providing insights and future research directions for building morphologically aware computational NLP models.
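The composition step the thesis studies, building a word representation from smaller units such as character n-grams, can be sketched in the fastText style: hash each n-gram into a fixed embedding table and average the rows. The class name, dimensions, and hashing scheme below are illustrative assumptions, not the thesis's actual models.

```python
import zlib
import numpy as np

def char_ngrams(word, n_min=3, n_max=5):
    """Extract character n-grams with boundary markers (fastText-style)."""
    w = f"<{word}>"
    return [w[i:i + n] for n in range(n_min, n_max + 1)
            for i in range(len(w) - n + 1)]

class NgramComposer:
    """Compose a word vector as the mean of hashed n-gram embeddings."""
    def __init__(self, dim=64, buckets=10_000, seed=0):
        rng = np.random.default_rng(seed)
        self.table = rng.normal(scale=0.1, size=(buckets, dim))
        self.buckets = buckets

    def embed(self, word):
        # Deterministically hash each n-gram into a bucket of the table.
        idx = [zlib.crc32(g.encode()) % self.buckets
               for g in char_ngrams(word)]
        return self.table[idx].mean(axis=0)

composer = NgramComposer()
# Morphological relatives share many n-grams ("<wa", "wal", "alk", ...),
# so their vectors are composed from overlapping rows of the table.
v_walked, v_walking = composer.embed("walked"), composer.embed("walking")
```

This overlap is exactly the "orthographic regularities consistent with morphology" effect: related surface forms get correlated vectors without any explicit morphological analysis.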
Survey of Low-Resource Machine Translation
We present a survey covering the state of the art in low-resource machine translation (MT) research. There are currently around 7,000 languages spoken in the world and almost all language pairs lack significant resources for training machine translation models. There has been increasing interest in research addressing the challenge of producing useful translation models when very little translated training data is available. We present a summary of this topical research field and provide a description of the techniques evaluated by researchers in several recent shared tasks in low-resource MT.
Unsupervised Morphological Segmentation and Part-of-Speech Tagging for Low-Resource Scenarios
With the high cost of manually labeling data and the increasing interest in low-resource languages, for which human annotators might not be even available, unsupervised approaches have become essential for processing a typologically diverse set of languages, whether high-resource or low-resource. In this work, we propose new fully unsupervised approaches for two tasks in morphology: unsupervised morphological segmentation and unsupervised cross-lingual part-of-speech (POS) tagging, which have been two essential subtasks for several downstream NLP applications, such as machine translation, speech recognition, information extraction and question answering.
We propose a new unsupervised morphological-segmentation approach that utilizes Adaptor Grammars (AGs), nonparametric Bayesian models that generalize probabilistic context-free grammars (PCFGs), where a PCFG models word structure in the task of morphological segmentation. We implement the approach as a publicly available morphological-segmentation framework, MorphAGram, that enables unsupervised morphological segmentation through the use of several proposed language-independent grammars. In addition, the framework allows for the use of scholar knowledge, when available, in the form of affixes that can be seeded into the grammars. The framework handles the cases when the scholar-seeded knowledge is either generated from language resources, possibly by someone who does not know the language, as weak linguistic priors, or generated by an expert in the underlying language as strong linguistic priors. Another form of linguistic priors is the design of a grammar that models language-dependent specifications. We also propose a fully unsupervised learning setting that approximates the effect of scholar-seeded knowledge through self-training. Moreover, since there is no single grammar that works best across all languages, we propose an approach that picks a nearly optimal configuration (a learning setting and a grammar) for an unseen language, a language that is not part of the development. Finally, we examine multilingual learning for unsupervised morphological segmentation in low-resource setups.
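A full Adaptor Grammar sampler is beyond a short snippet, but the seeding idea above, supplying scholar-provided affixes that bias the analysis of a word into stem and suffixes, can be illustrated with a toy segmenter. The suffix list, the greedy longest-match rule, and the minimum stem length are hypothetical illustrations only, not the MorphAGram grammars themselves.

```python
def segment(word, suffixes, min_stem=3):
    """Greedily strip seeded suffixes from the right, longest match first.

    A toy stand-in for grammar-based segmentation with scholar-seeded
    affixes; min_stem prevents the stem from being stripped away entirely.
    """
    morphs = []
    while True:
        match = max((s for s in suffixes
                     if word.endswith(s) and len(word) - len(s) >= min_stem),
                    key=len, default=None)
        if match is None:
            break
        morphs.append(match)
        word = word[:-len(match)]
    return [word] + morphs[::-1]

# Hypothetical seeded suffixes (English-like, for illustration).
seeds = {"s", "ing", "ed", "er", "ness"}
segment("walkers", seeds)   # ['walk', 'er', 's']
segment("walking", seeds)   # ['walk', 'ing']
```

In the actual framework such seeds act as priors inside a probabilistic grammar rather than as hard rules, which is what lets weak or noisy scholar knowledge still help without being trusted absolutely.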
For unsupervised POS tagging, two cross-lingual approaches have been widely adopted: 1) annotation projection, where POS annotations are projected across an aligned parallel text from a source language, for which a POS tagger is available, to the target language prior to training a POS model; and 2) zero-shot model transfer, where a model of a source language is directly applied to texts in the target language. We propose an end-to-end architecture for unsupervised cross-lingual POS tagging via annotation projection in truly low-resource scenarios that do not assume access to parallel corpora that are large in size or represent a specific domain. We integrate and expand the best practices in alignment and projection and design a rich neural architecture that exploits non-contextualized and transformer-based contextualized word embeddings, affix embeddings and word-cluster embeddings. Additionally, since parallel data might be available between the target language and multiple source languages, as in the case of the Bible, we propose different approaches for learning from multiple sources. Finally, we combine our work on unsupervised morphological segmentation and unsupervised cross-lingual POS tagging by conducting unsupervised stem-based cross-lingual POS tagging via annotation projection, which relies on the stem as the core unit of abstraction for alignment and projection, a choice that is beneficial for low-resource morphologically complex languages. We also examine morpheme-based alignment and projection, the use of linguistic priors towards better POS models, and the use of segmentation information as learning features in the neural architecture.
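The projection step in approach (1) can be sketched very simply: given word alignments between a tagged source sentence and an untagged target sentence, copy each source tag onto its aligned target token. This is a deliberately minimal illustration of that single step, not the full architecture described above; function and tag names are illustrative.

```python
def project_tags(source_tags, alignments, target_len, unk="X"):
    """Project POS tags from source to target along word alignments.

    source_tags: list of POS tags, one per source token
    alignments:  list of (source_index, target_index) pairs
    target_len:  number of target tokens
    Unaligned target tokens receive the placeholder tag `unk`.
    """
    projected = [unk] * target_len
    for src_i, tgt_j in alignments:
        projected[tgt_j] = source_tags[src_i]
    return projected

# Source "the dog sleeps" is tagged; the 2-token target aligns to dog/sleeps.
src_tags = ["DET", "NOUN", "VERB"]
align = [(1, 0), (2, 1)]
project_tags(src_tags, align, 2)  # ['NOUN', 'VERB']
```

The projected tags then serve as (noisy) supervision for training a target-language tagger, which is why alignment quality and the handling of unaligned tokens matter so much in low-resource settings.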
We conduct comprehensive evaluation and analysis to assess the performance of our approaches to unsupervised morphological segmentation and unsupervised POS tagging, and show that they achieve state-of-the-art performance on the two morphology tasks when evaluated on a large set of languages of different typologies: analytic, fusional, agglutinative and synthetic/polysynthetic.