
    Cross-lingual Pre-training Based Transfer for Zero-shot Neural Machine Translation

    Transfer learning between different language pairs has shown its effectiveness for Neural Machine Translation (NMT) in low-resource scenarios. However, existing transfer methods involving a common target language fall far short in the extreme scenario of zero-shot translation, due to the language-space mismatch between the transferor (the parent model) and the transferee (the child model) on the source side. To address this challenge, we propose an effective transfer learning approach based on cross-lingual pre-training. Our key idea is to make all source languages share the same feature space, which enables a smooth transition for zero-shot translation. To this end, we introduce one monolingual pre-training method and two bilingual pre-training methods to obtain a universal encoder for different languages. Once the universal encoder is constructed, the parent model built on this encoder is trained with large-scale annotated data and then directly applied in the zero-shot translation scenario. Experiments on two public datasets show that our approach significantly outperforms a strong pivot-based baseline and various multilingual NMT approaches. Comment: Accepted as a conference paper at AAAI 2020 (oral presentation).
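
    The abstract describes a three-step transfer pattern: pre-train a universal encoder so all source languages share one feature space, train a parent model on large-scale parallel data on top of that encoder, then apply the parent model unchanged to an unseen (child) source language. The sketch below illustrates only that wiring under assumed components; it is not the authors' implementation, and all class names, dimensions, and the toy inputs are hypothetical.

    ```python
    # Minimal sketch of the shared-encoder transfer pattern (hypothetical names,
    # not the paper's code). A universal encoder keeps all source languages in
    # one feature space, so a decoder trained on the parent pair can be reused
    # directly on a child source language for zero-shot translation.
    import torch
    import torch.nn as nn

    class UniversalEncoder(nn.Module):
        """Shared encoder, assumed to come from cross-lingual pre-training."""
        def __init__(self, vocab_size, d_model=512, nhead=8, num_layers=6):
            super().__init__()
            self.embed = nn.Embedding(vocab_size, d_model)
            layer = nn.TransformerEncoderLayer(d_model, nhead, batch_first=True)
            self.encoder = nn.TransformerEncoder(layer, num_layers)

        def forward(self, src_tokens):
            return self.encoder(self.embed(src_tokens))

    class TargetDecoder(nn.Module):
        """Target-side decoder trained with the parent (high-resource) pair."""
        def __init__(self, vocab_size, d_model=512, nhead=8, num_layers=6):
            super().__init__()
            self.embed = nn.Embedding(vocab_size, d_model)
            layer = nn.TransformerDecoderLayer(d_model, nhead, batch_first=True)
            self.decoder = nn.TransformerDecoder(layer, num_layers)
            self.proj = nn.Linear(d_model, vocab_size)

        def forward(self, tgt_tokens, memory):
            return self.proj(self.decoder(self.embed(tgt_tokens), memory))

    # 1) Obtain the universal encoder over a shared multilingual vocabulary
    #    (pre-training itself is omitted; weights are assumed to be loaded).
    SHARED_VOCAB, TGT_VOCAB = 32000, 32000
    encoder = UniversalEncoder(SHARED_VOCAB)
    decoder = TargetDecoder(TGT_VOCAB)

    # 2) Train the parent model (encoder + decoder) on large-scale annotated
    #    data for the parent source language; the training loop is omitted.

    # 3) Zero-shot transfer: since the child source language lives in the same
    #    feature space, the parent model is applied to it as-is.
    child_src = torch.randint(0, SHARED_VOCAB, (1, 12))  # toy child-language input
    tgt_prefix = torch.randint(0, TGT_VOCAB, (1, 5))     # partial target hypothesis
    with torch.no_grad():
        logits = decoder(tgt_prefix, encoder(child_src))
    print(logits.shape)  # (1, 5, TGT_VOCAB): next-token scores for the child pair
    ```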

    Zero-Resource Neural Machine Translation with Monolingual Pivot Data


    Improving Zero-shot Translation with Language-Independent Constraints
