Modeling Target-Side Inflection in Neural Machine Translation
NMT systems have problems with large vocabulary sizes. Byte-pair encoding
(BPE) is a popular approach to solving this problem, but while BPE allows the
system to generate any target-side word, it does not enable effective
generalization over the rich vocabulary in morphologically rich languages with
strong inflectional phenomena. We introduce a simple approach to overcome this
problem by training a system to produce the lemma of a word and its
morphologically rich POS tag, which is then followed by a deterministic
generation step. We apply this strategy for English-Czech and English-German
translation scenarios, obtaining improvements in both settings. We furthermore
show that the improvement is not due to only adding explicit morphological
information. Comment: Accepted as a research paper at WMT17. (Updated version with
corrected references.)
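A minimal sketch of the lemma-plus-tag output strategy the abstract describes: instead of predicting surface forms directly, the decoder emits (lemma, morphological tag) pairs, and a deterministic generation step produces the inflected word. The lookup table and Czech example words here are illustrative stand-ins, not the paper's actual morphological generator.

```python
# Toy stand-in for a deterministic morphological generator.
# Real systems would back this with a full inflection resource
# for the target language (e.g. Czech or German).
GENERATOR = {
    ("hrad", "NOUN|Case=Gen|Number=Sing"): "hradu",
    ("hrad", "NOUN|Case=Nom|Number=Plur"): "hrady",
}

def generate_surface(lemma, tag):
    """Deterministically inflect a lemma given its morphological tag,
    falling back to the bare lemma for unseen (lemma, tag) pairs."""
    return GENERATOR.get((lemma, tag), lemma)

# Decoder output: a sequence of (lemma, tag) pairs -> surface words.
decoded = [("hrad", "NOUN|Case=Gen|Number=Sing")]
surface = [generate_surface(lemma, tag) for lemma, tag in decoded]
print(surface)  # ['hradu']
```

Because the generation step is deterministic, the network only has to generalize over lemmas and tags, not over every inflected surface form.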
MaTrEx: the DCU machine translation system for ICON 2008
In this paper, we give a description of the machine translation system developed at DCU that was used for our participation in the NLP Tools Contest of the International
Conference on Natural Language Processing (ICON 2008). This was our first ever attempt at working on any Indian language. In this participation, we focus on various techniques for word and phrase alignment to improve system quality. For the English-Hindi translation task we exploit
source-language reordering. We also carried out experiments combining both in-domain and out-of-domain data to improve
system performance and, as a post-processing step, we transliterate out-of-vocabulary items.
Fine-grained human evaluation of neural versus phrase-based machine translation
We compare three approaches to statistical machine translation (pure
phrase-based, factored phrase-based and neural) by performing a fine-grained
manual evaluation via error annotation of the systems' outputs. The error types
in our annotation are compliant with the multidimensional quality metrics
(MQM), and the annotation is performed by two annotators. Inter-annotator
agreement is high for such a task, and results show that the best performing
system (neural) reduces the errors produced by the worst system (phrase-based)
by 54%. Comment: 12 pages, 2 figures, The Prague Bulletin of Mathematical Linguistics
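One common way to quantify the inter-annotator agreement the abstract reports is Cohen's kappa over the two annotators' error labels. This is a hedged sketch: the label sequences below are illustrative (using MQM top-level dimensions), not the paper's data.

```python
from collections import Counter

def cohens_kappa(a, b):
    """Cohen's kappa for two annotators' label sequences of equal length."""
    assert len(a) == len(b)
    n = len(a)
    # Observed agreement: fraction of items with identical labels.
    observed = sum(x == y for x, y in zip(a, b)) / n
    # Expected agreement under chance, from each annotator's label frequencies.
    ca, cb = Counter(a), Counter(b)
    expected = sum(ca[l] * cb[l] for l in set(a) | set(b)) / (n * n)
    return (observed - expected) / (1 - expected)

ann1 = ["Fluency", "Accuracy", "Fluency", "Accuracy", "Accuracy"]
ann2 = ["Fluency", "Accuracy", "Accuracy", "Accuracy", "Accuracy"]
print(round(cohens_kappa(ann1, ann2), 2))  # 0.55
```

Kappa corrects raw agreement for the agreement expected by chance, which matters when one error category dominates the annotation.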
NMTPY: A Flexible Toolkit for Advanced Neural Machine Translation Systems
In this paper, we present nmtpy, a flexible Python toolkit based on Theano
for training Neural Machine Translation and other neural sequence-to-sequence
architectures. nmtpy decouples the specification of a network from the training
and inference utilities to simplify the addition of a new architecture and
reduce the amount of boilerplate code to be written. nmtpy has been used for
LIUM's top-ranked submissions to WMT Multimodal Machine Translation and News
Translation tasks in 2016 and 2017. Comment: 10 pages, 3 figures