Search CORE

24 research outputs found

The AMU-UEdin Submission to the WMT 2017 Shared Task on Automatic Post-Editing

Author: Grundkiewicz Roman
Junczys-Dowmunt Marcin
Publication venue: 'Association for Computational Linguistics (ACL)'
Publication date: 01/01/2017
Field of study

Crossref

Edinburgh Research Explorer

MS-UEdin Submission to the WMT2018 APE Shared Task:Dual-Source Transformer for Automatic Post-Editing

Author: Grundkiewicz Roman
Junczys-Dowmunt Marcin
Publication venue
Publication date: 01/01/2018
Field of study

This paper describes the Microsoft and University of Edinburgh submission to the Automatic Post-editing shared task at WMT2018. Based on training data and systems from the WMT2017 shared task, we re-implement our own models from the last shared task and introduce improvements based on extensive parameter sharing. Next we experiment with our implementation of dual-source transformer models and data selection for the IT domain. Our submissions decisively wins the SMT post-editing sub-task establishing the new state-of-the-art and is a very close second (or equal, 16.46 vs 16.50 TER) in the NMT sub-task. Based on the rather weak results in the NMT sub-task, we hypothesize that neural-on-neural APE might not be actually useful.Comment: Winning submissions for WMT2018 APE shared tas

arXiv.org e-Print Archive

Crossref

Edinburgh Research Explorer

Findings of the 2017 Conference on Machine Translation

Author: Bojar Ondřej
Chatterjee Rajen
Federmann Christian
Graham Yvette
Haddow Barry
Huang Shujian
Huck Matthias
Koehn Philipp
Liu Qun
Logacheva Varvara
Monz Christof
Negri Matteo
Post Matt
Rubino Raphael
Specia Lucia
Turchi Marco
Publication venue: 'Association for Computational Linguistics (ACL)'
Publication date: 01/01/2017
Field of study

This paper presents the results of the WMT17 shared tasks, which included three machine translation (MT) tasks (news, biomedical, and multimodal), two evaluation tasks (metrics and run-time estimation of MT quality), an automatic post-editing task, a neural MT training task, and a bandit learning task

Crossref

Archivio della ricerca - Fondazione Bruno Kessler

Irish Universities

Edinburgh Research Explorer

Biblio at Institute of Formal and Applied Linguistics

DCU Online Research Access Service

Findings of the 2017 Conference on Machine Translation (WMT17)

Author: Barry Haddow
Christian Federmann
Christof Monz
Lucia Specia
Marco Turchi .
Matt Post
Matteo Negri
Matthias Huck
Ondˇrej Bojar
Philipp Koehn
Qun Liu
Rajen Chatterjee
Raphael Rubino
Shujianhuang
Varvara Logacheva
Yvette Graham
Publication venue: The Association for Computational Linguistics
Publication date
Field of study

This paper presents the results of theWMT17 shared tasks, which included three machine translation (MT) tasks(news, biomedical, and multimodal), two evaluation tasks (metrics and run-time estimation of MT quality), an automatic post-editing task, a neural MT training task, and a bandit learning task

Archivio della ricerca - Fondazione Bruno Kessler

Multi-source transformer with combined losses for automatic post editing

Author: Agrawal Ruchit Rajeshkumar
Chatterjee Rajen
Negri Matteo
Tebbifakhr Amirhossein
Turchi Marco
Publication venue: The Association for Computational Linguistics
Publication date: 01/01/2018
Field of study

Recent approaches to the Automatic Post-editing (APE) of Machine Translation (MT) have shown that best results are obtained by neural multi-source models that correct the raw MT output by also considering information from the corresponding source sentence. To this aim, we present for the first time a neural multi-source APE model based on theTransformer architecture. Moreover, we employ sequence-level loss functions in order to avoid exposure bias during training and to be consistent with the automatic evaluation metrics used for the task. These are the main features of our submissions to the WMT 2018APE shared task (Chatterjee et al., 2018), where we participated both in the PBSMT sub-task (i.e. the correction of MT outputs from a phrase-based system) and in the NMT sub-task (i.e. the correction of neural outputs).In the first subtask, our system improves over the baseline up to -5.3 TER and +8.23 BLEU points ranking second out of 11 submitted runs. In the second one, characterized by the higher quality of the initial translations, we report lower but statistically significant gains (up to -0.38 TER and +0.8 BLEU), ranking first out of 10 submissions

Crossref

Archivio della ricerca - Fondazione Bruno Kessler

Multi-source Transformer for Automatic Post-Editing

Author: Agrawal Ruchit
Negri Matteo
Tebbifakhr Amirhossein
Turchi Marco
Publication venue: 'OpenEdition'
Publication date: 01/01/2018
Field of study

Recent approaches to the Automatic Post-editing (APE) of Machine Translation (MT) have shown that best results are obtained by neural multi-source models that correct the raw MT output by also considering information from the corresponding source sentence. In this paper, we pursue this objective by exploiting, for the first time in APE, the Transformer architecture. Our approach is much simpler than the best current solutions, which are based on ensembling multiple models and adding a final hypothesis re-ranking step. We evaluate our Transformer-based system on the English-German data released for the WMT 2017 APE shared task, achieving results that outperform the state of the art with a simpler architecture suitable for industrial applications.Gli approcci più efficaci alla correzione automatica di errori nella traduzione automatica (Automatic Postediting – APE) attualmente si basano su modelli neurali multi-source, capaci cioè di sfruttare informazione proveniente sia dalla frase da correggere che dalla frase nella lingua sorgente. Seguendo tale approccio, in questo articolo applichiamo per la prima volta l’architettura Transformer, ottenendo un sistema notevolmente meno complesso rispetto a quelli proposti fino ad ora (i migliori dei quali, basati sulla combinazione di più modelli). Attraverso esperimenti su dati Inglese-Tedesco rilasciati per l’APE task a WMT 2017, dimostriamo che, oltre a tale guadagno in termini di semplicità, il metodo proposto ottiene risultati superiori allo stato dell’arte

Crossref

Archivio della ricerca - Fondazione Bruno Kessler

OpenEdition

Multi-source Transformer for Automatic Post-Editing

Author: Agrawal Ruchit
Negri Matteo
Tebbifakhr Amirhossein
Turchi Marco
Publication venue
Publication date
Field of study

Archivio della ricerca - Fondazione Bruno Kessler