Search CORE

16 research outputs found

Sheffield submissions for the WMT18 quality estimation shared task

Author: Blain Frédéric
Ive Julia
Scarton Carolina
Specia Lucia
Publication venue: 'Association for Computational Linguistics (ACL)'
Publication date: 25/08/2020
Field of study

Wolverhampton Intellectual Repository and E-theses

Findings of the WMT 2018 shared task on quality estimation

Author: Astudillo Ramón
Blain Frederic
Logacheva Varvara
Martins André
Specia Lucia
Publication venue: 'Association for Computational Linguistics (ACL)'
Publication date: 25/08/2020
Field of study

© 2018 The Authors. Published by Association for Computational Linguistics. This is an open access article available under a Creative Commons licence. The published version can be accessed at the following link on the publisher’s website: http://dx.doi.org/10.18653/v1/W18-6451We report the results of the WMT18 shared task on Quality Estimation, i.e. the task of predicting the quality of the output of machine translation systems at various granularity levels: word, phrase, sentence and document. This year we include four language pairs, three text domains, and translations produced by both statistical and neural machine translation systems. Participating teams from ten institutions submitted a variety of systems to different task variants and language pairs.The data and annotations collected for Tasks 1, 2 and 3 was supported by the EC H2020 QT21 project (grant agreement no. 645452). The shared task organisation was also supported by the QT21 project, national funds through Fundacao para a Ciencia e Tecnologia (FCT), with references UID/CEC/50021/2013 and UID/EEA/50008/2013, and by the European Research Council (ERC StG DeepSPIN 758969). We would also like to thank Julie Beliao and the Unbabel Quality Team for coordinating the annotation of the dataset used in Task 4

Wolverhampton Intellectual Repository and E-theses

deepQuest: a framework for neural-based quality estimation

Author: Blain Frederic
Ive Julia
Specia Lucia
Publication venue: 'Association for Computational Linguistics (ACL)'
Publication date: 25/08/2020
Field of study

© 2018 The Authors. Published by Association for Computational Linguistics. This is an open access article available under a Creative Commons licence. The published version can be accessed at the following link on the publisher’s website: https://www.aclweb.org/anthology/C18-1266/ Code available at: https://github.com/sheffieldnlp/deepQuestPredicting Machine Translation (MT) quality can help in many practical tasks such as MT post-editing. The performance of Quality Estimation (QE) methods has drastically improved recently with the introduction of neural approaches to the problem. However, thus far neural approaches have only been designed for word and sentence-level prediction. We present a neural framework that is able to accommodate neural QE approaches at these fine-grained levels and generalize them to the level of documents. We test the framework with two sentence-level neural QE approaches: a state of the art approach that requires extensive pre-training, and a new light-weight approach that we propose, which employs basic encoders. Our approach is significantly faster and yields performance improvements for a range of document-level quality estimation tasks. To our knowledge, this is the first neural architecture for document-level QE. In addition, for the first time we apply QE models to the output of both statistical and neural MT systems for a series of European languages and highlight the new challenges resulting from the use of neural MT.The development of deepQuest received funding from the European Association for Machine Translation and the Amazon Academic Research Awards program. The first author worked on this paper during a research stay at the University of Sheffield

Wolverhampton Intellectual Repository and E-theses

deepQuest: A Framework for Neural-based Quality Estimation

Author: Blain F.
Ive J.
Specia L.
Publication venue: 'Association for Computational Linguistics (ACL)'
Publication date: 01/08/2018
Field of study

Predicting Machine Translation (MT) quality can help in many practical tasks such as MT post-editing. The performance of Quality Estimation (QE) methods has drastically improved recently with the introduction of neural approaches to the problem. However, thus far neural approaches have only been designed for word and sentence-level prediction. We present a neural framework that is able to accommodate neural QE approaches at these fine-grained levels and generalize them to the level of documents. We test the framework with two sentence-level neural QE approaches: a state of the art approach that requires extensive pre-training, and a new light-weight approach that we propose, which employs basic encoders. Our approach is significantly faster and yields performance improvements for a range of document-level quality estimation tasks. To our knowledge, this is the first neural architecture for document-level QE. In addition, for the first time we apply QE models to the output of both statistical and neural MT systems for a series of European languages and highlight the new challenges resulting from the use of neural MT

White Rose Research Online

Knowledge distillation for quality estimation

Author: Aletras N.
Alva-Manchego F.
Blain F.
Fomicheva M.
Gajbhiye A.
Obamuyide A.
Specia L.
Publication venue: 'Association for Computational Linguistics (ACL)'
Publication date: 06/05/2021
Field of study

Quality Estimation (QE) is the task of automatically predicting Machine Translation quality in the absence of reference translations, making it applicable in real-time settings, such as translating online social media conversations. Recent success in QE stems from the use of multilingual pre-trained representations, where very large models lead to impressive results. However, the inference time, disk and memory requirements of such models do not allow for wide usage in the real world. Models trained on distilled pre-trained representations remain prohibitively large for many usage scenarios. We instead propose to directly transfer knowledge from a strong QE teacher model to a much smaller model with a different, shallower architecture. We show that this approach, in combination with data augmentation, leads to light-weight QE models that perform competitively with distilled pre-trained representations with 8x fewer parameters

arXiv.org e-Print Archive

White Rose Research Online

Wolverhampton Intellectual Repository and E-theses