232 research outputs found

    Unraveling the Contribution of Image Captioning and Neural Machine Translation for Multimodal Machine Translation

    Recent work on multimodal machine translation has attempted to address the problem of producing target language image descriptions based on both the source language description and the corresponding image. However, existing work has not been conclusive on the contribution of visual information. This paper presents an in-depth study of the problem by examining the differences and complementarities of two related but distinct approaches to this task: text-only neural machine translation and image captioning. We analyse the scope for improvement and the effect of different data and settings in building models for these tasks. We also propose ways of combining these two approaches for improved translation quality.
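
    One simple possibility, not spelled out in the abstract, is late fusion: a text-only NMT model and an image captioning model jointly score each next target token. The sketch below illustrates this with a hypothetical `next_token_logprobs` interface; the model classes, their methods, and the `vocab` attribute are assumptions for illustration, not any particular library's API.

    ```python
    import numpy as np

    def ensemble_logprobs(nmt_logprobs: np.ndarray,
                          caption_logprobs: np.ndarray,
                          weight: float = 0.5) -> np.ndarray:
        """Late fusion: interpolate the two models' next-token
        log-probabilities (both arrays cover the same target vocabulary)."""
        return weight * nmt_logprobs + (1.0 - weight) * caption_logprobs

    def greedy_decode(nmt_model, caption_model, source, image, max_len=50):
        """Greedy decoding from the interpolated distribution.
        `next_token_logprobs(...)` is a hypothetical interface returning
        one log-probability per vocabulary entry for the next token."""
        prefix = ["<s>"]
        for _ in range(max_len):
            scores = ensemble_logprobs(
                nmt_model.next_token_logprobs(source, prefix),
                caption_model.next_token_logprobs(image, prefix))
            token = nmt_model.vocab[int(scores.argmax())]
            if token == "</s>":
                break
            prefix.append(token)
        return prefix[1:]
    ```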

    As Little as Possible, as Much as Necessary: Detecting Over- and Undertranslations with Contrastive Conditioning

    Omission and addition of content are typical issues in neural machine translation. We propose a method for detecting such phenomena with off-the-shelf translation models. Using contrastive conditioning, we compare the likelihood of a full sequence under a translation model to the likelihood of its parts, given the corresponding source or target sequence. This allows us to pinpoint superfluous words in the translation and untranslated words in the source even in the absence of a reference translation. The accuracy of our method is comparable to that of a supervised method that requires a custom quality estimation model. Comment: ACL 2022.
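
    As a rough illustration of the likelihood comparison (not the authors' released implementation), the sketch below scores hypotheses with an off-the-shelf Marian model via Hugging Face transformers and flags a target word as potentially superfluous when deleting it makes the remaining translation more likely given the source. The model name and the word-deletion heuristic are illustrative assumptions; detecting untranslated source words would work analogously with a model for the reverse direction.

    ```python
    import torch
    from transformers import MarianMTModel, MarianTokenizer

    MODEL_NAME = "Helsinki-NLP/opus-mt-en-de"  # illustrative choice
    tokenizer = MarianTokenizer.from_pretrained(MODEL_NAME)
    model = MarianMTModel.from_pretrained(MODEL_NAME).eval()

    def target_logprob(source: str, target: str) -> float:
        """Total log-probability of `target` given `source` under the model."""
        inputs = tokenizer([source], return_tensors="pt")
        labels = tokenizer(text_target=[target], return_tensors="pt").input_ids
        with torch.no_grad():
            loss = model(**inputs, labels=labels).loss  # mean NLL per token
        return -loss.item() * labels.size(1)

    def superfluous_words(source: str, target: str) -> list[str]:
        """Flag target words whose deletion makes the remainder *more*
        likely given the source -- a crude addition detector."""
        words = target.split()
        full_score = target_logprob(source, target)
        flagged = []
        for i, word in enumerate(words):
            partial = " ".join(words[:i] + words[i + 1:])
            if target_logprob(source, partial) > full_score:
                flagged.append(word)
        return flagged

    print(superfluous_words("A dog sleeps.", "Ein alter Hund schläft."))
    ```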

    MT-based sentence alignment for OCR-generated parallel texts

    The performance of current sentence alignment tools varies depending on the texts to be aligned. We have found existing tools unsuitable for hard-to-align parallel texts and describe an alternative alignment algorithm. The basic idea is to use machine translations of a text and BLEU as a similarity score to find reliable alignments, which are used as anchor points. The gaps between these anchor points are then filled using BLEU-based and length-based heuristics. We show that this approach outperforms state-of-the-art algorithms in our alignment task, and that this improvement in alignment quality translates into better SMT performance. Furthermore, we show that even length-based alignment algorithms profit from having a machine translation as a point of comparison.
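
    A condensed sketch of the anchoring step, using sacrebleu's sentence-level BLEU as the similarity score; the threshold value and the greedy monotonic matching are simplifications, not the paper's exact algorithm.

    ```python
    from sacrebleu import sentence_bleu

    def find_anchors(mt_of_source: list[str], target: list[str],
                     threshold: float = 30.0) -> list[tuple[int, int]]:
        """Match each machine-translated source sentence to its most similar
        target sentence; keep high-scoring, monotonic pairs as anchor points."""
        anchors, last_j = [], -1
        for i, hyp in enumerate(mt_of_source):
            scores = [sentence_bleu(hyp, [ref]).score for ref in target]
            j = max(range(len(target)), key=scores.__getitem__)
            if scores[j] >= threshold and j > last_j:  # keep alignment monotonic
                anchors.append((i, j))
                last_j = j
        return anchors

    # The gaps between consecutive anchors would then be filled with the
    # BLEU-based and length-based heuristics described in the abstract.
    ```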

    Improving Zero-Shot Cross-lingual Transfer Between Closely Related Languages by Injecting Character-Level Noise

    Cross-lingual transfer between a high-resource language and its dialects or closely related language varieties should be facilitated by their similarity. However, current approaches that operate in the embedding space do not take surface similarity into account. This work presents a simple yet effective strategy to improve cross-lingual transfer between closely related varieties. We propose to augment the data of the high-resource source language with character-level noise to make the model more robust to spelling variation. Our strategy shows consistent improvements over several languages and tasks: zero-shot transfer of POS tagging and topic identification between language varieties from the Finnic, West and North Germanic, and Western Romance language branches. Our work provides evidence for the usefulness of simple surface-level noise in improving transfer between language varieties. Comment: ACL 2022.
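
    The augmentation itself is lightweight. A minimal sketch of character-level noise injection follows; the three noise operations and the 10% rate are illustrative choices, not necessarily the paper's settings.

    ```python
    import random

    def inject_char_noise(text: str, noise_prob: float = 0.1,
                          seed: int = 0) -> str:
        """Randomly delete, duplicate, or swap adjacent characters to simulate
        spelling variation between closely related language varieties."""
        rng = random.Random(seed)
        chars, out, i = list(text), [], 0
        while i < len(chars):
            if rng.random() < noise_prob:
                op = rng.choice(["delete", "duplicate", "swap"])
                if op == "delete":
                    i += 1
                    continue
                if op == "duplicate":
                    out.append(chars[i])
                elif op == "swap" and i + 1 < len(chars):
                    out.extend([chars[i + 1], chars[i]])
                    i += 2
                    continue
            out.append(chars[i])
            i += 1
        return "".join(out)

    print(inject_char_noise("cross-lingual transfer between related varieties"))
    ```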

    Contrastive Conditioning for Assessing Disambiguation in MT: A Case Study of Distilled Bias

    Lexical disambiguation is a major challenge for machine translation systems, especially if some senses of a word are trained less often than others. Identifying patterns of overgeneralization requires evaluation methods that are both reliable and scalable. We propose contrastive conditioning as a reference-free black-box method for detecting disambiguation errors. Specifically, we score the quality of a translation by conditioning on variants of the source that provide contrastive disambiguation cues. After validating our method, we apply it in a case study to perform a targeted evaluation of sequence-level knowledge distillation. By probing word sense disambiguation and translation of gendered occupation names, we show that distillation-trained models tend to overgeneralize more than other models with a comparable BLEU score. Contrastive conditioning thus highlights a side effect of distillation that is not fully captured by standard evaluation metrics. Code and data to reproduce our findings are publicly available.
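
    The scoring step can be sketched with the same kind of conditional likelihood as in the over-/undertranslation entry above: a translation is scored under source variants carrying contrastive sense cues. The model choice and the parenthetical cue format are illustrative assumptions, not the paper's exact setup.

    ```python
    import torch
    from transformers import MarianMTModel, MarianTokenizer

    MODEL_NAME = "Helsinki-NLP/opus-mt-en-de"  # illustrative choice
    tokenizer = MarianTokenizer.from_pretrained(MODEL_NAME)
    model = MarianMTModel.from_pretrained(MODEL_NAME).eval()

    def logprob(source: str, target: str) -> float:
        """Total log-probability of `target` given `source`."""
        inputs = tokenizer([source], return_tensors="pt")
        labels = tokenizer(text_target=[target], return_tensors="pt").input_ids
        with torch.no_grad():
            loss = model(**inputs, labels=labels).loss
        return -loss.item() * labels.size(1)

    def disambiguation_score(source: str, translation: str,
                             correct_cue: str, wrong_cue: str) -> float:
        """Positive if the translation is more compatible with the source
        variant carrying the correct sense cue than with the contrastive one."""
        return (logprob(f"{source} ({correct_cue})", translation)
                - logprob(f"{source} ({wrong_cue})", translation))

    # Does the translation pick the sports sense of "bat" over the animal?
    print(disambiguation_score("He bought a bat.", "Er kaufte einen Schläger.",
                               "baseball equipment", "flying animal"))
    ```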

    X-stance: A Multilingual Multi-Target Dataset for Stance Detection

    We extract a large-scale stance detection dataset from comments written by candidates in Swiss elections. The dataset consists of German, French and Italian text, allowing for a cross-lingual evaluation of stance detection. It contains 67,000 comments on more than 150 political issues (targets). Unlike stance detection models that are tied to specific target issues, we use the dataset to train a single model on all the issues. To make learning across targets possible, we prepend to each instance a natural question that represents the target (e.g. "Do you support X?"). Baseline results from multilingual BERT show that zero-shot cross-lingual and cross-target transfer of stance detection is moderately successful with this approach. Comment: SwissText + KONVENS 2020. Data and code are available at https://github.com/ZurichNLP/xstance
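
    Instance construction is straightforward. A minimal sketch with multilingual BERT via Hugging Face transformers follows; the maximum sequence length and the two-label head reflect common defaults rather than the paper's exact setup.

    ```python
    from transformers import AutoModelForSequenceClassification, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained("bert-base-multilingual-cased")
    model = AutoModelForSequenceClassification.from_pretrained(
        "bert-base-multilingual-cased", num_labels=2)  # favor / against

    def encode_instance(target_question: str, comment: str):
        """Prepend the natural question that represents the target, so one
        model can be trained and evaluated across all issues."""
        return tokenizer(target_question, comment, truncation=True,
                         max_length=256, return_tensors="pt")

    batch = encode_instance(
        "Do you support raising the retirement age?",
        "Unser Rentensystem ist langfristig nicht finanzierbar.")
    logits = model(**batch).logits  # stance prediction for this pair
    ```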

    Iterative, MT-based Sentence Alignment of Parallel Texts

    Proceedings of the 18th Nordic Conference of Computational Linguistics NODALIDA 2011. Editors: Bolette Sandford Pedersen, Gunta Nešpore and Inguna Skadiņa. NEALT Proceedings Series, Vol. 11 (2011), 175-182. © 2011 The editors and contributors. Published by the Northern European Association for Language Technology (NEALT), http://omilia.uio.no/nealt. Electronically published at Tartu University Library (Estonia), http://hdl.handle.net/10062/16955