3 research outputs found
Findings of the 2017 DiscoMT Shared Task on Cross-lingual Pronoun Prediction
We describe the design, the setup, and the
evaluation results of the DiscoMT 2017
shared task on cross-lingual pronoun prediction.
The task asked participants to
predict a target-language pronoun given a
source-language pronoun in the context of
a sentence. We further provided a lemmatized
target-language human-authored
translation of the source sentence, and
automatic word alignments between the
source sentence words and the targetlanguage
lemmata. The aim of the task
was to predict, for each target-language
pronoun placeholder, the word that should
replace it from a small, closed set of
classes, using any type of information that
can be extracted from the entire document.
We offered four subtasks, each for a
different language pair and translation
direction: English-to-French, Englishto-German,
German-to-English, and
Spanish-to-English. Five teams participated
in the shared task, making
submissions for all language pairs. The
evaluation results show that all participating
teams outperformed two strong
n-gram-based language model-based
baseline systems by a sizable margin