Findings of the Shared Task on Multilingual Coreference Resolution

Konopík, Miloslav; Nedoluzhko, Anna; Novák, Michal; Ogrodniczuk, Maciej; Popel, Martin; Pražák, Ondřej; Sido, Jakub; Zeman, Daniel; Zhu, Yilun; Žabokrtský, Zdeněk

Findings of the Shared Task on Multilingual Coreference Resolution

Authors: Miloslav Konopík
Anna Nedoluzhko
Michal Novák
Maciej Ogrodniczuk
Martin Popel
Ondřej Pražák
Jakub Sido
Daniel Zeman
Yilun Zhu
Zdeněk Žabokrtský
Publication date: 16 September 2022
Publisher

Abstract

This paper presents an overview of the shared task on multilingual coreference resolution associated with the CRAC 2022 workshop. Shared task participants were supposed to develop trainable systems capable of identifying mentions and clustering them according to identity coreference. The public edition of CorefUD 1.0, which contains 13 datasets for 10 languages, was used as the source of training and evaluation data. The CoNLL score used in previous coreference-oriented shared tasks was used as the main evaluation metric. There were 8 coreference prediction systems submitted by 5 participating teams; in addition, there was a competitive Transformer-based baseline system provided by the organizers at the beginning of the shared task. The winner system outperformed the baseline by 12 percentage points (in terms of the CoNLL scores averaged across all datasets for individual languages)

Similar works

Full text

Available Versions

arXiv.org e-Print Archive

oai:arXiv.org:2209.07841

Last time updated on 14/11/2022