Search CORE

2 research outputs found

semeval 2015 task 15 a cpa dictionary entry building task

Author: Adam Kilgarriff
Ismail El Maarouf
Jane Bradbury
Octavian Popescu
Silvie Cinkova
Vít Baisa
Publication venue
Publication date: 01/01/2015
Field of study

This paper describes the first SemEval task to explore the use of Natural Language Processing systems for building dictionary entries, in the framework of Corpus Pattern Analysis. CPA is a corpus-driven technique which provides tools and resources to identify and represent unambiguously the main semantic patterns in which words are used. Task 15 draws on the Pattern Dictionary of English Verbs (www.pdev.org.uk), for the targeted lexical entries, and on the British National Corpus for the input text. Dictionary entry building is split into three subtasks which all start from the same concordance sample: 1) CPA parsing, where arguments and their syntactic and semantic categories have to be identified, 2) CPA clustering, in which sentences with similar patterns have to be clustered and 3) CPA automatic lexicography where the structure of patterns have to be constructed automatically. Subtask 1 attracted 3 teams, though none could beat the baseline (rule-based system). Subtask 2 attracted 2 teams, one of which beat the baseline (majority-class classifier). Subtask 3 did not attract any participant. The task has produced a major semantic multidataset resource which includes data for 121 verbs and about 17,000 annotated sentences, and which is freely accessible

Crossref

Open Access Repository

CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies

Author: Ali Elkahky
Aljoscha Burchardt
Anna Missilä
Anna Nedoluzhko
Antonio Stella
Atsuko Shimada
Daniel Zeman
Dima Taji
Elena Badmaeva
Emily Pitler
Esha Banerjee
Filip Ginter
Francis Tyers
Georg Rehm
Gustavo Mendonca
Hans Uszkoreit
Hector Fernandez Alcalde
Herman Leung
Hiroshi Kanayama
Héctor Martínez Alonso
Jan Hajic Jr.
Jan Hajic
Jana Strnadová
Jaroslava Hlavacova
Jenna Kanerva
Jesse Kirchner
Joakim Nivre
Juhani Luotolahti
Katrin Marheinecke
Kim Harris
Kira Droganova
Manning Christopher D.
Manuela Sanguinetti
Maria Simi
Marie-Catherine de Marneffe
Martin Popel
Martin Potthast
Memduh Gokirmak
Michael Mandl
Milan Straka
Mohammed Attia
Nizar Habash
Osie Li
Rattima Nitisaroj
Ruli Manurung
Sampo Pyysalo
Saran Lertpradit
Sebastian Schuster
Silvie Cinkova
Siva Reddy
Slav Petrov
Sookyoung Kwak
Stina Ojala
Tatiana Lando
Tolga Kayadelen
Umut Sulubacak
Valeria Depaiva
Vivien Macketanz
Václava Kettnerová
Zdenka Uresova
Zhuoran Yu
Çağrı Çöltekin
Publication venue: 'Association for Computational Linguistics (ACL)'
Publication date: 01/01/2017
Field of study

The Conference on Computational Natural Language Learning (CoNLL) features a shared task, in which participants train and test their learning systems on the same data sets. In 2017, one of two tasks was devoted to learning dependency parsers for a large number of languages, in a real-world setting without any gold-standard annotation on input. All test sets followed a unified annotation scheme, namely that of Universal Dependencies. In this paper, we define the task and evaluation methodology, describe data preparation, report and analyze the main results, and provide a brief categorization of the different approaches of the participating syste

Archivio della Ricerca - Università di Pisa