3 research outputs found

Promoting multiword expressions in A* TAG parsing

Authors: Parmentier Yannick, Savary Agata, Waszczuk Jakub
Publication venue: HAL CCSD
Publication date: 13/12/2016

Multiword expressions (MWEs) are pervasive in natural languages and often have both idiomatic and compositional readings, which leads to high syntactic ambiguity. We show that for some MWE types idiomatic readings are usually the correct ones. We propose a heuristic for an A* parser for Tree Adjoining Grammars which benefits from this knowledge by promoting MWE-oriented analyses. This strategy leads to a substantial reduction in the parsing search space in case of true positive MWE occurrences, while avoiding parsing failures in case of false positives.

HAL Université de Tours

The PARSEME Shared Task on Automatic Identification of Verbal Multiword Expressions

Authors: Candito Marie, Cap Fabienne, Cordeiro Silvio, Doucet Antoine, Giouli Voula, Qasemizadeh Behrang, Ramisch Carlos, Sangati Federico, Savary Agata, Stoyanova Ivelina, Vincze Veronika
Publication date: 01/01/2017

Multiword expressions (MWEs) are known as a "pain in the neck" for NLP due to their idiosyncratic behaviour. While some categories of MWEs have been addressed by many studies, verbal MWEs (VMWEs), such as to take a decision, to break one's heart or to turn off, have been rarely modelled. This is notably due to their syntactic variability, which hinders treating them as "words with spaces". We describe an initiative meant to bring about substantial progress in understanding, modelling and processing VMWEs. It is a joint effort, carried out within a European research network, to elaborate universal terminologies and annotation guidelines for 18 languages. Its main outcome is a multilingual 5-million-word annotated corpus which underlies a shared task on automatic identification of VMWEs. This paper presents the corpus annotation methodology and outcome, the shared task organisation and the results of the participating systems.

Crossref

HAL AMU

INRIA a CCSD electronic archive server

Publikationer från Uppsala Universitet

Università degli Studi di Napoli L'Orientale: CINECA IRIS

HAL Université de Tours

Digitala Vetenskapliga Arkivet - Academic Archive On-line

Hal-Diderot

Edition 1.1 of the PARSEME Shared Task on automatic identification of verbal multiword expressions

Università degli Studi di Napoli L'Orientale: CINECA IRIS