Search CORE

95 research outputs found

Italian Event Detection Goes Deep Learning

Author: Caselli Tommaso
Publication venue
Publication date: 01/01/2018
Field of study

This paper reports on a set of experiments with different word embeddings to initialize a state-of-the-art Bi-LSTM-CRF network for event detection and classification in Italian, following the EVENTI evaluation exercise. The net- work obtains a new state-of-the-art result by improving the F1 score for detection of 1.3 points, and of 6.5 points for classification, by using a single step approach. The results also provide further evidence that embeddings have a major impact on the performance of such architectures.Comment: to appear at CLiC-it 201

arXiv.org e-Print Archive

Crossref

Proceedings - University of Groningen

University of Groningen

ARTS repository - University of Groningen

OpenEdition

Dissertations of the University of Groningen

Systems’ Agreements and Disagreements in Temporal Processing:An Extensive Error Analysis of the TempEval-3 Task

Author: Caselli Tomasso
Morante Roser
Publication venue: European Language Resources Association (ELRA)
Publication date: 01/01/2018
Field of study

VU Research Portal

Proceedings - University of Groningen

University of Groningen

ARTS repository - University of Groningen

Dissertations of the University of Groningen

Systems’ Agreements and Disagreements in Temporal Processing:An Extensive Error Analysis of the TempEval-3 Task

Author: Caselli Tomasso
Morante Roser
Publication venue: European Language Resources Association (ELRA)
Publication date: 01/01/2018
Field of study

Proceedings - University of Groningen

Normalisation of imprecise temporal expressions extracted from text

Author: Derczynski Leon
Didonet Del Fabro Marcos
Roberts Angus
Tissot Hegler
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2019
Field of study

Information extraction systems and techniques have been largely used to deal with the increasing amount of unstructured data available nowadays. Time is among the different kinds of information that may be extracted from such unstructured data sources, including text documents. However, the inability to correctly identify and extract temporal information from text makes it difficult to understand how the extracted events are organised in a chronological order. Furthermore, in many situations, the meaning of temporal expressions (timexes) is imprecise, such as in “less than 2 years” and “several weeks”, and cannot be accurately normalised, leading to interpretation errors. Although there are some approaches that enable representing imprecise timexes, they are not designed to be applied to specific scenarios and difficult to generalise. This paper presents a novel methodology to analyse and normalise imprecise temporal expressions by representing temporal imprecision in the form of membership functions, based on human interpretation of time in two different languages (Portuguese and English). Each resulting model is a generalisation of probability distributions in the form of trapezoidal and hexagonal fuzzy membership functions. We use an adapted F1-score to guide the choice of the best models for each kind of imprecise timex and a weighted F1-score (F1 3 D ) as a complementary metric in order to identify relevant differences when comparing two normalisation models. We apply the proposed methodology for three distinct classes of imprecise timexes, and the resulting models give distinct insights in the way each kind of temporal expression is interpreted

UCL Discovery

The IT University of Copenhagen's Repository

King's Research Portal