Machine learning approaches for temporal information extraction: a comparative study

Kolomiyets, Oleksandr; Moens, Marie-Francine

Machine learning approaches for temporal information extraction: a comparative study

Authors: Oleksandr Kolomiyets
Marie-Francine Moens
Publication date: 1 January 2009
Publisher: 'Indiana University Press (Project Muse)'

Abstract

Temporal expressions are important structures in natural language. In order to understand text, temporal expressions have to be extracted and normalized to ISO-based values. For these purposes rule-based and machine learning techniques were proposed. In this paper we present and compare two approaches for automatic recognition of temporal expressions in free text, based on a supervised machine learning approach and trained on an annotated corpus for temporal information, namely TimeBank. The first approach performs a token-by-token classification following B-I-O encoding. The second one does a binary constituent-based classification of chunk phrases. Our experiments demonstrate that on the TimeBank corpus the constituent-based classification performs better than the token-based one. It achieves F1-measure values of 0.852 for the detection task and 0.828 when an exact match is required, which is better than the state-of-the-art results for temporal expression detection on TimeBank.status: publishe

Similar works

Full text

Available Versions

Lirias

oai:lirias2repo.kuleuven.be:12...

Last time updated on 10/12/2019