Automatic extraction of subcategorization frames for Italian

Bosco, Cristina; Ienco, Dino; Villata, Serena

Automatic extraction of subcategorization frames for Italian

Authors: Cristina Bosco
Dino Ienco
Serena Villata
Publication date: 1 January 2008
Publisher: European Language Resources Association (ELRA)

Abstract

Subcategorization is a kind of knowledge which can be considered as crucial in several NLP tasks, such as Information Extraction or parsing, but the collection of very large resources including subcategorization representation is difficult and time-consuming. Various experiences show that the automatic extraction can be a practical and reliable solution for acquiring such a kind of knowledge. The aim of this paper is at investigating the relationships between subcategorization frame extraction and the nature of data from which the frames have to be extracted, e.g. how much the task can be influenced by the richness/poorness of the annotation. Therefore, we present some experiments that apply statistical subcategorization extraction methods, known in literature, on an Italian treebank that exploits a rich set of dependency relations that can be annotated at different degrees of specificity. Benefiting of the availability of relation sets that implement different granularity in the representation of relations, we evaluate our results with reference to previous works in a cross-linguistic perspective. 1

Similar works

Full text

Open in the Core reader

Download PDF

Available Versions

CiteSeerX

oai:CiteSeerX.psu:10.1.1.676.4...

Last time updated on 29/10/2017

Institutional Research Information System University of Turin

oai:iris.unito.it:2318/26669

Last time updated on 18/04/2020