Marco para parsing predictivo interactivo aplicado a la lengua castellana

Benedí Ruiz, José Miguel; Leiva Torres, Luis Alberto; Sánchez Peiró, Joan Andreu; Sánchez Sáez, Ricardo

research

Marco para parsing predictivo interactivo aplicado a la lengua castellana

Authors: José Miguel Benedí Ruiz
Luis Alberto Leiva Torres
Joan Andreu Sánchez Peiró
Ricardo Sánchez Sáez
Publication date: 1 January 2010
Publisher: Sociedad Española para el Procesamiento del Lenguaje Natural

Abstract

El marco teórico de Parsing Predictivo Interactivo (IPP) permite construir sistemas de anotación sintáctica interactivos. Los anotadores humanos pueden utilizar estos sistemas de ayuda para crear árboles sintácticos con muy poco esfuerzo (en comparación con el trabajo requerido para corregir manualmente árboles obtenidos a partir de un analizador sintáctico completamente automático). En este artículo se presenta la adaptación a la lengua castellana del marco IPP y su herramienta de anotación IPP-Ann, usando modelos obtenidos a partir del UAM Spanish Treebank. Hemos llevado a cabo experimentación simulando al usuario para obtener métricas de evaluación objetivas para nuestro sistema. Estos resultados muestran que el marco IPP aplicado al UAM Spanish Treebank se traduce en una importante cantidad de esfuerzo ahorrado, comparable con el obtenido al aplicar el marco IPP para analizar la lengua inglesa mediante el Penn Treebank.The Interactive Predictive Parsing (IPP) framework allows us the construction of interactive tree annotation systems. These can help human annotators in creating error-free parse trees with little effort (compared to manually post-editing the trees obtained from a completely automatic parser). In this paper we adapt the IPP framework and the IPP-Ann annotation tool for parse of the Spanish language, by using models obtained from the UAM Spanish Treebank. We performed user simulation experimentation and obtained objective evaluation metrics. The results establish that the IPP framework over the UAM Treebank shows important amounts of user effort reduction, comparable to the gains obtained when applying IPP to the English language on the Penn Treebank.Work supported by the EC (FEDER, FSE), the Spanish Government and Generalitat Valenciana (MICINN, ”Plan E”, under grants MIPRCV ”Consolider Ingenio 2010” CSD2007-00018, MIT-TRAL TIN2009-14633-C03-01, ALMPR Prometeo/2009/014 and FPU AP2006-01363)

Similar works

Full text

Open in the Core reader

Download PDF

Available Versions

Repositorio Institucional de la Universidad de Alicante

oai:rua.ua.es:10045/14716

Last time updated on 13/09/2013

RUA

oai:rua.ua.es:10045/14716

Last time updated on 09/04/2020