84 research outputs found
An Integrated Framework for Treebanks and Multilayer Annotations
Treebank formats and associated software tools are proliferating rapidly,
with little consideration for interoperability. We survey a wide variety of
treebank structures and operations, and show how they can be mapped onto the
annotation graph model, and leading to an integrated framework encompassing
tree and non-tree annotations alike. This development opens up new
possibilities for managing and exploiting multilayer annotations.Comment: 8 page
Evalita\u2709 Parsing Task: comparing dependency parsers and treebanks
The aim of Evalita Parsing Task is at defining and extending Italian state of the art parsing by encouraging the application of existing models and approaches. As in the Evalita\u2707, the Task is organized around two tracks, i.e. Dependency Parsing and Constituency Parsing. As a main novelty with respect to the previous edition, the Dependency Parsing track has been articulated into two subtasks, differing at the level of the used treebanks, thus creating the prerequisites for assessing the impact of different annotation schemes on the parsers performance. In this paper, we describe the Dependency Parsing track by presenting the data sets for development and testing, reporting the test results and providing a first comparative analysis of these results, also with respect to state of the art parsing technologies
Developing a multilayer semantic annotation scheme based on ISO standards for the visualization of a newswire corpus
In this paper, we describe the process of
developing a multilayer semantic
annotation scheme designed for extracting
information from a European Portuguese
corpus of news articles, at three levels,
temporal, referential and semantic role
labelling. The novelty of this scheme is the
harmonization of parts 1, 4 and 9 of the ISO
24617 Language resource management -
Semantic annotation framework. This
annotation framework includes a set of
entity structures (participants, events,
times) and a set of links (temporal,
aspectual, subordination, objectal and
semantic roles) with several tags and
attribute values that ensure adequate
semantic and visual representations of news
stories
When Collaborative Treebank Curation Meets Graph Grammars: Arborator With a Grew Back-End
International audienceIn this paper we present Arborator-Grew, a collaborative annotation tool for treebank development. Arborator-Grew combines the features of two preexisting tools: Arborator and Grew. Arborator is a widely used collaborative graphical online dependency treebank annotation tool. Grew is a tool for graph querying and rewriting specialized in structures needed in NLP, i.e. syntactic and semantic dependency trees and graphs. Grew also has an online version, Grew-match, where all Universal Dependencies treebanks in their classical, deep and surface-syntactic flavors can be queried. Arborator-Grew is a complete redevelopment and modernization of Arborator, replacing its own internal database storage by a new Grew API, which adds a powerful query tool to Arborator's existing treebank creation and correction features. This includes complex access control for parallel expert and crowd-sourced annotation, tree comparison visualization, and various exercise modes for teaching and training of annotators. Arborator-Grew opens up new paths of collectively creating, updating, maintaining, and curating syntactic treebanks and semantic graph banks
- …