Skip to main content
Article thumbnail
Location of Repository

Creating and maintaining language resources: the main guidelines of the Victoria project

By Lionel Nicolas, Miguel Molinero, Benoît Sagot, Nieves Fernández Formoso and Vanesa Vidal Castro

Abstract

International audienceMany Natural Language Processing (NLP) tools rely on the availability of reliable language resources (LRs). Moreover, even when such LRs are available for a given language, their quality or coverage sometimes prevent them from being used in complex NLP systems. Considering the attention received from both the academic and industrial worlds and the significant efforts achieved during the past decades for LR development, such a lack of high quality and wide-coverage LR shows how difficult their creation and maintainance can be. In this paper, we describe a set of guidelines applied within the Victoria project in order to ease the creation and correction of the LRs required for symbolic parsing. These generic guidelines should be easy to adapt and use for the production of other types of LRs

Topics: [ INFO.INFO-CL ] Computer Science [cs]/Computation and Language [cs.CL]
Publisher: HAL CCSD
Year: 2010
OAI identifier: oai:HAL:inria-00521241v1
Provided by: Hal-Diderot

Suggested articles


To submit an update or takedown request for this paper, please submit an Update/Correction/Removal Request.