A Corpus Based Morphological Disambiguation Tool For Basque
- Publication date
- Publisher
Abstract
This paper presents the methodology followed in the construction of a surfacebased morphosyntactic parsing grammar as well as the results obtained. It is based on the Constraint Grammar formalism which we find suitable for our project of analysing unrestricted texts. Besides, we will present a description of the main types of morphosyntactic ambiguity that we have identified for Basque and the disambiguation rules designed for their treatment. This work is the first step in the computational treatment of syntax. Keywords: basque language, disambiguation, Constraint Grammar. 1. Introduction This paper describes the design of morphological disambiguation rules as a first step to develop a robust grammar of Basque, conceived as a general basis for different applications, such as a lemmatiser/tagger (Aduriz et al., 96) and a syntactic corrector (Gojenola and Sarasola, 94). We have chosen the Constraint Grammar (CG) formalism (Karlsson et al., 95; Voutilainen, 94; Tapanainen and Voutilain..