Dependency parsing of Turkish

Eryiğit, Gülşen; Nivre, Joakim; Oflazer, Kemal

Authors: Gülşen Eryiğit, Joakim Nivre, Kemal Oflazer
Publication date: 1 September 2006
Publisher: 'MIT Press - Journals'

Abstract

The suitability of different parsing methods for different languages is an important topic in syntactic parsing. Lesser-studied languages in particular, being typologically different from the languages for which the methods were originally developed, pose interesting challenges in this respect. This article presents an investigation of data-driven dependency parsing of Turkish, an agglutinative, free constituent order language that can be seen as representative of a wider class of languages of similar type. Our investigations show that morphological structure plays an essential role in finding syntactic relations in such a language. In particular, we show that employing sublexical representations called inflectional groups, rather than word forms, as the basic parsing units improves parsing accuracy. We compare two different parsing methods, one based on a probabilistic model with beam search, the other based on discriminative classifiers and a deterministic parsing strategy, and show that the usefulness of sublexical units holds regardless of the parsing method. We examine the impact of morphological and lexical information in detail and show that, properly used, this kind of information can improve parsing accuracy substantially. Applying the techniques presented in this article, we achieve the highest reported accuracy for parsing the Turkish Treebank.
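
To make the notion of inflectional groups concrete, the sketch below splits one morphological analysis into sublexical units. It is an illustration only, not the authors' implementation. It assumes the common convention in Turkish morphological analyses that a derivation boundary is written as `^DB` and that each stretch between boundaries forms one inflectional group. The function name `split_into_igs` and the sample analysis string are hypothetical and chosen for illustration.

```python
from typing import List

# Assumption: derivation boundaries in the analysis string are marked "^DB".
DERIVATION_BOUNDARY = "^DB"


def split_into_igs(analysis: str) -> List[str]:
    """Split a morphological analysis into inflectional groups (IGs).

    Each IG is the run of morphological tags between two derivation
    boundaries; the first IG also carries the root.
    """
    parts = analysis.split(DERIVATION_BOUNDARY)
    # Drop the "+" that follows a boundary, and skip empty fragments.
    return [part.lstrip("+") for part in parts if part]


# Illustrative analysis string (an assumption, not taken from the abstract):
# a noun in the locative case that is derived into a past-tense verb.
analysis = "araba+Noun+A3sg+P2pl+Loc^DB+Verb+Zero+Past+A3sg"
igs = split_into_igs(analysis)

for index, ig in enumerate(igs, start=1):
    print(f"IG {index}: {ig}")
```

Under this convention a single word form can yield several parsing units, so a dependency link can attach to a specific inflectional group instead of to the word as a whole.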
