Search CORE

30 research outputs found

Dependency parsing with an extended finite-state approach

Author: Kaplan Ronald M
Kemal Oflazer
Publication venue: 'MIT Press - Journals'
Publication date: 01/01/2003
Field of study

This article presents a dependency parsing scheme using an extended finite-state approach. The parser augments input representation with "channels" so that links representing syntactic dependency relations among words can be accommodated and iterates on the input a number of times to arrive at a fixed point. Intermediate configurations violating various constraints of projective dependency representations such as no crossing links and no independent items except sentential head are filtered via finite-state filters. We have applied the parser to dependency parsing of Turkish

CiteSeerX

Crossref

Sabanci University Research Database

An infrastructure for Turkish prosody generation in text-to-speech synthesis

Author: Kulekci M. Oguzhan
Külekçi M. Oğuzhan
Oflazer Kemal
Publication venue
Publication date: 01/06/2006
Field of study

Text-to-speech engines benefit from natural language processing while generating the appropriate prosody. In this study, we investigate the natural language processing infrastructure for Turkish prosody generation in three steps as pronunciation disambiguation, phonological phrase detection and intonation level assignment. We focus on phrase boundary detection and intonation assignment. We propose a phonological phrase detection scheme based on syntactic analysis for Turkish and assign one of three intonation levels to words in detected phrases. Empirical observations on 100 sentences show that the proposed scheme works with approximately 85% accuracy

Sabanci University Research Database

Universal dependencies for Turkish

Author: Eryiğit Gülşen
Gökırmak Memduh
Nivre Joakim
Sulubacak Umut
Tyers Francis
Çöltekin Çağrı
Publication venue: The Association for Computational Linguistics
Publication date: 01/12/2016
Field of study

The Universal Dependencies (UD) project was conceived after the substantial recent interest in unifying annotation schemes across languages. With its own annotation principles and abstract inventory for parts of speech, morphosyntactic features and dependency relations, UD aims to facilitate multilingual parser development, cross-lingual learning, and parsing research from a language typology perspective. This paper presents the Turkish IMST-UD Treebank, the first Turkish treebank to be in a UD release. The IMST-UD Treebank was automatically converted from the IMST Treebank, which was also recently released. We describe this conversion procedure in detail, complete with mapping tables. We also present our evaluation of the parsing performances of both versions of the IMST Treebank. Our findings suggest that the UD framework is at least as viable for Turkish as the original annotation framework of the IMST Treebank.Peer reviewe

Helsingin yliopiston digitaalinen arkisto

The incremental use of morphological information and lexicalization in data-driven dependency parsing

Author: Eryigit Gulsen
Eryiğit Gülşen
Nivre Joakim
Oflazer Kemal
Publication venue: 'Springer Fachmedien Wiesbaden GmbH'
Publication date: 01/12/2006
Field of study

Typological diversity among the natural languages of the world poses interesting challenges for the models and algorithms used in syntactic parsing. In this paper, we apply a data-driven dependency parser to Turkish, a language characterized by rich morphology and flexible constituent order, and study the effect of employing varying amounts of morpholexical information on parsing performance. The investigations show that accuracy can be improved by using representations based on inflectional groups rather than word forms, confirming earlier studies. In addition, lexicalization and the use of rich morphological features are found to have a positive effect. By combining all these techniques, we obtain the highest reported accuracy for parsing the Turkish Treebank

Sabanci University Research Database

Statistical dependency parsing of Turkish

Author: Eryigit Gulsen
Eryiğit Gülşen
Oflazer Kemal
Publication venue
Publication date: 01/04/2006
Field of study

This paper presents results from the first statistical dependency parser for Turkish. Turkish is a free-constituent order language with complex agglutinative inflectional and derivational morphology and presents interesting challenges for statistical parsing, as in general, dependency relations are between “portions” of words called inflectional groups. We have explored statistical models that use different representational units for parsing. We have used the Turkish Dependency Treebank to train and test our parser but have limited this initial exploration to that subset of the treebank sentences with only left-to-right non-crossing dependency links. Our results indicate that the best accuracy in terms of the dependency relations between inflectional groups is obtained when we use inflectional groups as units in parsing, and when contexts around the dependent are employed

Sabanci University Research Database

Morphology-Syntax interface for Turkish LFG

Author: Cetinoglu Ozlem
Oflazer Kemal
Çetinoğlu Özlem
Publication venue
Publication date: 01/01/2006
Field of study

This paper investigates the use of sublexical units as a solution to handling the complex morphology with productive derivational processes, in the development of a lexical functional grammar for Turkish. Such sublexical units make it possible to expose the internal structure of words with multiple derivations to the grammar rules in a uniform manner. This in turn leads to more succinct and manageable rules. Further, the semantics of the derivations can also be systematically reflected in a compositional way by constructing PRED values on the fly. We illustrate how we use sublexical units for handling simple productive derivational morphology and more interesting cases such as causativization, etc., which change verb valency. Our priority is to handle several linguistic phenomena in order to observe the effects of our approach on both the c-structure and the f-structure representation, and grammar writing, leaving the coverage and evaluation issues aside for the moment

Crossref

Sabanci University Research Database

Türkçe cümlelerin kural tabanlı bağlılık analizi

Author: Adalı Eşref
Eryiğit Gülşen
Oflazer Kemal
Publication venue
Publication date: 01/06/2006
Field of study

Bu makalede, Türkçe cümlelerin kural tabanlı bağlılık analizi yöntemi ile ayrıştırılmaları sonucunda elde edilen başarım sunulmaktadır. Çalışma, test verisi olarak kullanılan ODTÜ-Sabancı Ağaç Yapılı Derlemi'nin bütünü üzerindeki ilk kural tabanlı sonuçları içermektedir. Uygulanan ayrıştırma algoritması ve kural yapıları detaylı olarak verilmiştir. Sonuçlar Türkçe'nin Bağlılık Analizi konusunda yapılacak çalışmalara temel olma niteliğindedir

Sabanci University Research Database

Egy pszicholingvisztikai indíttatású számítógépes nyelvfeldolgozási modell felé

Author: Indig Balázs
Miháltz Márton
Prószéky Gábor
Sass Bálint
Publication venue: Szegedi Tudományegyetem Informatikai Tanszékcsoport
Publication date: 01/01/2014
Field of study

Repository of the Academy's Library

Towards a Psycholinguistically Motivated Performance-Based Parsing Model

Author: Indig Balázs
Publication venue: Pázmány Egyetem, ePress
Publication date: 01/01/2014
Field of study

Repository of the Academy's Library