Turkish Native Language Identification

Schneider, Gerold; Uluslu, Ahmet Yavuz

Authors: Gerold Schneider, Ahmet Yavuz Uluslu
Publication date: 28 July 2023

Abstract

In this paper, we present the first application of Native Language Identification (NLI) to the Turkish language. NLI is the task of predicting a writer's first language (L1) by analysing their writing in a language other than their first (L2). While most NLI research has focused on English, our study extends its scope to Turkish. We used the recently constructed Turkish Learner Corpus and applied a combination of three syntactic features (CFG production rules, part-of-speech n-grams, and function words) to L2 texts to demonstrate their effectiveness in this task.
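
As an illustration of two of the feature types named in the abstract, the following is a minimal sketch of part-of-speech n-gram and function-word counting. It is not the authors' implementation: the function names, the `FUNCTION_WORDS` list, and the hand-tagged example sentence are all hypothetical, and the input is assumed to be already tokenised and POS-tagged. CFG production rules are left out because they would require a syntactic parser.

```python
from collections import Counter

# Hypothetical toy list of Turkish function words; NOT the list used in the paper.
FUNCTION_WORDS = {"ve", "bir", "bu", "de", "da", "için", "ile"}


def pos_ngrams(tags, n):
    """Count contiguous n-grams over a sequence of POS tags."""
    return Counter(tuple(tags[i:i + n]) for i in range(len(tags) - n + 1))


def function_word_counts(tokens):
    """Count tokens that appear in FUNCTION_WORDS.

    Note: str.lower() is not Turkish-aware (dotted/dotless I), so a real
    pipeline would need locale-specific case folding.
    """
    return Counter(t.lower() for t in tokens if t.lower() in FUNCTION_WORDS)


def extract_features(tagged_sentence, n=2):
    """Merge POS n-gram and function-word counts into one feature mapping."""
    tokens = [tok for tok, _ in tagged_sentence]
    tags = [tag for _, tag in tagged_sentence]
    features = Counter()
    for gram, count in pos_ngrams(tags, n).items():
        features["pos:" + "_".join(gram)] = count
    for word, count in function_word_counts(tokens).items():
        features["fw:" + word] = count
    return features


# Hand-tagged example (assumed tags): "Bu kitap ve bir kalem" ("this book and a pen")
sentence = [
    ("Bu", "DET"),
    ("kitap", "NOUN"),
    ("ve", "CCONJ"),
    ("bir", "DET"),
    ("kalem", "NOUN"),
]
features = extract_features(sentence)
```

Feature mappings of this kind could then be vectorised and passed to any standard classifier; which classifier and feature weighting the paper used is not stated in this record.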

Available Versions

arXiv.org e-Print Archive

oai:arXiv.org:2307.14850

Last time updated on 04/08/2023