Comparative evaluation of Arabic language morphological analysers and stemmers

Sawalha, M.; Atwell, E.S.

oai:eprints.whiterose.ac.uk:42635

Authors: M. Sawalha, E.S. Atwell
Publication date: 1 January 2008
Publisher: Coling 2008 Organizing Committee

Abstract

Arabic morphological analysers and stemming algorithms have become a popular area of research, and many computational linguists have designed and developed algorithms to address the problems of morphological analysis and stemming. However, each researcher has proposed their own gold standard, testing methodology and accuracy measurements for evaluating their algorithm, so the reported results cannot be compared directly. In this paper we accomplish two tasks. First, we propose four fair and precise accuracy measurements and two 1000-word gold standards, taken from the Holy Qur’an and from the Corpus of Contemporary Arabic. Second, we combine the results of the morphological analysers and stemming algorithms by voting after running them on the sample documents. The evaluation of the algorithms shows that Arabic morphology is still a challenge.
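
As a rough illustration of the voting step mentioned in the abstract, the sketch below combines the per-word outputs of several systems by simple plurality and scores each result against a gold standard. It is only a minimal sketch under stated assumptions: the abstract does not define the voting scheme or the four accuracy measurements, so the tie-breaking rule, the exact-match accuracy, the function names `vote` and `exact_match_accuracy`, the system names, and the toy strings are all hypothetical and are not taken from the paper.

```python
from collections import Counter


def vote(candidates):
    """Return the most frequent candidate for one word.

    Assumption: ties are broken in favour of the system listed first.
    The paper's actual tie-breaking rule is not given in the abstract.
    """
    counts = Counter(candidates)
    best = max(counts.values())
    for candidate in candidates:
        if counts[candidate] == best:
            return candidate


def exact_match_accuracy(predicted, gold):
    """Fraction of words whose predicted root equals the gold root.

    Assumption: plain exact match, used here only for illustration;
    it is not claimed to be one of the paper's four measurements.
    """
    if len(predicted) != len(gold):
        raise ValueError("predicted and gold must have the same length")
    if not gold:
        return 0.0
    correct = sum(p == g for p, g in zip(predicted, gold))
    return correct / len(gold)


# Toy, made-up data: placeholder strings standing in for roots.
gold = ["ktb", "drs", "Elm", "qwl"]
system_outputs = {
    "system_a": ["ktb", "drs", "Elm", "qAl"],
    "system_b": ["ktb", "dArs", "Elm", "qwl"],
    "system_c": ["kAtb", "drs", "Elm", "qwl"],
}

# Vote word by word across all systems.
voted = [vote(list(outputs)) for outputs in zip(*system_outputs.values())]

for name, outputs in system_outputs.items():
    print(name, exact_match_accuracy(outputs, gold))
print("voted", exact_match_accuracy(voted, gold))
```

In this toy case each system makes a different mistake, so the voted output can agree with the gold standard on words where an individual system does not. Real outputs need not behave this way, since systems may share the same errors.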

Last updated on 01/12/2017

This paper was published in White Rose Research Online.
