Using Mechanical Turk to Create a Corpus of Arabic Summaries

El-Haj, M; Kruschwitz, U; Fox, C

research

oai:repository.essex.ac.uk:4064

Using Mechanical Turk to Create a Corpus of Arabic Summaries

Authors: M El-Haj
U Kruschwitz
C Fox
Publication date: 1 January 2010
Publisher: European Language Resources Association

Abstract

This paper describes the creation of a human-generated corpus of extractive Arabic summaries of a selection of Wikipedia and Arabic newspaper articles using Mechanical Turk?an online workforce. The purpose of this exercise was two-fold. First, it addresses a shortage of relevant data for Arabic natural language processing. Second, it demonstrates the application of Mechanical Turk to the problem of creating natural language resources. The paper also reports on a number of evaluations we have performed to compare the collected summaries against results obtained from a variety of automatic summarisation systems

Similar works

Full text

Open in the Core reader

Download PDF

University of Essex Research Repository

oai:repository.essex.ac.uk:406...

Last time updated on 14/09/2013

This paper was published in University of Essex Research Repository.

Having an issue?

Is data on this page outdated, violates copyrights or anything else? Report the problem now and we will take corresponding actions after reviewing your request.