research

Design and Implementing Of Multilingual Hadith Corpus

Abstract

In this paper, we want to establish the first design of Multilingual Hadith Corpus. The Hadith original language is Arabic and we decide to select English, French and Russian as extra languages for Hadith translation. Design the Hadith corpus will be in four steps, the first step is data collection, which will be from the internet because it is considered as the biggest corpora, second step cleaning the data, step three file generation and the last step is file annotation using XML

    Similar works