Search CORE

1 research outputs found

Automatic Identification of Maghreb Dialects Using a Dictionary-Based Approach

Author: Choukri Khalid
Fluhr Christian
Saadane Houda
Seffih Hosni
Semmar Nasredine
Publication venue: HAL CCSD
Publication date: 07/05/2018
Field of study

International audienceAutomatic identification of Arabic dialects in a text is a difficult task, especially for Maghreb languages and when they are written in Arabic or Latin characters (Arabizi). These texts are characterized by the use of code-switching between the Modern Standard Arabic (MSA) and the Arabic Dialect (AD) in the texts written in Arabic, or between Arabizi and foreign languages for those written in Latin. This paper presents the specific resources and tools we have developed for this purpose, with a focus on the transliteration of Arabizi into Arabic (using the dedicated tools for Arabic dialects). A dictionary-based approach to detect the dialectal origin of a text is described, it exhibits satisfactory results

Hal - Université Grenoble Alpes

HAL-CEA