Detecting Hate Speech Online: A Case of Croatian

A Buyse; A Jakubowicz; I Gagliardone; Jelena Marković; M Monteleone; M Silberztein; MM Barrios

Publication date: 1 January 2020
Publisher: Springer Science and Business Media LLC

Abstract

This project proposes a NooJ algorithm designed to find and categorize slurs, insults and, ultimately, hate speech in Croatian. The results also provide more detailed insight into inappropriate language in Croatian. We strongly emphasize the ethical considerations of (mis)identifying hate speech, which can result in unethical and undeserved censorship of speech that is inappropriate but free. We therefore tried to draw a clear distinction between insults and hate speech. The test corpus consists of written comments and remarks posted on five Croatian Facebook news pages during a one-week period. Given the differences between standard Croatian grammar and syntax and the language actually used in informal online communication, false negatives present the biggest difficulty: some variations (substandard case usage, spelling errors, colloquialisms) are impossible to predict and therefore extremely hard to implement in the algorithm.

Available Versions

Open Repository of the University of Zagreb Faculty of Humanities and Social Sciences

oai:repozitorij.ffzg.unizg.hr:...

Last time updated on 22/01/2020

Crossref

Last time updated on 10/08/2021

University of Zagreb Repository

oai:repozitorij.unizg.hr:ffzg_...

Last time updated on 20/02/2020