Multilingual Cross-domain Perspectives on Online Hate Speech

Daelemans, Walter; De Pauw, Guy; De Smedt, Tom; Gwóźdź, Maja; Jaki, Sylvia; Kotzé, Eduan; Saoud, Leïla

research

Multilingual Cross-domain Perspectives on Online Hate Speech

Authors: Walter Daelemans
Guy De Pauw
Tom De Smedt
Maja Gwóźdź
Sylvia Jaki
Eduan Kotzé
Leïla Saoud
Publication date: 1 January 2018
Publisher

Abstract

In this report, we present a study of eight corpora of online hate speech, by demonstrating the NLP techniques that we used to collect and analyze the jihadist, extremist, racist, and sexist content. Analysis of the multilingual corpora shows that the different contexts share certain characteristics in their hateful rhetoric. To expose the main features, we have focused on text classification, text profiling, keyword and collocation extraction, along with manual annotation and qualitative study.Comment: 24 page

Similar works

Full text

Available Versions

Institutional Repository Universiteit Antwerpen

c:irua:156589

Last time updated on 09/08/2019