Exploring Conditional Language Model Based Data Augmentation Approaches For Hate Speech Classification

Fohr, Dominique; Geet d'Sa, Ashwin; Illina, Irina; Klakow, Dietrich; Ruiter, Dana

Exploring Conditional Language Model Based Data Augmentation Approaches For Hate Speech Classification

Authors: Dominique Fohr
Ashwin Geet d'Sa
Irina Illina
Dietrich Klakow
Dana Ruiter
Publication date: 6 September 2021
Publisher: HAL CCSD

Abstract

International audienceDeep Neural Network (DNN) based classifiers have gained increased attention in hate speech classification. However, the performance of DNN classifiers increases with quantity of available training data and in reality, hate speech datasets consist of only a small amount of labeled data. To counter this, Data Augmentation (DA) techniques are often used to increase the number of labeled samples and therefore, improve the classifier's performance. In this article, we explore augmentation of training samples using a conditional language model. Our approach uses a single class conditioned Generative Pre-Trained Transformer-2 (GPT-2) language model for DA, avoiding the need for multiple class specific GPT-2 models. We study the effect of increasing the quantity of the augmented data and show that adding a few hundred samples significantly improves the classifier's performance. Furthermore, we evaluate the effect of filtering the generated data used for DA. Our approach demonstrates up to 7.3% and up to 25.0% of relative improvements in macro-averaged F1 on two widely used hate speech corpora

Similar works

Full text

Open in the Core reader

Download PDF

Available Versions

INRIA a CCSD electronic archive server

oai:HAL:hal-03244472v1

Last time updated on 30/12/2021