LT@Helsinki at SemEval-2020 Task 12 : Multilingual or language-specific BERT?

Kajava, Kaisla; Pàmies, Marc; Tiedemann, Jörg; Öhman, Emily

LT@Helsinki at SemEval-2020 Task 12 : Multilingual or language-specific BERT?

Authors: Kaisla Kajava
Marc Pàmies
Jörg Tiedemann
Emily Öhman
Publication date: 1 January 2020
Publisher: International Committee for Computational Linguistics

Abstract

This paper presents the different models submitted by the LT@Helsinki team for the SemEval2020 Shared Task 12. Our team participated in sub-tasks A and C; titled offensive language identification and offense target identification, respectively. In both cases we used the so called Bidirectional Encoder Representation from Transformer (BERT), a model pre-trained by Google and fine-tuned by us on the OLID dataset. The results show that offensive tweet classification is one of several language-based tasks where BERT can achieve state-of-the-art results.Peer reviewe

Similar works

Full text

Open in the Core reader

Download PDF

Available Versions

Helsingin yliopiston digitaalinen arkisto

oai:helda.helsinki.fi:10138/34...

Last time updated on 28/02/2022