On the Limitations of Sociodemographic Adaptation with Transformers

Glavaš, Goran; Hovy, Dirk; Hung, Chia-Chien; Lauscher, Anne; Ponzetto, Simone Paolo

On the Limitations of Sociodemographic Adaptation with Transformers

Authors: Goran Glavaš
Dirk Hovy
Chia-Chien Hung
Anne Lauscher
Simone Paolo Ponzetto
Publication date: 1 August 2022
Publisher

Abstract

Sociodemographic factors (e.g., gender or age) shape our language. Previous work showed that incorporating specific sociodemographic factors can consistently improve performance for various NLP tasks in traditional NLP models. We investigate whether these previous findings still hold with state-of-the-art pretrained Transformers. We use three common specialization methods proven effective for incorporating external knowledge into pretrained Transformers (e.g., domain-specific or geographic knowledge). We adapt the language representations for the sociodemographic dimensions of gender and age, using continuous language modeling and dynamic multi-task learning for adaptation, where we couple language modeling with the prediction of a sociodemographic class. Our results when employing a multilingual model show substantial performance gains across four languages (English, German, French, and Danish). These findings are in line with the results of previous work and hold promise for successful sociodemographic specialization. However, controlling for confounding factors like domain and language shows that, while sociodemographic adaptation does improve downstream performance, the gains do not always solely stem from sociodemographic knowledge. Our results indicate that sociodemographic specialization, while very important, is still an unresolved problem in NLP

Similar works

Full text

Available Versions

arXiv.org e-Print Archive

oai:arXiv.org:2208.01029

Last time updated on 06/10/2022