Search CORE

21 research outputs found

Good, but not always Fair: An Evaluation of Gender Bias for three Commercial Machine Translation Systems

Author: Bentivogli Luisa
Piazzolla Silvia Alma
Savoldi Beatrice
Publication venue: Aarhus University, Faculty of Arts, School of Communication and Culture
Publication date: 31/12/2023

Machine Translation (MT) continues to make significant strides in quality and is increasingly adopted on a larger scale. Consequently, analyses have been redirected to more nuanced aspects, intricate phenomena, as well as potential risks that may arise from the widespread use of MT tools. Along this line, this paper offers a meticulous assessment of three commercial MT systems - Google Translate, DeepL, and Modern MT - with a specific focus on gender translation and bias. For three language pairs (English-Spanish, English-Italian, and English-French), we scrutinize the behavior of such systems at several levels of granularity and on a variety of naturally occurring gender phenomena in translation. Our study takes stock of the current state of online MT tools, by revealing significant discrepancies in the gender translation of the three systems, with each system displaying varying degrees of bias despite their overall translation quality.

Tidsskrift.dk (Det Kongelige Bibliotek)

Good, but not always Fair: An Evaluation of Gender Bias for three commercial Machine Translation Systems

Author: Bentivogli Luisa
Piazzolla Silvia Alma
Savoldi Beatrice
Publication date: 09/06/2023

Machine Translation (MT) continues to make significant strides in quality and is increasingly adopted on a larger scale. Consequently, analyses have been redirected to more nuanced aspects, intricate phenomena, as well as potential risks that may arise from the widespread use of MT tools. Along this line, this paper offers a meticulous assessment of three commercial MT systems - Google Translate, DeepL, and Modern MT - with a specific focus on gender translation and bias. For three language pairs (English/Spanish, English/Italian, and English/French), we scrutinize the behavior of such systems at several levels of granularity and on a variety of naturally occurring gender phenomena in translation. Our study takes stock of the current state of online MT tools, by revealing significant discrepancies in the gender translation of the three systems, with each system displaying varying degrees of bias despite their overall translation quality. Comment: Under review at HERMES Journal

arXiv.org e-Print Archive

Test Suites Task: Evaluation of Gender Fairness in MT with MuST-SHE and INES

Author: Bentivogli Luisa
Gaido Marco
Negri Matteo
Savoldi Beatrice
Publication date: 01/01/2023

As part of the WMT-2023 “Test suites” shared task, in this paper we summarize the results of two test suites evaluations: MuST-SHE-WMT23 and INES. By focusing on the en-de and de-en language pairs, we rely on these newly created test suites to investigate systems’ ability to translate feminine and masculine gender and produce gender-inclusive translations. Furthermore we discuss metrics associated with our test suites and validate them by means of human evaluations. Our results indicate that systems achieve reasonable and comparable performance in correctly translating both feminine and masculine gender forms for naturalistic gender phenomena. Instead, the generation of inclusive language forms in translation emerges as a challenging task for all the evaluated MT models, indicating room for future improvements and research on the topic. We make MuST-SHE-WMT23 and INES freely available.

Archivio della ricerca - Fondazione Bruno Kessler

Test Suites Task: Evaluation of Gender Fairness in MT with MuST-SHE and INES

Author: Bentivogli Luisa
Gaido Marco
Negri Matteo
Savoldi Beatrice
Publication date: 30/10/2023

As part of the WMT-2023 "Test suites" shared task, in this paper we summarize the results of two test suites evaluations: MuST-SHE-WMT23 and INES. By focusing on the en-de and de-en language pairs, we rely on these newly created test suites to investigate systems' ability to translate feminine and masculine gender and produce gender-inclusive translations. Furthermore we discuss metrics associated with our test suites and validate them by means of human evaluations. Our results indicate that systems achieve reasonable and comparable performance in correctly translating both feminine and masculine gender forms for naturalistic gender phenomena. Instead, the generation of inclusive language forms in translation emerges as a challenging task for all the evaluated MT models, indicating room for future improvements and research on the topic. Comment: Accepted at WMT 2023

arXiv.org e-Print Archive

On the Dynamics of Gender Learning in Speech Translation

Author: Beatrice Savoldi
Luisa Bentivogli
Marco Gaido
Marco Turchi
Matteo Negri
Publication venue: Association for Computational Linguistics (ACL)
Publication date: 01/01/2022

Due to the complexity of bias and the opaque nature of current neural approaches, there is a rising interest in auditing language technologies. In this work, we contribute to such a line of inquiry by exploring the emergence of gender bias in Speech Translation (ST). As a new perspective, rather than focusing on the final systems only, we examine their evolution over the course of training. In this way, we are able to account for different variables related to the learning dynamics of gender translation, and investigate when and how gender divides emerge in ST. Accordingly, for three language pairs (en → es, fr, it) we compare how ST systems behave for masculine and feminine translation at several levels of granularity. We find that masculine and feminine curves are dissimilar, with the feminine one being characterized by more erratic behaviour and late improvements over the course of training. Also, depending on the considered phenomena, their learning trends can be either antiphase or parallel. Overall, we show how such a progressive analysis can inform on the reliability and time-wise acquisition of gender, which is concealed by static evaluations and standard metrics.

Archivio della ricerca - Fondazione Bruno Kessler

Gender Neutralization for an Inclusive Machine Translation: from Theoretical Foundations to Open Challenges

Author: Andrea Piergentili
Beatrice Savoldi
Dennis Fucci
Luisa Bentivogli
Matteo Negri
Publication venue: European Association for Machine Translation
Publication date: 01/01/2023

Gender inclusivity in language technologies has become a prominent research topic. In this study, we explore gender-neutral translation (GNT) as a form of gender inclusivity and a goal to be achieved by machine translation (MT) models, which have been found to perpetuate gender bias and discrimination. Specifically, we focus on translation from English into Italian, a language pair representative of salient gender-related linguistic transfer problems. To define GNT, we review a selection of relevant institutional guidelines for gender-inclusive language, discuss its scenarios of use, and examine the technical challenges of performing GNT in MT, concluding with a discussion of potential solutions to encourage advancements toward greater inclusivity in MT.

Archivio della ricerca - Fondazione Bruno Kessler

Good, but not always Fair: An Evaluation of Gender Bias for three Commercial Machine Translation Systems

Author: Beatrice Savoldi
Luisa Bentivogli
Silvia Alma Piazzolla
Publication venue: Aarhus University
Publication date: 01/01/2024

Directory of Open Access Journals

Gender in Danger? Evaluating Speech Translation Technology on the MuST-SHE Corpus

Author: Bentivogli Luisa
Cattoni Roldano
Di Gangi Mattia Antonino
Negri Matteo
Savoldi Beatrice
Turchi Marco
Publication date: 10/06/2020

Translating from languages without productive grammatical gender like English into gender-marked languages is a well-known difficulty for machines. This difficulty is also due to the fact that the training data on which models are built typically reflect the asymmetries of natural languages, gender bias included. Exclusively fed with textual data, machine translation is intrinsically constrained by the fact that the input sentence does not always contain clues about the gender identity of the referred human entities. But what happens with speech translation, where the input is an audio signal? Can audio provide additional information to reduce gender bias? We present the first thorough investigation of gender bias in speech translation, contributing with: i) the release of a benchmark useful for future studies, and ii) the comparison of different technologies (cascade and end-to-end) on two language directions (English-Italian/French). Comment: 9 pages of content, accepted at ACL 2020

arXiv.org e-Print Archive

Archivio della ricerca - Fondazione Bruno Kessler

Upright BPPV Protocol: Feasibility of a New Diagnostic Paradigm for Lateral Semicircular Canal Benign Paroxysmal Positional Vertigo Compared to Standard Diagnostic Maneuvers

Author: Alfonso Scarpa
Andrea Castellucci
Andrea Gallo
Andrea Stolfa
Angelo Ghidini
Antonio Greco
Cecilia Botti
Elisabetta Rebecchi
Enrico Armato
Ettore Cassandro
Francesco Comacchio
Giacinto Asprella Libonati
Beatrice Giannoni
Giulio Pagliuca
Giuseppe Attanasio
Luigi Califano
Luisa Savoldi
Marco de Vincentiis
Marco Lucio Manfrin
Marta Mion
Massimo Ralli
Pasquale Malara
Rudi Pecci
Salvatore Martellucci
Silvia Quaglieri
Veronica Clemenzi
Vincenzo Marcelli
Publication venue: Frontiers Media SA
Publication date: 01/01/2020

Florence Research

Gender bias and Machine Translation: On first looking into parallel corpora

Author: Beatrice Savoldi
Luisa Bentivogli
Publication date: 01/01/2021

Archivio della ricerca - Fondazione Bruno Kessler