Fair Abstractive Summarization of Diverse Perspectives

Fabbri, Alexander; Kamoi, Ryo; Liu, Junru; Liu, Yixin; Lu, Xiaoxin; McKeown, Kathleen; Radev, Dragomir; Xiong, Caiming; Zhang, Nan; Zhang, Rui; Zhang, Yusen; Zhao, Jieyu

Fair Abstractive Summarization of Diverse Perspectives

Authors: Alexander Fabbri
Ryo Kamoi
Junru Liu
Yixin Liu
Xiaoxin Lu
Kathleen McKeown
Dragomir Radev
Caiming Xiong
Nan Zhang
Rui Zhang
Yusen Zhang
Jieyu Zhao
Publication date: 13 November 2023
Publisher

Abstract

People from different social and demographic groups express diverse perspectives and conflicting opinions on a broad set of topics such as product reviews, healthcare, law, and politics. A fair summary should provide a comprehensive coverage of diverse perspectives without underrepresenting certain groups. However, current work in summarization metrics and Large Language Models (LLMs) evaluation has not explored fair abstractive summarization. In this paper, we systematically investigate fair abstractive summarization for user-generated data. We first formally define fairness in abstractive summarization as not underrepresenting perspectives of any groups of people and propose four reference-free automatic metrics measuring the differences between target and source perspectives. We evaluate five LLMs, including three GPT models, Alpaca, and Claude, on six datasets collected from social media, online reviews, and recorded transcripts. Experiments show that both the model-generated and the human-written reference summaries suffer from low fairness. We conduct a comprehensive analysis of the common factors influencing fairness and propose three simple but effective methods to alleviate unfair summarization. Our dataset and code are available at https://github.com/psunlpgroup/FairSumm.Comment: 19 pages, 10 figure

Similar works

Full text

Available Versions

arXiv.org e-Print Archive

oai:arXiv.org:2311.07884

Last time updated on 10/02/2024