258 research outputs found

Orbital Stability Analysis for Perturbed Nonlinear Systems and Natural Entrainment via Adaptive Andronov-Hopf Oscillator

Author: Iwasaki Tetsuya
Zhao Jinxin
Publication venue: eScholarship, University of California
Publication date: 01/01/2020

eScholarship - University of California

Dielectric spectroscopy investigation on the interaction of poly(diallyldimethylammonium chloride) with sodium decyl sulfate in aqueous solution

Author: Chen Zhen
Li Xinwei
Xiao Jinxin
Yang Likun
Zhao Kongshuang
Publication venue: 'American Chemical Society (ACS)'
Publication date: 01/05/2011

UQ eSpace (University of Queensland)

Mass Transfer Performance of a Water-Sparged Aerocyclone Reactor and Its Application in Wastewater Treatment

Author: Fuping Wang
Jinxin Xiang
Qinghua Zhao
Xuejun Quan
Zhiliang Cheng
Publication venue: 'IntechOpen'
Publication date: 26/10/2011

IntechOpen

Crossref

FairBench: A Four-Stage Automatic Framework for Detecting Stereotypes and Biases in Large Language Models

Author: Bai Yanhong
He Liang
Shi Jinxin
Wei Tingjiang
Wu Xingjiao
Zhao Jiabao
Publication date: 20/08/2023

Detecting stereotypes and biases in Large Language Models (LLMs) can enhance fairness and reduce adverse impacts on individuals or groups when these LLMs are applied. However, the majority of existing methods focus on measuring the model's preference towards sentences containing biases and stereotypes within datasets, an approach that lacks interpretability and cannot detect implicit biases and stereotypes in the real world. To address this gap, this paper introduces a four-stage framework to directly evaluate stereotypes and biases in the generated content of LLMs, including direct inquiry testing, serial or adapted story testing, implicit association testing, and unknown situation testing. Additionally, the paper proposes multi-dimensional evaluation metrics and explainable zero-shot prompts for automated evaluation. Using the education sector as a case study, we constructed Edu-FairBench based on the four-stage framework; it encompasses 12,632 open-ended questions covering nine sensitive factors and 26 educational scenarios. Experimental results reveal varying degrees of stereotypes and biases in five LLMs evaluated on Edu-FairBench. Moreover, the results of our proposed automated evaluation method show a high correlation with human annotations.

arXiv.org e-Print Archive