Simpler becomes Harder: Do LLMs Exhibit a Coherent Behavior on
  Simplified Corpora?

Anschütz, Miriam; Groh, Georg; Mosca, Edoardo

Publication date: 10 April 2024

Abstract

Text simplification seeks to improve readability while retaining the original content and meaning. Our study investigates whether pre-trained classifiers also maintain such coherence by comparing their predictions on both original and simplified inputs. We conduct experiments using 11 pre-trained models, including BERT and OpenAI's GPT 3.5, across six datasets spanning three languages. Additionally, we conduct a detailed analysis of the correlation between prediction change rates and simplification types/strengths. Our findings reveal alarming inconsistencies across all languages and models. If not promptly addressed, simplified inputs can be easily exploited to craft zero-iteration model-agnostic adversarial attacks with success rates of up to 50%.

Comment: Published at DeTermIt! Workshop at LREC-COLING 202
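The abstract's central quantity, the prediction change rate, can be illustrated with a minimal sketch. It assumes the rate is the fraction of original/simplified pairs for which a classifier's predicted label differs; the paper's exact definition may differ. The function name `prediction_change_rate` and the toy labels are hypothetical and are not taken from the paper.

```python
from typing import Hashable, Sequence


def prediction_change_rate(
    original_preds: Sequence[Hashable],
    simplified_preds: Sequence[Hashable],
) -> float:
    """Fraction of paired samples whose predicted label differs.

    Assumption: predictions are aligned, i.e. index i in both sequences
    refers to the same sample in its original and simplified form.
    """
    if len(original_preds) != len(simplified_preds):
        raise ValueError("prediction lists must be aligned pair by pair")
    if not original_preds:
        raise ValueError("need at least one prediction pair")
    # Count the pairs where the label changed after simplification.
    changed = sum(o != s for o, s in zip(original_preds, simplified_preds))
    return changed / len(original_preds)


# Toy labels, made up for illustration only (not data from the paper).
original = ["pos", "neg", "neg", "pos"]
simplified = ["pos", "pos", "neg", "neg"]
rate = prediction_change_rate(original, simplified)
```

Under this reading, a perfectly coherent classifier would score 0.0 on meaning-preserving simplifications, and a higher rate indicates more label flips.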

Available Versions

arXiv.org e-Print Archive

oai:arXiv.org:2404.06838

Last time updated on 24/11/2024