Are Frequent Phrases Directly Retrieved like Idioms? An Investigation with Self-paced Reading and Language Models

Alessandro Lenci; Emmanuele Chersoni; Giulia Rambelli; Marco S. G. Senaldi; Philippe Blache

Publication date: 1 January 2023
Place of publication: Stroudsburg

Abstract

An open question in language comprehension studies is whether non-compositional multiword expressions like idioms and compositional-but-frequent word sequences are processed differently. Are the latter constructed online, or are they instead directly retrieved from the lexicon, with a degree of entrenchment depending on their frequency? In this paper, we address this question with two different methodologies. First, we set up a self-paced reading experiment comparing human reading times for idioms and for both high-frequency and low-frequency compositional word sequences. Then, we ran the same experiment using the Surprisal metrics computed with Neural Language Models (NLMs). Our results provide evidence that idiomatic and high-frequency compositional expressions are processed similarly by both humans and NLMs. Additional experiments were run to test the possible factors that could affect the NLMs' performance.
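For readers unfamiliar with the Surprisal metric mentioned in the abstract, the following is a minimal sketch of the standard definition: the surprisal of a word is the negative log probability a language model assigns to it given the preceding context. The function names, the choice of base-2 logarithms, and the summing of per-token values over a phrase are assumptions made for illustration; the abstract does not state how the authors aggregated scores, and the probabilities in the usage example are invented, not taken from the paper.

```python
import math


def surprisal(prob: float) -> float:
    """Surprisal in bits of a token whose conditional probability is `prob`."""
    if not 0.0 < prob <= 1.0:
        raise ValueError("probability must be in (0, 1]")
    return -math.log2(prob)


def phrase_surprisal(token_probs: list[float]) -> float:
    """Sum per-token surprisals over a phrase (an assumed aggregation)."""
    return sum(surprisal(p) for p in token_probs)


if __name__ == "__main__":
    # Invented conditional probabilities for a two-token phrase.
    print(phrase_surprisal([0.5, 0.25]))
```

In practice the conditional probabilities would come from a neural language model's output distribution rather than being supplied by hand; a lower phrase-level surprisal means the model finds the sequence more predictable.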

Available Versions

Archivio istituzionale della ricerca - Alma Mater Studiorum Università di Bologna

oai:cris.unibo.it:11585/925638

Last time updated on 20/09/2023