Lost in the Middle: How Language Models Use Long Contexts

Bevilacqua, Michele; Hewitt, John; Liang, Percy; Lin, Kevin; Liu, Nelson F.; Paranjape, Ashwin; Petroni, Fabio

Authors: Michele Bevilacqua, John Hewitt, Percy Liang, Kevin Lin, Nelson F. Liu, Ashwin Paranjape, Fabio Petroni
Publication date: 6 July 2023

Abstract

While recent language models have the ability to take long contexts as input, relatively little is known about how well the language models use longer context. We analyze language model performance on two tasks that require identifying relevant information within their input contexts: multi-document question answering and key-value retrieval. We find that performance is often highest when relevant information occurs at the beginning or end of the input context, and significantly degrades when models must access relevant information in the middle of long contexts. Furthermore, performance substantially decreases as the input context grows longer, even for explicitly long-context models. Our analysis provides a better understanding of how language models use their input context and provides new evaluation protocols for future long-context models.

Comment: 15 pages, 17 figures
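
The key-value retrieval setup mentioned in the abstract can be illustrated with a short sketch. This is not the authors' code: the prompt wording, the function names `build_kv_prompt` and `accuracy_by_position`, the use of seeded random UUIDs as keys and values, and the `answer_fn` callable (a hypothetical stand-in for a call to a language model) are all assumptions made for illustration. The idea is only to show how the position of the relevant pair can be varied while everything else is held fixed.

```python
import json
import random
import uuid


def build_kv_prompt(num_pairs, target_index, seed=0):
    """Return (prompt, expected_value) with the queried key at target_index.

    Hypothetical helper: the prompt wording is an assumption, not the
    wording used in the paper.
    """
    if not 0 <= target_index < num_pairs:
        raise ValueError("target_index must be in [0, num_pairs)")
    rng = random.Random(seed)
    # Seeded 128-bit values keep the synthetic keys and values reproducible.
    pairs = [
        (str(uuid.UUID(int=rng.getrandbits(128))),
         str(uuid.UUID(int=rng.getrandbits(128))))
        for _ in range(num_pairs)
    ]
    target_key, expected_value = pairs[target_index]
    # dict and json.dumps both preserve insertion order, so the queried
    # pair sits exactly at target_index inside the serialized context.
    context = json.dumps(dict(pairs), indent=1)
    prompt = (
        "Extract the value corresponding to the specified key "
        "in the JSON object below.\n\n"
        + context
        + f'\n\nKey: "{target_key}"\nCorresponding value:'
    )
    return prompt, expected_value


def accuracy_by_position(answer_fn, num_pairs, positions, trials=5):
    """Measure how often answer_fn returns the expected value per position.

    answer_fn is a hypothetical callable mapping a prompt string to the
    model's text output.
    """
    results = {}
    for pos in positions:
        correct = 0
        for trial in range(trials):
            prompt, expected = build_kv_prompt(num_pairs, pos, seed=trial)
            if expected in answer_fn(prompt):
                correct += 1
        results[pos] = correct / trials
    return results
```

Calling `accuracy_by_position(answer_fn, 75, [0, 37, 74])` with a real model behind `answer_fn` would give one accuracy per position; a dip at the middle position relative to the first and last would correspond to the pattern the abstract describes.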

Available Versions

arXiv.org e-Print Archive

oai:arXiv.org:2307.03172

Last updated on 08/07/2023