Measuring the mixing of contextual information in the transformer

Ferrando Monsonís, Javier; Gallego Olsina, Gerard Ion; Ruiz Costa-Jussà, Marta

Authors: Javier Ferrando Monsonís, Gerard Ion Gallego Olsina, Marta Ruiz Costa-Jussà
Publication date: 1 January 2022
Publisher: Association for Computational Linguistics

Abstract

The Transformer architecture aggregates input information through the self-attention mechanism, but there is no clear understanding of how this information is mixed across the entire model. Additionally, recent works have demonstrated that attention weights alone are not enough to describe the flow of information. In this paper, we consider the whole attention block (multi-head attention, residual connection, and layer normalization) and define a metric to measure token-to-token interactions within each layer. Then, we aggregate layer-wise interpretations to provide input attribution scores for model predictions. Experimentally, we show that our method, ALTI (Aggregation of Layer-wise Token-to-token Interactions), provides more faithful explanations and greater robustness than gradient-based methods.

Javier Ferrando and Gerard I. Gállego are supported by the Spanish Ministerio de Ciencia e Innovación through the project PID2019-107579RB-I00 / AEI / 10.13039/501100011033.

Peer Reviewed

Postprint (published version)

Available Versions

arXiv.org e-Print Archive

oai:arXiv.org:2203.04212

Last time updated on 18/04/2022

UPCommons. Portal del coneixement obert de la UPC

oai:upcommons.upc.edu:2117/394...

Last time updated on 04/10/2023