Summarising and Comparing Agent Dynamics with Contrastive Spatiotemporal Abstraction

Bewley, Tom; Lawry, Jonathan; Richards, Arthur

Summarising and Comparing Agent Dynamics with Contrastive Spatiotemporal Abstraction

Authors: Tom Bewley
Jonathan Lawry
Arthur Richards
Publication date: 17 January 2022
Publisher

Abstract

We introduce a data-driven, model-agnostic technique for generating a human-interpretable summary of the salient points of contrast within an evolving dynamical system, such as the learning process of a control agent. It involves the aggregation of transition data along both spatial and temporal dimensions according to an information-theoretic divergence measure. A practical algorithm is outlined for continuous state spaces, and deployed to summarise the learning histories of deep reinforcement learning agents with the aid of graphical and textual communication methods. We expect our method to be complementary to existing techniques in the realm of agent interpretability

Similar works

Full text

Open in the Core reader

Download PDF

Available Versions

arXiv.org e-Print Archive

oai:arXiv.org:2201.07749

Last time updated on 24/03/2022

Supporting member

Explore Bristol Research

oai:research-information.bris....

Last time updated on 03/05/2024