UniMASK: Unified Inference in Sequential Decision Problems

Bignell, David; Carroll, Micah; Devlin, Sam; Dragan, Anca; Georgescu, Raluca; Hausknecht, Matthew; Hofmann, Katja; Lin, Jessy; Milani, Stephanie; Paradise, Orr; Sun, Mingfei

UniMASK: Unified Inference in Sequential Decision Problems

Authors: David Bignell
Micah Carroll
Sam Devlin
Anca Dragan
Raluca Georgescu
Matthew Hausknecht
Katja Hofmann
Jessy Lin
Stephanie Milani
Orr Paradise
Mingfei Sun
Publication date: 1 November 2022
Publisher

Abstract

Randomly masking and predicting word tokens has been a successful approach in pre-training language models for a variety of downstream tasks. In this work, we observe that the same idea also applies naturally to sequential decision-making, where many well-studied tasks like behavior cloning, offline reinforcement learning, inverse dynamics, and waypoint conditioning correspond to different sequence maskings over a sequence of states, actions, and returns. We introduce the UniMASK framework, which provides a unified way to specify models which can be trained on many different sequential decision-making tasks. We show that a single UniMASK model is often capable of carrying out many tasks with performance similar to or better than single-task models. Additionally, after fine-tuning, our UniMASK models consistently outperform comparable single-task models. Our code is publicly available at https://github.com/micahcarroll/uniMASK.Comment: NeurIPS 2022 (Oral). A prior version was published at an ICML Workshop, available at arXiv:2204.1332

Similar works

Full text

Available Versions

arXiv.org e-Print Archive

oai:arXiv.org:2211.10869

Last time updated on 24/12/2022

The University of Manchester - Institutional Repository

oai:pure.atira.dk:publications...

Last time updated on 22/06/2024

The University of Manchester - Institutional Repository

oai:pure.atira.dk:publications...

Last time updated on 18/12/2022