Benchmarking Robustness and Generalization in Multi-Agent Systems: A Case Study on Neural MMO

Chen, Jiaxin; Chen, Yangkun; He, J.; Liu, Shuai; Suarez, Joseph; Yu, Chenghui; Zhang, Yibing; Zhao, Liang; Zhu, Hengman

Authors: Jiaxin Chen, Yangkun Chen, J. He, Shuai Liu, Joseph Suarez, Chenghui Yu, Yibing Zhang, Liang Zhao, Hengman Zhu
Publication date: 1 January 2023

Abstract

We present the results of the second Neural MMO challenge, hosted at IJCAI 2022, which received 1600+ submissions. This competition targets robustness and generalization in multi-agent systems: participants train teams of agents to complete a multi-task objective against opponents not seen during training. We summarize the competition design and results and, taking our work as a case study, suggest that competitions are an effective approach to solving hard problems and establishing a solid benchmark for algorithms. We will open-source our benchmark, including the environment wrapper, baselines, a visualization tool, and selected policies, for further research.

Green Open Access added to TU Delft Institutional Repository ‘You share, we take care!’ – Taverne project https://www.openaccess.nl/en/you-share-we-take-care. Otherwise as indicated in the copyright section: the publisher is the copyright holder of this work and the author uses the Dutch legislation to make this work public.

Available Versions

TU Delft Repository

oai:tudelft.nl:uuid:311c668f-8...

Last time updated on 07/12/2023