Shared Experience Actor-Critic for Multi-Agent Reinforcement Learning
Exploration in multi-agent reinforcement learning is a challenging problem,
especially in environments with sparse rewards. We propose a general method for
efficient exploration by sharing experience amongst agents. Our proposed
algorithm, called Shared Experience Actor-Critic (SEAC), applies experience
sharing in an actor-critic framework. We evaluate SEAC in a collection of
sparse-reward multi-agent environments and find that it consistently
outperforms two baselines and two state-of-the-art algorithms by learning in
fewer steps and converging to higher returns. In some harder environments,
experience sharing makes the difference between learning to solve the task and
not learning at all.
Comment: 34th Conference on Neural Information Processing Systems (NeurIPS 2020), Vancouver, Canada
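The experience-sharing idea admits a compact sketch: each agent updates its policy on its own trajectories as usual, and additionally on every other agent's trajectories, with an importance weight correcting for the fact that those actions were drawn from a different policy. The PyTorch snippet below is a minimal illustration of that update, not the authors' implementation; the discount factor, the `lam` coefficient, and all network and variable names are assumptions.

```python
import torch
import torch.nn as nn

class Agent(nn.Module):
    """One actor-critic agent with its own policy and value networks."""
    def __init__(self, obs_dim: int, n_actions: int, hidden: int = 64):
        super().__init__()
        self.actor = nn.Sequential(nn.Linear(obs_dim, hidden), nn.Tanh(),
                                   nn.Linear(hidden, n_actions))
        self.critic = nn.Sequential(nn.Linear(obs_dim, hidden), nn.Tanh(),
                                    nn.Linear(hidden, 1))

    def log_prob(self, obs, act):
        dist = torch.distributions.Categorical(logits=self.actor(obs))
        return dist.log_prob(act)

def seac_policy_loss(agents, batches, i, gamma=0.99, lam=1.0):
    """SEAC-style policy loss for agent i: its own on-policy term plus
    importance-weighted terms built from every other agent's experience.
    batches[k] holds tensors (obs, act, rew, next_obs, done) collected by agent k."""
    total = 0.0
    for k, (obs, act, rew, next_obs, done) in enumerate(batches):
        with torch.no_grad():
            td = rew + gamma * (1 - done) * agents[i].critic(next_obs).squeeze(-1) \
                 - agents[i].critic(obs).squeeze(-1)  # one-step advantage estimate
        logp_i = agents[i].log_prob(obs, act)
        if k == i:
            total = total - (logp_i * td).mean()  # standard actor-critic term
        else:
            with torch.no_grad():  # importance weight pi_i(a|o) / pi_k(a|o), detached
                ratio = torch.exp(logp_i - agents[k].log_prob(obs, act))
            total = total - lam * (ratio * logp_i * td).mean()  # shared-experience term
    return total
```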
Pareto Actor-Critic for Equilibrium Selection in Multi-Agent Reinforcement Learning
This work focuses on equilibrium selection in no-conflict multi-agent games,
where we specifically study the problem of selecting a Pareto-optimal
equilibrium among several existing equilibria. It has been shown that many
state-of-the-art multi-agent reinforcement learning (MARL) algorithms are prone
to converging to Pareto-dominated equilibria due to the uncertainty each agent
has about the policy of the other agents during training. To address
sub-optimal equilibrium selection, we propose Pareto Actor-Critic (Pareto-AC),
which is an actor-critic algorithm that utilises a simple property of
no-conflict games (a superset of cooperative games): the Pareto-optimal
equilibrium in a no-conflict game maximises the returns of all agents and
therefore is the preferred outcome for all agents. We evaluate Pareto-AC in a
diverse set of multi-agent games and show that it converges to higher episodic
returns compared to seven state-of-the-art MARL algorithms and that it
successfully converges to a Pareto-optimal equilibrium in a range of matrix
games. Finally, we propose PACDCG, a graph neural network extension of
Pareto-AC which is shown to efficiently scale in games with a large number of
agents.
Comment: 20 pages, 12 figures
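A toy example makes the no-conflict setting concrete. In a stag-hunt-style matrix game there are two pure Nash equilibria, but only one of them maximises the returns of both agents; the Pareto-dominated one is exactly the kind of outcome the abstract says standard MARL algorithms tend to settle on. The Python snippet below (an illustration, not one of the paper's evaluation games) enumerates the pure equilibria and picks the Pareto-optimal one.

```python
import numpy as np

# Payoff matrices for a 2x2 stag-hunt-style no-conflict game.
# Rows: agent 1's action, columns: agent 2's action.
R1 = np.array([[4, 1],
               [3, 2]])  # returns for agent 1
R2 = np.array([[4, 3],
               [1, 2]])  # returns for agent 2

def pure_nash_equilibria(R1, R2):
    """Return all pure-strategy Nash equilibria (a1, a2)."""
    eqs = []
    for a1 in range(R1.shape[0]):
        for a2 in range(R1.shape[1]):
            best1 = R1[a1, a2] >= R1[:, a2].max()  # agent 1 cannot improve
            best2 = R2[a1, a2] >= R2[a1, :].max()  # agent 2 cannot improve
            if best1 and best2:
                eqs.append((a1, a2))
    return eqs

eqs = pure_nash_equilibria(R1, R2)
print("Nash equilibria:", eqs)                # [(0, 0), (1, 1)]
# In a no-conflict game the Pareto-optimal equilibrium maximises both returns:
pareto = max(eqs, key=lambda e: (R1[e], R2[e]))
print("Pareto-optimal equilibrium:", pareto)  # (0, 0), with returns (4, 4)
```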
Learning Task Embeddings for Teamwork Adaptation in Multi-Agent Reinforcement Learning
Successful deployment of multi-agent reinforcement learning often requires
agents to adapt their behaviour. In this work, we discuss the problem of
teamwork adaptation in which a team of agents needs to adapt their policies to
solve novel tasks with limited fine-tuning. Motivated by the intuition that
agents need to be able to identify and distinguish tasks in order to adapt
their behaviour to the current task, we propose to learn multi-agent task
embeddings (MATE). These task embeddings are trained using an encoder-decoder
architecture optimised for reconstruction of the transition and reward
functions which uniquely identify tasks. We show that a team of agents is able
to adapt to novel tasks when provided with task embeddings. We propose three
MATE training paradigms: independent MATE, centralised MATE, and mixed MATE,
which vary in the information used for the task encoding. We show that the
embeddings learned by MATE identify tasks and provide useful information which
agents leverage during adaptation to novel tasks.
Comment: To be presented at the Seventh Workshop on Generalization in Planning at the NeurIPS 2023 conference
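The encoder-decoder training described here can be sketched as follows: an encoder compresses a window of transitions into a task embedding, and a decoder must predict rewards and next observations from (observation, action, embedding), so the embedding is forced to capture whatever distinguishes the task. The PyTorch sketch below is a minimal single-agent rendering of this idea, closest in spirit to the independent-MATE variant; all architecture choices and names are assumptions.

```python
import torch
import torch.nn as nn

class TaskEncoder(nn.Module):
    """Encodes a window of transitions into a fixed-size task embedding."""
    def __init__(self, obs_dim, act_dim, emb_dim=16, hidden=64):
        super().__init__()
        step_dim = obs_dim + act_dim + 1 + obs_dim  # one (o, a, r, o') step
        self.gru = nn.GRU(step_dim, hidden, batch_first=True)
        self.head = nn.Linear(hidden, emb_dim)

    def forward(self, transitions):  # (batch, time, step_dim)
        _, h = self.gru(transitions)
        return self.head(h[-1])      # (batch, emb_dim)

class TaskDecoder(nn.Module):
    """Predicts reward and next observation from (o, a, task embedding)."""
    def __init__(self, obs_dim, act_dim, emb_dim=16, hidden=64):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(obs_dim + act_dim + emb_dim, hidden), nn.ReLU(),
            nn.Linear(hidden, 1 + obs_dim))  # reward + next observation

    def forward(self, obs, act, z):
        out = self.net(torch.cat([obs, act, z], dim=-1))
        return out[..., :1], out[..., 1:]    # (reward_hat, next_obs_hat)

def mate_loss(encoder, decoder, transitions, obs, act, rew, next_obs):
    """Reconstruction loss: the embedding must identify the task well
    enough to predict its reward and transition functions."""
    z = encoder(transitions)
    rew_hat, next_obs_hat = decoder(obs, act, z)
    return nn.functional.mse_loss(rew_hat.squeeze(-1), rew) + \
           nn.functional.mse_loss(next_obs_hat, next_obs)
```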
Deep Reinforcement Learning for Multi-Agent Interaction
The development of autonomous agents which can interact with other agents to
accomplish a given task is a core area of research in artificial intelligence
and machine learning. Towards this goal, the Autonomous Agents Research Group
develops novel machine learning algorithms for autonomous systems control, with
a specific focus on deep reinforcement learning and multi-agent reinforcement
learning. Research problems include scalable learning of coordinated agent
policies and inter-agent communication; reasoning about the behaviours, goals,
and composition of other agents from limited observations; and sample-efficient
learning based on intrinsic motivation, curriculum learning, causal inference,
and representation learning. This article provides a broad overview of the
ongoing research portfolio of the group and discusses open problems for future
directions.
Comment: Published in AI Communications Special Issue on Multi-Agent Systems Research in the UK