Search CORE

2 research outputs found

A utility-based analysis of equilibria in multi-objective normal form games

Author: Mannion Patrick
Nowé Ann
Roijers Diederik M.
Rădulescu Roxana
Zhang Yijie
Publication venue: 'Cambridge University Press (CUP)'
Publication date: 17/01/2020
Field of study

In multi-objective multi-agent systems (MOMAS), agents explicitly consider the possible tradeoffs between conflicting objective functions. We argue that compromises between competing objectives in MOMAS should be analysed on the basis of the utility that these compromises have for the users of a system, where an agent's utility function maps their payoff vectors to scalar utility values. This utility-based approach naturally leads to two different optimisation criteria for agents in a MOMAS: expected scalarised returns (ESR) and scalarised expected returns (SER). In this article, we explore the differences between these two criteria using the framework of multi-objective normal form games (MONFGs). We demonstrate that the choice of optimisation criterion (ESR or SER) can radically alter the set of equilibria in a MONFG when non-linear utility functions are used.Comment: Under review since 16 January 202

arXiv.org e-Print Archive

Deep Reinforcement Learning for Autonomous Driving: A Survey

Author: Kiran B Ravi
Mannion Patrick
Pérez Patrick
Sallab Ahmad A. Al
Sobh Ibrahim
Talpaert Victor
Yogamani Senthil
Publication venue
Publication date: 23/01/2021
Field of study

With the development of deep representation learning, the domain of reinforcement learning (RL) has become a powerful learning framework now capable of learning complex policies in high dimensional environments. This review summarises deep reinforcement learning (DRL) algorithms and provides a taxonomy of automated driving tasks where (D)RL methods have been employed, while addressing key computational challenges in real world deployment of autonomous driving agents. It also delineates adjacent domains such as behavior cloning, imitation learning, inverse reinforcement learning that are related but are not classical RL algorithms. The role of simulators in training agents, methods to validate, test and robustify existing solutions in RL are discussed.Comment: Accepted for publication at IEEE Transactions on Intelligent Transportation System

arXiv.org e-Print Archive