10 research outputs found

Bounded Approximations for Linear Multi-Objective Planning under Uncertainty (Extended Abstract)

Author: Oliehoek Frans A
Roijers Diederik
Scharpff Joris
Spaan Matthijs
Weerdt Mathijs De
Whiteson Shimon
Publication date: 01/11/2014

University of Liverpool Repository

Can multiple contractors self-regulate their joint service delivery? A serious gaming experiment on road maintenance planning

Author: de Weerdt Mathijs M.
Scharpff Joris
Schraven Daan
Spaan Matthijs T.J.
Volker Leentje
Publication date: 01/02/2021

University of Twente Research Information

Solving Multi-agent MDPs Optimally with Conditional Return Graphs

Author: Oliehoek Frans A
Roijers Diederik M
Scharpff Joris
Spaan Matthijs TJ
Weerdt Mathijs de
Publication date: 01/01/2015

In cooperative multi-agent sequential decision making under uncertainty, agents must coordinate in order to find an optimal joint policy that maximises joint value. Typical solution algorithms exploit additive structure in the value function, but in the fully-observable multi-agent MDP setting (MMDP) such structure is not present. We propose a new optimal solver for so-called TI-MMDPs, where agents can only affect their local state, while their value may depend on the state of others. We decompose the returns into local returns per agent that we represent compactly in a conditional return graph (CRG). Using CRGs, the value of a joint policy as well as bounds on the value of partially specified joint policies can be efficiently computed. We propose CoRe, a novel branch-and-bound policy search algorithm building on CRGs. CoRe typically requires less runtime than the available alternatives and is able to find solutions to problems previously considered unsolvable.

University of Liverpool Repository

International Migration, Integration and Social Cohesion online publications

UvA-DARE

Solving Transition-Independent Multi-agent MDPs with Sparse Interactions (Extended version)

Author: Oliehoek Frans A
Roijers Diederik M
Scharpff Joris
Spaan Matthijs TJ
Weerdt Mathijs M de
Publication date: 29/11/2015

In cooperative multi-agent sequential decision making under uncertainty, agents must coordinate to find an optimal joint policy that maximises joint value. Typical algorithms exploit additive structure in the value function, but in the fully-observable multi-agent MDP setting (MMDP) such structure is not present. We propose a new optimal solver for transition-independent MMDPs, in which agents can only affect their own state but their reward depends on joint transitions. We represent these dependencies compactly in conditional return graphs (CRGs). Using CRGs, the value of a joint policy and the bounds on partially specified joint policies can be efficiently computed. We propose CoRe, a novel branch-and-bound policy search algorithm building on CRGs. CoRe typically requires less runtime than the available alternatives and finds solutions to previously unsolvable problems.

arXiv.org e-Print Archive

University of Liverpool Repository

Bounded Approximations for Linear Multi-Objective Planning Under Uncertainty

Author: de Weerdt Mathijs
Intelligence Assoc Advancement Artificial
Oliehoek Frans A
Roijers Diederik M
Scharpff Joris
Spaan Matthijs TJ
Whiteson Shimon
Publication date: 01/01/2014

Planning under uncertainty poses a complex problem in which multiple objectives often need to be balanced. When dealing with multiple objectives, it is often assumed that the relative importance of the objectives is known a priori. However, in practice human decision makers often find it hard to specify such preferences, and would prefer a decision support system that presents a range of possible alternatives. We propose two algorithms for computing these alternatives for the case of linearly weighted objectives. First, we propose an anytime method, approximate optimistic linear support (AOLS), that incrementally builds up a complete set of ε-optimal plans, exploiting the piecewise-linear and convex shape of the value function. Second, we propose an approximate anytime method, scalarised sample incremental improvement (SSII), that employs weight sampling to focus on the most interesting regions in weight space, as suggested by a prior over preferences. We show empirically that our methods are able to produce (near-)optimal alternative sets orders of magnitude faster than existing techniques.

University of Liverpool Repository

CiteSeerX

International Migration, Integration and Social Cohesion online publications

Association for the Advancement of Artificial Intelligence: AAAI Publications

UvA-DARE

Bounded approximations for linear multi-objective planning under uncertainty.

Author: Diederik M Roijers
Frans A Oliehoek
Joris Scharpff
Mathijs M De Weerdt
Matthijs T J Spaan
Shimon Whiteson
Publication date: 01/01/2014

Planning under uncertainty poses a complex problem in which multiple objectives often need to be balanced. When dealing with multiple objectives, it is often assumed that the relative importance of the objectives is known a priori. However, in practice human decision makers often find it hard to specify such preferences exactly, and would prefer a decision support system that presents a range of possible alternatives. We propose two algorithms for computing these alternatives for the case of linearly weighted objectives. First, we propose an anytime method, approximate optimistic linear support (AOLS), that incrementally builds up a complete set of ε-optimal plans, exploiting the piecewise-linear and convex shape of the value function. Second, we propose an approximate anytime method, scalarised sample incremental improvement (SSII), that employs weight sampling to focus on the most interesting regions in weight space, as suggested by a prior over preferences. We show empirically that our methods are able to produce (near-)optimal alternative sets orders of magnitude faster than existing techniques, thereby demonstrating that our methods provide sensible approximations in stochastic multi-objective domains.

CiteSeerX

Dynamic Mechanism Design for Efficient Planning under Uncertainty

Author: Dr Matthijs Mathijs De Weerdt
Dr Scharpff
Ir Joris
Spaan
Publication date: 02/04/2020

CiteSeerX

Solving Transition-Independent Multi-agent MDPs with Sparse Interactions

Author: AAAI
de Weerdt Mathijs M
Oliehoek Frans A
Roijers Diederik M
Scharpff Joris
Spaan Matthijs TJ
Publication venue: American Association for Artificial Intelligence (AAAI)
Publication date: 01/01/2016

In cooperative multi-agent sequential decision making under uncertainty, agents must coordinate to find an optimal joint policy that maximises joint value. Typical algorithms exploit additive structure in the value function, but in the fully-observable multi-agent MDP (MMDP) setting such structure is not present. We propose a new optimal solver for transition-independent MMDPs, in which agents can only affect their own state but their reward depends on joint transitions. We represent these dependencies compactly in conditional return graphs (CRGs). Using CRGs, the value of a joint policy and the bounds on partially specified joint policies can be efficiently computed. We propose CoRe, a novel branch-and-bound policy search algorithm building on CRGs. CoRe typically requires less runtime than the available alternatives and finds solutions to previously unsolvable problems.

University of Liverpool Repository

TU Delft Repository

International Migration, Integration and Social Cohesion online publications

UvA-DARE

Association for the Advancement of Artificial Intelligence: AAAI Publications