Search CORE

3 research outputs found

Optimal and Approximate Q-value Functions for Decentralized POMDPs

Author: Oliehoek Frans A.
Spaan Matthijs T. J.
Vlassis Nikos
Publication venue: 'AI Access Foundation'
Publication date: 01/01/2008
Field of study

Decision-theoretic planning is a popular approach to sequential decision making problems, because it treats uncertainty in sensing and acting in a principled way. In single-agent frameworks like MDPs and POMDPs, planning can be carried out by resorting to Q-value functions: an optimal Q-value function Q* is computed in a recursive manner by dynamic programming, and then an optimal policy is extracted from Q*. In this paper we study whether similar Q-value functions can be defined for decentralized POMDP models (Dec-POMDPs), and how policies can be extracted from such value functions. We define two forms of the optimal Q-value function for Dec-POMDPs: one that gives a normative description as the Q-value function of an optimal pure joint policy and another one that is sequentially rational and thus gives a recipe for computation. This computation, however, is infeasible for all but the smallest problems. Therefore, we analyze various approximate Q-value functions that allow for efficient computation. We describe how they relate, and we prove that they all provide an upper bound to the optimal Q-value function Q*. Finally, unifying some previous approaches for solving Dec-POMDPs, we describe a family of algorithms for extracting policies from such Q-value functions, and perform an experimental evaluation on existing test problems, including a new firefighting benchmark problem

arXiv.org e-Print Archive

CiteSeerX

University of Liverpool Repository

Crossref

Open Repository and Bibliography - Luxembourg

UvA-DARE

International Migration, Integration and Social Cohesion online publications

A Cross-Entropy Approach to Solving Dec-POMDPs

Author: Kooij J.F.P.
Oliehoek F.A.
Vlassis N.
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2008
Field of study

International Migration, Integration and Social Cohesion online publications

A Cross-Entropy Approach to Solving Dec-POMDPs

Author: C.V. Goldman
D. Koller
D.S. Bernstein
M.T.J. Spaan
P.-T. de Boer
Publication venue
Publication date: 01/01/2008
Field of study

In this paper we focus on distributed multiagent planning under uncertainty. For single-agent planning under uncertainty, the partially observable Markov decision process (POMDP) is the dominant model (see [Spaan and Vlassis, 2005] and references therein). Recently, several generalizations of th

CiteSeerX

Crossref

International Migration, Integration and Social Cohesion online publications