Deciding the Value 1 Problem for #-acyclic Partially Observable Markov Decision Processes

Gimbert, Hugo; Oualhadj, Youssouf

research

Deciding the Value 1 Problem for #-acyclic Partially Observable Markov Decision Processes

Authors: Hugo Gimbert
Youssouf Oualhadj
Publication date: 18 July 2012
Publisher: HAL CCSD

Abstract

The value 1 problem is a natural decision problem in algorithmic game theory. For partially observable Markov decision processes with reachability objective, this problem is defined as follows: are there strategies that achieve the reachability objective with probability arbitrarily close to 1? This problem was shown undecidable recently. Our contribution is to introduce a class of partially observable Markov decision processes, namely #-acyclic partially observable Markov decision processes, for which the value 1 problem is decidable. Our algorithm is based on the construction of a two-player perfect information game, called the knowledge game, abstracting the behaviour of a #-acyclic partially observable Markov decision process M such that the first player has a winning strategy in the knowledge game if and only if the value of M is 1

Similar works

Full text

Available Versions

HAL AMU

oai:HAL:hal-00743137v3

Last time updated on 11/11/2016