How to Play in Infinite MDPs (Invited Talk)

Authors: Stefan Kiefer, Richard Mayr, Mahsa Shirmohammadi, Patrick Totzke, Dominik Wojtczak
Publication date: 01/01/2020

Markov decision processes (MDPs) are a standard model for dynamic systems that exhibit both stochastic and nondeterministic behavior. For MDPs with finite state space it is known that for a wide range of objectives there exist optimal strategies that are memoryless and deterministic. In contrast, if the state space is infinite, optimal strategies may not exist, and optimal or ε-optimal strategies may require (possibly infinite) memory. In this paper we consider qualitative objectives: reachability, safety, (co-)Büchi, and other parity objectives. We aim at giving an introduction to a collection of techniques that allow for the construction of strategies with little or no memory in countably infinite MDPs.
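
To make the finite-state claim in the abstract concrete, the following is a minimal sketch, not taken from the paper, of value iteration for a reachability objective on a toy finite MDP, followed by a greedy read-off of a memoryless deterministic strategy. The MDP, its state and action names, and the helper functions are all hypothetical and chosen only for illustration.

```python
from typing import Dict, List, Set, Tuple

# Hypothetical encoding: state -> action -> list of (probability, successor).
MDP = Dict[str, Dict[str, List[Tuple[float, str]]]]


def action_value(values: Dict[str, float], successors: List[Tuple[float, str]]) -> float:
    """Expected value of one action under the current value estimates."""
    return sum(p * values[t] for p, t in successors)


def reachability_values(mdp: MDP, target: Set[str], sweeps: int = 1000) -> Dict[str, float]:
    """Approximate the maximal probability of reaching `target` from each state."""
    # Start from 0 outside the target so the iteration approaches the values from below.
    values = {s: (1.0 if s in target else 0.0) for s in mdp}
    for _ in range(sweeps):
        for s in mdp:
            if s not in target:
                values[s] = max(action_value(values, succ) for succ in mdp[s].values())
    return values


def greedy_strategy(mdp: MDP, values: Dict[str, float]) -> Dict[str, str]:
    """Pick one fixed action per state: memoryless and deterministic."""
    return {
        s: max(mdp[s], key=lambda a: action_value(values, mdp[s][a]))
        for s in mdp
    }


# Toy MDP (an assumption for illustration): from s0 the controller can
# gamble once ("risky", "safe") or keep retrying ("retry").
toy: MDP = {
    "s0": {
        "risky": [(0.5, "goal"), (0.5, "sink")],
        "safe": [(0.9, "goal"), (0.1, "sink")],
        "retry": [(0.5, "goal"), (0.5, "s0")],
    },
    "goal": {"stay": [(1.0, "goal")]},
    "sink": {"stay": [(1.0, "sink")]},
}

values = reachability_values(toy, {"goal"})
strategy = greedy_strategy(toy, values)
```

In this toy example the action values at s0 are distinct, so the greedy choice is unambiguous. In general, when several actions tie at the optimal value, a greedy choice can pick an action that never makes progress toward the target, so extracting an optimal strategy needs more care than this sketch shows. The sketch covers only the finite-state case; as the abstract notes, in countably infinite MDPs optimal strategies may not exist at all.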

University of Liverpool Repository

HAL Descartes

Edinburgh Research Explorer

Dagstuhl Research Online Publication Server