Search CORE

910 research outputs found

Deterministic Priority Mean-payoff Games as Limits of Discounted Games

Author: A. Hordijk
D. Blackwell
E.A. Emerson
H. Gimbert
J. Vöge
J.F. Mertens
L. Alfaro de
L.S. Shapley
M. Jurdziński
T.E.S. Raghavan
U. Zwick
Publication venue: Springer, Berlin
Publication date: 01/01/2006
Field of study

International audienceInspired by the paper of de Alfaro, Henzinger and Majumdar about discounted

\mu

-calculus we show new surprising links between parity games and different classes of discounted games

Blackwell-Optimal Strategies in Priority Mean-Payoff Games

Author: A. Hordijk
A.N. Shiryayev
Angelo Montanari
D. Blackwell
D.A. Martin
Daniel W. Stroock
H. Björklund
H. Gimbert
H. Gimbert
H. Gimbert
H. Gimbert
Hugo Gimbert
Hugo Gimbert
J.F. Mertens
L. de Alfaro
L. S. Shapley
Margherita Napoli
Mimmo Parente
Wiesław Zielonka
Publication venue: 'Open Publishing Association'
Publication date: 01/01/2010
Field of study

We examine perfect information stochastic mean-payoff games - a class of games containing as special sub-classes the usual mean-payoff games and parity games. We show that deterministic memoryless strategies that are optimal for discounted games with state-dependent discount factors close to 1 are optimal for priority mean-payoff games establishing a strong link between these two classes

arXiv.org e-Print Archive

CiteSeerX

Crossref

Directory of Open Access Journals

Applying Blackwell optimality: priority mean-payoff games as limits of multi-discounted game

Author: Gimbert Hugo
Zielonka Wieslaw
Publication venue: HAL CCSD
Publication date: 25/12/2007
Field of study

International audienceWe define and examine priority mean-payoff games - a natural extension of parity games. By adapting the notion of Blackwell optimality borrowed from the theory of Markov decision processes we show that priority mean-payoff games can be seen as a limit of special multi-discounted games

Hal-Diderot

HAL-Polytechnique

Two-Player Perfect-Information Shift-Invariant Submixing Stochastic Games Are Half-Positional

Author: Gimbert Hugo
Kelmendi Edon
Publication venue
Publication date: 08/10/2015
Field of study

We consider zero-sum stochastic games with perfect information and finitely many states and actions. The payoff is computed by a payoff function which associates to each infinite sequence of states and actions a real number. We prove that if the the payoff function is both shift-invariant and submixing, then the game is half-positional, i.e. the first player has an optimal strategy which is both deterministic and stationary. This result relies on the existence of

\epsilon

-subgame-perfect equilibria in shift-invariant games, a second contribution of the paper

arXiv.org e-Print Archive

Queen Mary Research Online

Playing in stochastic environment: from multi-armed bandits to two-player games

Author: Zielonka Wieslaw
Publication venue: LIPIcs - Leibniz International Proceedings in Informatics. IARCS Annual Conference on Foundations of Software Technology and Theoretical Computer Science (FSTTCS 2010)
Publication date: 01/01/2010
Field of study

Given a zero-sum infinite game we examine the question if players have optimal memoryless deterministic strategies. It turns out that under some general conditions the problem for two-player games can be reduced to the same problem for one-player games which in turn can be reduced to a simpler related problem for multi-armed bandits

Dagstuhl Research Online Publication Server

Perfect Information Stochastic Priority Games

Author: A. Hordijk
A. McIver
A. McIver
D. Blackwell
E. Emerson
H. Gimbert
J. Filar
L. Alfaro de
L. Alfaro de
L.S. Shapley
M.J. Osborne
W. Zielonka
Publication venue: HAL CCSD
Publication date: 01/01/2007
Field of study

International audienceWe introduce stochastic priority games - a new class of perfect information stochastic games. These games can take two different, but equivalent, forms. In stopping priority games a play can be stopped by the environment after a finite number of stages, however, infinite plays are also possible. In discounted priority games only infinite plays are possible and the payoff is a linear combination of the classical discount payoff and of a limit payoff evaluating the performance at infinity. Shapley games and parity games are special extreme cases of priority games

Crossref

HAL Descartes

Hal-Diderot

Continuous positional payoffs

Author: Kozachinskiy Alexander
Publication venue: Schloss Dagstuhl – Leibniz-Zentrum für Informatik GmbH
Publication date: 01/01/2021
Field of study

What payoffs are positionally determined for deterministic two-player antagonistic games on finite directed graphs? In this paper we study this question for payoffs that are continuous. The main reason why continuous positionally determined payoffs are interesting is that they include the multi-discounted payoffs. We show that for continuous payoffs positional determinacy is equivalent to a simple property called prefix-monotonicity. We provide three proofs of it, using three major techniques of establishing positional determinacy - inductive technique, fixed point technique and strategy improvement technique. A combination of these approaches provides us with better understanding of the structure of continuous positionally determined payoffs as well as with some algorithmic results

arXiv.org e-Print Archive

Episciences.org

Directory of Open Access Journals

Dagstuhl Research Online Publication Server

Warwick Research Archives Portal Repository

Markov Decision Processes with Multiple Long-run Average Objectives

Author: Antonín Kučera
Krishnendu Chatterjee
Stephan Kreutzer
Tomá Brázdil
Vojtěch Forejt
Václav Broek
Publication venue: 'Logical Methods in Computer Science e.V.'
Publication date: 01/01/2011
Field of study

We study Markov decision processes (MDPs) with multiple limit-average (or mean-payoff) functions. We consider two different objectives, namely, expectation and satisfaction objectives. Given an MDP with k limit-average functions, in the expectation objective the goal is to maximize the expected limit-average value, and in the satisfaction objective the goal is to maximize the probability of runs such that the limit-average value stays above a given vector. We show that under the expectation objective, in contrast to the case of one limit-average function, both randomization and memory are necessary for strategies even for epsilon-approximation, and that finite-memory randomized strategies are sufficient for achieving Pareto optimal values. Under the satisfaction objective, in contrast to the case of one limit-average function, infinite memory is necessary for strategies achieving a specific value (i.e. randomized finite-memory strategies are not sufficient), whereas memoryless randomized strategies are sufficient for epsilon-approximation, for all epsilon>0. We further prove that the decision problems for both expectation and satisfaction objectives can be solved in polynomial time and the trade-off curve (Pareto curve) can be epsilon-approximated in time polynomial in the size of the MDP and 1/epsilon, and exponential in the number of limit-average functions, for all epsilon>0. Our analysis also reveals flaws in previous work for MDPs with multiple mean-payoff functions under the expectation objective, corrects the flaws, and allows us to obtain improved results

arXiv.org e-Print Archive

Crossref

Episciences.org

Directory of Open Access Journals

Oxford University Research Archive

IST Austria: PubRep (Institute of Science and Technology)