Search CORE

51 research outputs found

Qualitative Reachability in Stochastic BPA Games

Author: Abdulla
Antonín Kučera
Athreya
Baeten
Baier
Brázdil
Chatterjee
Chatterjee
Chatterjee
Condon
de Alfaro
Esparza
Etessami
Etessami
Etessami
Etessami
Fagin
Harris
Hopcroft
Jan Obdržálek
Maitra
Manning
Martin
Tarski
Thomas
Tomáš Brázdil
Vieille
Václav Brožek
Walukiewicz
Publication venue: 'Elsevier BV'
Publication date: 01/01/2009
Field of study

We consider a class of infinite-state stochastic games generated by stateless pushdown automata (or, equivalently, 1-exit recursive state machines), where the winning objective is specified by a regular set of target configurations and a qualitative probability constraint `>0' or `=1'. The goal of one player is to maximize the probability of reaching the target set so that the constraint is satisfied, while the other player aims at the opposite. We show that the winner in such games can be determined in PTIME for the `>0' constraint, and both in NP and coNP for the `=1' constraint. Further, we prove that the winning regions for both players are regular, and we design algorithms which compute the associated finite-state automata. Finally, we show that winning strategies can be synthesized effectively.Comment: Submitted to Information and Computation. 48 pages, 3 figure

arXiv.org e-Print Archive

Elsevier - Publisher Connector

Crossref

Dagstuhl Research Online Publication Server

Optimal Strategies in Infinite-state Stochastic Reachability Games

Author: A. Condon
D. A. Martin
Giovanna D'Agostino
H. Gimbert
J. Esparza
K. Etessami
M. L. Puterman
N. Berger
Salvatore La Torre
T. Brázdil
T. Brázdil
T. Brázdil
T. Brázdil
T. Brázdil
T. Brázdil
Václav Brožek
Publication venue: 'Open Publishing Association'
Publication date: 01/06/2011
Field of study

We consider perfect-information reachability stochastic games for 2 players on infinite graphs. We identify a subclass of such games, and prove two interesting properties of it: first, Player Max always has optimal strategies in games from this subclass, and second, these games are strongly determined. The subclass is defined by the property that the set of all values can only have one accumulation point -- 0. Our results nicely mirror recent results for finitely-branching games, where, on the contrary, Player Min always has optimal strategies. However, our proof methods are substantially different, because the roles of the players are not symmetric. We also do not restrict the branching of the games. Finally, we apply our results in the context of recently studied One-Counter stochastic games

arXiv.org e-Print Archive

Crossref

Directory of Open Access Journals

Qualitative Analysis of Concurrent Mean-payoff Games

Author: Chatterjee Krishnendu
Ibsen-Jensen Rasmus
Publication venue
Publication date: 18/09/2014
Field of study

We consider concurrent games played by two-players on a finite-state graph, where in every round the players simultaneously choose a move, and the current state along with the joint moves determine the successor state. We study a fundamental objective, namely, mean-payoff objective, where a reward is associated to each transition, and the goal of player 1 is to maximize the long-run average of the rewards, and the objective of player 2 is strictly the opposite. The path constraint for player 1 could be qualitative, i.e., the mean-payoff is the maximal reward, or arbitrarily close to it; or quantitative, i.e., a given threshold between the minimal and maximal reward. We consider the computation of the almost-sure (resp. positive) winning sets, where player 1 can ensure that the path constraint is satisfied with probability 1 (resp. positive probability). Our main results for qualitative path constraints are as follows: (1) we establish qualitative determinacy results that show that for every state either player 1 has a strategy to ensure almost-sure (resp. positive) winning against all player-2 strategies, or player 2 has a spoiling strategy to falsify almost-sure (resp. positive) winning against all player-1 strategies; (2) we present optimal strategy complexity results that precisely characterize the classes of strategies required for almost-sure and positive winning for both players; and (3) we present quadratic time algorithms to compute the almost-sure and the positive winning sets, matching the best known bound of algorithms for much simpler problems (such as reachability objectives). For quantitative constraints we show that a polynomial time solution for the almost-sure or the positive winning set would imply a solution to a long-standing open problem (the value problem for turn-based deterministic mean-payoff games) that is not known to be solvable in polynomial time

arXiv.org e-Print Archive

CiteSeerX

University of Liverpool Repository

IST PubRep

IST Austria: PubRep (Institute of Science and Technology)

Solving Stochastic B\"uchi Games on Infinite Arenas with a Finite Attractor

Author: Abdulla
Abdulla
Abdulla
Abdulla
Arnold
Baier
Baier
Baier
Baier
Baier
Barceló
Bertrand
Bouajjani
Bouyer
Bouyer
Bradfield
Brázdil
Brázdil
Brázdil
Brázdil
Chambart
Chatterjee
Chatterjee
Chen
Condon
Etessami
Herbert Wiklicky
Jančar
Karandikar
Kesten
Konev
Kouzmin
Luca Bortolussi
Martin
Muscholl
Nathalie Bertrand
Philippe Schnoebelen
Purushothaman Iyer
Puterman
Rabinovich
Schmitz
Schnoebelen
Shapley
Zielonka
Publication venue: 'Open Publishing Association'
Publication date: 01/03/2013
Field of study

We consider games played on an infinite probabilistic arena where the first player aims at satisfying generalized B\"uchi objectives almost surely, i.e., with probability one. We provide a fixpoint characterization of the winning sets and associated winning strategies in the case where the arena satisfies the finite-attractor property. From this we directly deduce the decidability of these games on probabilistic lossy channel systems.Comment: In Proceedings QAPL 2013, arXiv:1306.241

arXiv.org e-Print Archive

HAL-CentraleSupelec

Crossref

INRIA a CCSD electronic archive server

Directory of Open Access Journals

HAL-Rennes 1

Approximating the Termination Value of One-Counter MDPs and Stochastic Games

Author: G.R. Grimmett
J. Lambert
K. Etessami
K. Etessami
L.B. White
M.L. Puterman
T. Brázdil
Publication venue
Publication date: 01/01/2011
Field of study

One-counter MDPs (OC-MDPs) and one-counter simple stochastic games (OC-SSGs) are 1-player, and 2-player turn-based zero-sum, stochastic games played on the transition graph of classic one-counter automata (equivalently, pushdown automata with a 1-letter stack alphabet). A key objective for the analysis and verification of these games is the termination objective, where the players aim to maximize (minimize, respectively) the probability of hitting counter value 0, starting at a given control state and given counter value. Recently, we studied qualitative decision problems ("is the optimal termination value = 1?") for OC-MDPs (and OC-SSGs) and showed them to be decidable in P-time (in NP and coNP, respectively). However, quantitative decision and approximation problems ("is the optimal termination value ? p", or "approximate the termination value within epsilon") are far more challenging. This is so in part because optimal strategies may not exist, and because even when they do exist they can have a highly non-trivial structure. It thus remained open even whether any of these quantitative termination problems are computable. In this paper we show that all quantitative approximation problems for the termination value for OC-MDPs and OC-SSGs are computable. Specifically, given a OC-SSG, and given epsilon > 0, we can compute a value v that approximates the value of the OC-SSG termination game within additive error epsilon, and furthermore we can compute epsilon-optimal strategies for both players in the game. A key ingredient in our proofs is a subtle martingale, derived from solving certain LPs that we can associate with a maximizing OC-MDP. An application of Azuma's inequality on these martingales yields a computable bound for the "wealth" at which a "rich person's strategy" becomes epsilon-optimal for OC-MDPs.Comment: 35 pages, 1 figure, full version of a paper presented at ICALP 2011, invited for submission to Information and Computatio

arXiv.org e-Print Archive

Crossref

Edinburgh Research Explorer

Greatest Fixed Points of Probabilistic Min/Max Polynomial Equations, and Reachability for Branching Markov Decision Processes?

Author: I Bozic
J Esparza
J Esparza
JG Reiter
K Etessami
K Etessami
R Bonnet
T Brázdil
T Brázdil
T Chen
U Rothblum
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2015
Field of study

We give polynomial time algorithms for quantitative (and qualitative) reachability analysis for Branching Markov Decision Processes (BMDPs). Specifically, given a BMDP, and given an initial population, where the objective of the controller is to maximize (or minimize) the probability of eventually reaching a population that contains an object of a desired (or undesired) type, we give algorithms for approximating the supremum (infimum) reachability probability, within desired precision epsilon > 0, in time polynomial in the encoding size of the BMDP and in log(1/epsilon). We furthermore give P-time algorithms for computing epsilon-optimal strategies for both maximization and minimization of reachability probabilities. We also give P-time algorithms for all associated qualitative analysis problems, namely: deciding whether the optimal (supremum or infimum) reachability probabilities are 0 or 1. Prior to this paper, approximation of optimal reachability probabilities for BMDPs was not even known to be decidable. Our algorithms exploit the following basic fact: we show that for any BMDP, its maximum (minimum) non-reachability probabilities are given by the greatest fixed point (GFP) solution g* in [0,1]^n of a corresponding monotone max (min) Probabilistic Polynomial System of equations (max/min-PPS), x=P(x), which are the Bellman optimality equations for a BMDP with non-reachability objectives. We show how to compute the GFP of max/min PPSs to desired precision in P-time. We also study more general Branching Simple Stochastic Games (BSSGs) with (non-)reachability objectives. We show that: (1) the value of these games is captured by the GFP of a corresponding max-minPPS; (2) the quantitative problem of approximating the value is in TFNP; and (3) the qualitative problems associated with the value are all solvable in P-time

arXiv.org e-Print Archive

Crossref

Edinburgh Research Explorer

Taming denumerable Markov decision processes with decisiveness

Author: Bertrand Nathalie
Bouyer Patricia
Brihaye Thomas
Fournier Paulin
Publication venue
Publication date: 24/08/2020
Field of study

Decisiveness has proven to be an elegant concept for denumerable Markov chains: it is general enough to encompass several natural classes of denumerable Markov chains, and is a sufficient condition for simple qualitative and approximate quantitative model checking algorithms to exist. In this paper, we explore how to extend the notion of decisiveness to Markov decision processes. Compared to Markov chains, the extra non-determinism can be resolved in an adversarial or cooperative way, yielding two natural notions of decisiveness. We then explore whether these notions yield model checking procedures concerning the infimum and supremum probabilities of reachability properties

arXiv.org e-Print Archive

HAL-CentraleSupelec

INRIA a CCSD electronic archive server

HAL-Rennes 1

IST Austria Technical Report

Author: Chatterjee Krishnendu
Ibsen-Jensen Rasmus
Publication venue: IST Austria
Publication date: 01/01/2013
Field of study

We consider concurrent games played by two-players on a finite state graph, where in every round the players simultaneously choose a move, and the current state along with the joint moves determine the successor state. We study the most fundamental objective for concurrent games, namely, mean-payoff or limit-average objective, where a reward is associated to every transition, and the goal of player 1 is to maximize the long-run average of the rewards, and the objective of player 2 is strictly the opposite (i.e., the games are zero-sum). The path constraint for player 1 could be qualitative, i.e., the mean-payoff is the maximal reward, or arbitrarily close to it; or quantitative, i.e., a given threshold between the minimal and maximal reward. We consider the computation of the almost-sure (resp. positive) winning sets, where player 1 can ensure that the path constraint is satisfied with probability 1 (resp. positive probability). Almost-sure winning with qualitative constraint exactly corresponds to the question whether there exists a strategy to ensure that the payoff is the maximal reward of the game. Our main results for qualitative path constraints are as follows: (1) we establish qualitative determinacy results that show for every state either player 1 has a strategy to ensure almost-sure (resp. positive) winning against all player-2 strategies or player 2 has a spoiling strategy to falsify almost-sure (resp. positive) winning against all player-1 strategies; (2) we present optimal strategy complexity results that precisely characterize the classes of strategies required for almost-sure and positive winning for both players; and (3) we present quadratic time algorithms to compute the almost-sure and the positive winning sets, matching the best known bound of the algorithms for much simpler problems (such as reachability objectives). For quantitative constraints we show that a polynomial time solution for the almost-sure or the positive winning set would imply a solution to a long-standing open problem (of solving the value problem of mean-payoff games) that is not known to be in polynomial time

IST Austria: PubRep (Institute of Science and Technology)

Analyzing probabilistic pushdown automata

Author: A Bouajjani
A Tarski
A Tarski
Antonín Kučera
C Baier
D Grigoriev
D Williams
E Allender
E Emerson
H Hansson
I Walukiewicz
J Canny
J Esparza
J Esparza
J Esparza
J Esparza
J Hopcroft
J Rosenthal
Javier Esparza
K Athreya
K Chung
K Etessami
K Etessami
K Etessami
K Etessami
K Etessami
K Etessami
L Bozzelli
R Alur
R Alur
R Mayr
S Kiefer
Stefan Kiefer
T Brázdil
T Brázdil
T Brázdil
T Brázdil
T Brázdil
T Brázdil
T Brázdil
T Brázdil
T Brázdil
T Brázdil
T Brázdil
T Harris
Tomáš Brázdil
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2012
Field of study

The paper gives a summary of the existing results about algorithmic analysis of probabilistic pushdown automata and their subclasses.V článku je podán přehled známých výsledků o pravděpodobnostních zásobníkových automatech a některých jejich podtřídách

CiteSeerX

Crossref

Univerzitní repozitář Masarykovy univerzity