
    Synchronizing Objectives for Markov Decision Processes

    We introduce synchronizing objectives for Markov decision processes (MDP). Intuitively, a synchronizing objective requires that eventually, at every step there is a state which concentrates almost all the probability mass. In particular, it implies that the probabilistic system behaves in the long run like a deterministic system: eventually, the current state of the MDP can be identified with almost certainty. We study the problem of deciding the existence of a strategy to enforce a synchronizing objective in MDPs. We show that the problem is decidable for general strategies, as well as for blind strategies where the player cannot observe the current state of the MDP. We also show that pure strategies are sufficient, but memory may be necessary. Comment: In Proceedings iWIGP 2011, arXiv:1102.374
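    A minimal illustration of the definition (not from the paper): the Python sketch below fixes a pure memoryless strategy in a toy three-state MDP and tracks, step by step, the largest probability mass carried by a single state; under a synchronizing strategy these values eventually stay close to 1. All transition numbers, the strategy, and the helper name max_mass_sequence are illustrative assumptions.

```python
# A toy illustration (assumption, not the paper's construction): evolve the
# state distribution of a 3-state MDP under a fixed pure memoryless strategy
# and report the largest single-state probability at each step.
import numpy as np

# transitions[a][s] is the successor distribution when action a is played in state s.
transitions = {
    0: np.array([[0.0, 1.0, 0.0],
                 [0.0, 0.0, 1.0],
                 [0.0, 0.0, 1.0]]),
    1: np.array([[1.0, 0.0, 0.0],
                 [0.5, 0.5, 0.0],
                 [0.0, 0.0, 1.0]]),
}
strategy = [0, 0, 0]  # pure memoryless strategy: one action per state

def max_mass_sequence(initial, steps):
    """Largest single-state probability at each of the first `steps` steps."""
    dist = np.array(initial, dtype=float)
    masses = []
    for _ in range(steps):
        masses.append(dist.max())
        # One-step update: in each state s, play strategy[s].
        dist = sum(dist[s] * transitions[strategy[s]][s] for s in range(len(dist)))
    return masses

# Starting from the uniform distribution, all mass flows into the absorbing
# state 2, so the printed values approach 1 and the objective is met.
print(max_mass_sequence([1/3, 1/3, 1/3], 8))
```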

    Limit Synchronization in Markov Decision Processes

    Markov decision processes (MDP) are finite-state systems with both strategic and probabilistic choices. After fixing a strategy, an MDP produces a sequence of probability distributions over states. The sequence is eventually synchronizing if the probability mass accumulates in a single state, possibly in the limit. Precisely, for 0 <= p <= 1 the sequence is p-synchronizing if a probability distribution in the sequence assigns probability at least p to some state, and we distinguish three synchronization modes: (i) sure winning if there exists a strategy that produces a 1-synchronizing sequence; (ii) almost-sure winning if there exists a strategy that produces a sequence that is, for all epsilon > 0, a (1-epsilon)-synchronizing sequence; (iii) limit-sure winning if for all epsilon > 0, there exists a strategy that produces a (1-epsilon)-synchronizing sequence. We consider the problem of deciding whether an MDP is sure, almost-sure, or limit-sure winning, and we establish the decidability and optimal complexity for all modes, as well as the memory requirements for winning strategies. Our main contributions are as follows: (a) for each winning mode we present characterizations that give a PSPACE complexity for the decision problems, and we establish matching PSPACE lower bounds; (b) we show that for sure winning strategies, exponential memory is sufficient and may be necessary, and that in general infinite memory is necessary for almost-sure winning, and unbounded memory is necessary for limit-sure winning; (c) along with our results, we establish new complexity results for alternating finite automata over a one-letter alphabet.
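    For a fixed strategy, the modes above can be read off the sequence of maximal single-state probabilities. The sketch below only illustrates the definition of p-synchronization on assumed toy numbers; it is not the paper's PSPACE decision procedure, which additionally quantifies over all strategies.

```python
# A sketch of the definition only (assumption, not the paper's decision procedure):
# a sequence of distributions is p-synchronizing if some distribution in it puts
# probability at least p on a single state.
def is_p_synchronizing(max_masses, p):
    """max_masses[i] is the largest single-state probability of the i-th distribution."""
    return any(m >= p for m in max_masses)

# Illustrative per-step maximal masses produced by one fixed strategy.
masses = [1/3, 2/3, 1.0, 1.0, 1.0]
print(is_p_synchronizing(masses, 1.0))        # 1-synchronizing: witnesses sure winning
print(all(is_p_synchronizing(masses, 1 - e)   # (1-eps)-synchronizing for sampled eps,
          for e in (0.1, 0.01, 0.001)))       # as almost-sure winning requires for all eps
```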

    Infinite Synchronizing Words for Probabilistic Automata (Erratum)

    In [1], we introduced the weakly synchronizing languages for probabilistic automata. In this report, we show that the emptiness problem of weakly synchronizing languages for probabilistic automata is undecidable. This implies that the decidability result of [1-3] for the emptiness problem of weakly synchronizing languages is incorrect. Comment: 5 pages, 3 figures

    Multiple verification in computational modeling of bone pathologies

    We introduce a model checking approach to diagnose the emergence of bone pathologies. The implementation of a new model of bone remodeling in PRISM has led to an interesting characterization of osteoporosis as a defective bone remodeling dynamics with respect to other bone pathologies. Our approach allows us to derive three types of model-checking-based diagnostic estimators. The first diagnostic measure focuses on the level of bone mineral density, which is currently used in medical practice. In addition, we have introduced a novel diagnostic estimator which uses the full patient clinical record, here simulated using the modeling framework. This estimator detects rapid (over months) negative changes in bone mineral density. Independently of the actual bone mineral density, when the decrease occurs rapidly it is important to alert the patient and monitor him/her more closely to detect the onset of other bone co-morbidities. A third estimator takes into account the variance of the bone density, which could support the investigation of metabolic syndromes, diabetes and cancer. Our implementation could make use of different logical combinations of these statistical estimators and could incorporate other biomarkers for further systemic co-morbidities (for example diabetes and thalassemia). We are delighted to report that the combination of stochastic modeling with formal methods motivates a new diagnostic framework for complex pathologies. In particular, our approach takes into consideration important properties of biosystems such as multiscale organization and self-adaptiveness. The multi-diagnosis could be further expanded, inching towards the complexity of human diseases. Finally, we briefly introduce self-adaptiveness in formal methods, which is a key property of the regulative mechanisms of biological systems and is well known in other mathematical and engineering areas. Comment: In Proceedings CompMod 2011, arXiv:1109.104
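    To make the three estimators concrete, here is a small sketch that evaluates them on a simulated monthly bone mineral density record. The thresholds, units and data are illustrative assumptions and are not taken from the paper or from clinical guidelines.

```python
# Illustrative sketch only (thresholds, units and data are assumptions, not from
# the paper): the three diagnostic estimators on a simulated monthly BMD record.
import statistics

def diagnose(bmd_series, low_bmd=0.8, drop_per_month=0.02, var_limit=0.001):
    """bmd_series: monthly bone mineral density values (g/cm^2), oldest first."""
    level_alarm = bmd_series[-1] < low_bmd                        # estimator 1: current BMD level
    drops = [a - b for a, b in zip(bmd_series, bmd_series[1:])]
    rapid_alarm = any(d > drop_per_month for d in drops)          # estimator 2: rapid negative change
    variance_alarm = statistics.variance(bmd_series) > var_limit  # estimator 3: variability of BMD
    return {"low_level": level_alarm, "rapid_loss": rapid_alarm, "high_variance": variance_alarm}

print(diagnose([0.95, 0.94, 0.90, 0.89, 0.85]))
```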

    Distributed Synthesis in Continuous Time

    We introduce a formalism modelling communication of distributed agents strictly in continuous time. Within this framework, we study the problem of synthesising local strategies for individual agents such that a specified set of goal states is reached, or reached with at least a given probability. The flow of time is modelled explicitly based on continuous-time randomness, with two natural implications: first, the non-determinism stemming from interleaving disappears; second, when we restrict to a subclass of non-urgent models, the quantitative value problem for two players can be solved in EXPTIME. Indeed, the explicit continuous time enables players to communicate their states by delaying synchronisation (which is unrestricted for non-urgent models). In general, the problems are undecidable already for two players in the quantitative case and for three players in the qualitative case. The qualitative undecidability is shown by a reduction to decentralized POMDPs, for which we provide the strongest (and rather surprising) undecidability result so far.

    Quantitative Timed Analysis of Interactive Markov Chains

    This paper presents new algorithms and accompanying tool support for analyzing interactive Markov chains (IMCs), a stochastic timed 1½-player game in which delays are exponentially distributed. IMCs are compositional and act as a semantic model for engineering formalisms such as AADL and dynamic fault trees. We provide algorithms for determining the extremal expected time of reaching a set of states, and the long-run average of time spent in a set of states. The prototypical tool Imca supports these algorithms as well as the synthesis of ε-optimal piecewise constant timed policies for timed reachability objectives. Two case studies show the feasibility and scalability of the algorithms.
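    For intuition, the expected-time objective has a simple characterization in the deterministic special case. The sketch below assumes a CTMC (an IMC without nondeterministic choices) with made-up rates and solves the corresponding linear system; it is not the algorithm implemented in Imca, which additionally handles nondeterminism and computes extremal values.

```python
# Sketch for the deterministic special case (assumption): in a CTMC, the expected
# time T(s) to reach a goal set G satisfies
#   T(s) = 1/E(s) + sum_{s'} P(s, s') * T(s')  for s not in G,   T(s) = 0 on G,
# with E the exit rates and P the embedded jump probabilities. Numbers are made up.
import numpy as np

exit_rate = np.array([2.0, 3.0, 1.0])   # exponential exit rates E(s)
P = np.array([[0.0, 0.7, 0.3],          # embedded jump probabilities P(s, s')
              [0.4, 0.0, 0.6],
              [0.0, 0.0, 1.0]])
goal = {2}

non_goal = [s for s in range(len(exit_rate)) if s not in goal]
A = np.eye(len(non_goal)) - P[np.ix_(non_goal, non_goal)]  # I - P restricted to non-goal
b = 1.0 / exit_rate[non_goal]                              # expected sojourn times
T = np.linalg.solve(A, b)
print(dict(zip(non_goal, T)))  # expected time to reach state 2 from states 0 and 1
```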

    A PAC Learning Algorithm for LTL and Omega-regular Objectives in MDPs

    Linear temporal logic (LTL) and omega-regular objectives -- a superset of LTL -- have seen recent use as a way to express non-Markovian objectives in reinforcement learning. We introduce a model-based probably approximately correct (PAC) learning algorithm for omega-regular objectives in Markov decision processes. Unlike prior approaches, our algorithm learns from sampled trajectories of the system and does not require prior knowledge of the system's topology.
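    The model-based ingredient can be sketched in a few lines: estimate the unknown transition probabilities by counting over sampled trajectories. The trajectory format and names below are illustrative assumptions, and the sketch omits the parts specific to the paper's algorithm (the omega-regular objective and the PAC confidence bounds).

```python
# Sketch of the model-based idea only (format and names are assumptions, not the
# paper's algorithm): estimate transition probabilities from sampled trajectories.
from collections import Counter, defaultdict

def estimate_model(trajectories):
    """trajectories: iterable of lists of (state, action, next_state) triples."""
    counts = defaultdict(Counter)
    for traj in trajectories:
        for s, a, s_next in traj:
            counts[(s, a)][s_next] += 1
    return {(s, a): {t: n / sum(c.values()) for t, n in c.items()}
            for (s, a), c in counts.items()}

# Two short sampled trajectories from an unknown two-state MDP.
trajs = [[("s0", "a", "s1"), ("s1", "b", "s0")],
         [("s0", "a", "s0"), ("s0", "a", "s1")]]
print(estimate_model(trajs))  # empirical transition probabilities per (state, action)
```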