Search CORE

7 research outputs found

Incremental Temporal Logic Synthesis of Control Policies for Robots Interacting with Dynamic Agents

Author: Belta Calin
Frazzoli Emilio
Rus Daniela
Ulusoy Alphan
Wongpiromsarn Tichakorn
Publication venue
Publication date: 01/01/2012
Field of study

We consider the synthesis of control policies from temporal logic specifications for robots that interact with multiple dynamic environment agents. Each environment agent is modeled by a Markov chain whereas the robot is modeled by a finite transition system (in the deterministic case) or Markov decision process (in the stochastic case). Existing results in probabilistic verification are adapted to solve the synthesis problem. To partially address the state explosion issue, we propose an incremental approach where only a small subset of environment agents is incorporated in the synthesis procedure initially and more agents are successively added until we hit the constraints on computational resources. Our algorithm runs in an anytime fashion where the probability that the robot satisfies its specification increases as the algorithm progresses

arXiv.org e-Print Archive

CiteSeerX

DSpace@MIT

Crossref

Boston University Institutional Repository (OpenBU)

The Complexity of Graph-Based Reductions for Reachability in Markov Decision Processes

Author: AL Strehl
C Baier
C Courcoubetis
C Dehnert
Krishnendu Chatterjee
L Valiant
LP Kaelbling
M Kwiatkowska
M Steinmetz
ML Puterman
N Fijalkow
PR D’Argenio
S Fortune
SJ Russell
T Brázdil
T Eilam-Tzoreff
Publication venue
Publication date: 01/01/2018
Field of study

We study the never-worse relation (NWR) for Markov decision processes with an infinite-horizon reachability objective. A state q is never worse than a state p if the maximal probability of reaching the target set of states from p is at most the same value from q, regard- less of the probabilities labelling the transitions. Extremal-probability states, end components, and essential states are all special cases of the equivalence relation induced by the NWR. Using the NWR, states in the same equivalence class can be collapsed. Then, actions leading to sub- optimal states can be removed. We show the natural decision problem associated to computing the NWR is coNP-complete. Finally, we ex- tend a previously known incomplete polynomial-time iterative algorithm to under-approximate the NWR

arXiv.org e-Print Archive

Crossref

Institutional Repository Universiteit Antwerpen

DI-fusion

Reduction Techniques for Model Checking and Learning in MDPs

Author: Bharadwaj Suda
Perez Guillermo A.
Roux Stephane Le
Topcu Ufuk
Publication venue: 'International Joint Conferences on Artificial Intelligence'
Publication date: 01/01/2017
Field of study

info:eu-repo/semantics/publishe

Crossref

Institutional Repository Universiteit Antwerpen

DI-fusion

Ensuring the Reliability of Your Model Checker::Interval Iteration for Markov Decision Processes

Author: A Bell
A Bianco
C Baier
C Baier
C Courcoubetis
DP Bertsekas
EM Hahn
H Hansson
I Chades
J-P Katoen
L Alfaro de
M Kwiatkowska
M Puterman
ML Puterman
P Dai
R Bellman
R Howard
S Giro
S Haddad
T Brázdil
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/07/2017
Field of study

Crossref

University of Birmingham Research Portal

Maximizing the Conditional Expected Reward for Reaching the Goal

Author: C Acerbi
C Baier
C Baier
C Baier
DP Bertsekas
F Gretz
G Barthe
G Seber
J-P Katoen
K Chatterjee
K Chatzikokolakis
L Alfaro
L Kallenberg
M Kwiatkowska
M Randour
ME Andrés
ME Andrés
ML Puterman
MS Alvim
T Brázdil
Publication venue
Publication date: 19/01/2017
Field of study

The paper addresses the problem of computing maximal conditional expected accumulated rewards until reaching a target state (briefly called maximal conditional expectations) in finite-state Markov decision processes where the condition is given as a reachability constraint. Conditional expectations of this type can, e.g., stand for the maximal expected termination time of probabilistic programs with non-determinism, under the condition that the program eventually terminates, or for the worst-case expected penalty to be paid, assuming that at least three deadlines are missed. The main results of the paper are (i) a polynomial-time algorithm to check the finiteness of maximal conditional expectations, (ii) PSPACE-completeness for the threshold problem in acyclic Markov decision processes where the task is to check whether the maximal conditional expectation exceeds a given threshold, (iii) a pseudo-polynomial-time algorithm for the threshold problem in the general (cyclic) case, and (iv) an exponential-time algorithm for computing the maximal conditional expectation and an optimal scheduler.Comment: 103 pages, extended version with appendices of a paper accepted at TACAS 201

arXiv.org e-Print Archive

Crossref

Design of Approaches for Dependability and Initial Prototypes

Author: Bertolino Antonia
Calabro Antonello
Chiaradonna Silvano
Costa Gabriele
Di Giandomenico Felicita
Di Marco Antinisca
Fusani Mario
Grandoni Fabrizio
Issarny Valerie
Kwiatkowska Marta
Marcheti Eda
Martinelli Fabio
Martinucci Marco
Masci Paolo
Matteucci Ilaria
Qu Hongyang
Saadi Rachid
Sabetta Antonino
Vaccarelli Anna
Publication venue: HAL CCSD
Publication date: 18/02/2011
Field of study

The aim of CONNECT is to achieve universal interoperability between heterogeneous Networked Systems. For this, the non-functional properties required at each side of the connection going to be established must be fulfilled. By the one inclusive term "CONNECTability" we comprehend properties belonging to all four non-functional concerns of interest for CONNECT, namely dependability, performance, security and trust. We model such properties in conformance with a meta-model which establishes the relevant concepts and their relations. Then, building on the conceptual models proposed in the first year in Deliverable D5.1, in this document we present the approaches developed for assuring CONNECTability both at synthesis time and at runtime. The contributions include: the Dependability&Performance analysis Enabler, for which we release a modular architecture supporting stochastic verification and state-based analysis; incremental verification and event-based monitoring for runtime analysis; a model-based approach to interoperable trust management; the Security-by-Contract-with-Trust framework, which guarantees and enforces the expected trust levels and security policies

INRIA a CCSD electronic archive server