700 research outputs found

    Barrier Functions for Multiagent-POMDPs with DTL Specifications

    Multi-agent partially observable Markov decision processes (MPOMDPs) provide a framework to represent heterogeneous autonomous agents subject to uncertainty and partial observation. In this paper, given a nominal policy provided by a human operator or a conventional planning method, we propose a technique based on barrier functions to design a minimally interfering safety shield ensuring satisfaction of high-level specifications in terms of linear distribution temporal logic (LDTL). To this end, we use sufficient and necessary conditions for the invariance of a given set based on discrete-time barrier functions (DTBFs), and we formulate sufficient conditions for finite-time DTBFs to study finite-time convergence to a set. We then show that different LDTL mission/safety specifications can be cast as a set of invariance or finite-time reachability problems. We show that the proposed method for safety-shield synthesis can be implemented online by a sequence of one-step greedy algorithms, and we demonstrate its efficacy in experiments involving a team of robots.
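
    To make the shielding idea concrete, the sketch below shows one way a one-step greedy, minimally interfering shield based on a discrete-time barrier function over beliefs could look. It is only an illustration of the general technique under assumptions made here, not the paper's implementation: the belief update, the expected-decrease DTBF condition, and the names belief_update, barrier, alpha, and cost are all introduced for this example.

```python
# Illustrative sketch of a one-step greedy safety shield using a discrete-time
# barrier function (DTBF) over belief states.  The API and the exact DTBF
# condition used here are assumptions, not the paper's definitions.
import numpy as np

def belief_update(belief, action, obs, T, O):
    """Bayesian belief update for a (M)POMDP.

    T[a] is the |S|x|S| transition matrix for joint action a,
    O[a] is the |S|x|O| observation matrix for joint action a.
    """
    pred = belief @ T[action]                  # predict next-state distribution
    post = pred * O[action][:, obs]            # weight by observation likelihood
    return post / post.sum()

def dtbf_holds(belief, action, T, O, barrier, alpha):
    """Check a DTBF condition in expectation over observations:
       E[h(b')] - h(b) >= -alpha(h(b)),
    which keeps the belief inside the safe set {b : h(b) >= 0}."""
    pred = belief @ T[action]
    obs_probs = pred @ O[action]               # probability of each observation
    exp_h_next = sum(p * barrier(belief_update(belief, action, o, T, O))
                     for o, p in enumerate(obs_probs) if p > 0)
    h = barrier(belief)
    return exp_h_next - h >= -alpha(h)

def shield(belief, nominal_action, actions, T, O, barrier, alpha, cost):
    """Return the nominal action if it satisfies the DTBF condition; otherwise
    return the certified-safe action that deviates least from it."""
    if dtbf_holds(belief, nominal_action, T, O, barrier, alpha):
        return nominal_action
    safe = [a for a in actions if dtbf_holds(belief, a, T, O, barrier, alpha)]
    # fall back to the nominal action if no action can be certified
    return min(safe, key=lambda a: cost(a, nominal_action)) if safe else nominal_action
```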

    Anytime Guarantees for Reachability in Uncountable Markov Decision Processes

    We consider the problem of approximating the reachability probabilities in Markov decision processes (MDPs) with uncountable (continuous) state and action spaces. While there are algorithms that, for special classes of such MDPs, provide a sequence of approximations converging to the true value in the limit, our aim is to obtain an algorithm with guarantees on the precision of the approximation. As this problem is undecidable in general, assumptions on the MDP are necessary. Our main contribution is to identify sufficient assumptions that are as weak as possible, thus approaching the "boundary" of which systems can be correctly and reliably analyzed. To this end, we also argue why each of our assumptions is necessary for algorithms based on processing finitely many observations. We present two solution variants. The first one provides converging lower bounds under weaker assumptions than typical ones from previous works concerned with guarantees. The second one then utilizes stronger assumptions to additionally provide converging upper bounds. Altogether, we obtain an anytime algorithm, i.e., one yielding a sequence of approximants with known and iteratively improving precision, converging to the true value in the limit. Moreover, due to the generality of our assumptions, our algorithms are very general templates, readily allowing for various heuristics from the literature, in contrast to, e.g., a specific discretization algorithm. Our theoretical contribution thus paves the way for future practical improvements without sacrificing correctness guarantees.
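
    As a point of reference for what "converging lower and upper bounds" means operationally, here is a hedged sketch of the finite-state core that such approximation schemes refine: interval (bounded) value iteration for maximum reachability on a finite MDP. The dict-based data layout is an assumption for illustration, and the sketch omits end-component collapsing, without which the upper bound need not converge on every MDP.

```python
# Interval (bounded) value iteration for max reachability on a finite MDP.
# A simplified sketch: sound lower and upper bounds after every sweep.
def interval_value_iteration(states, actions, P, target, sweeps=1000):
    """P[s][a] is a list of (successor, probability) pairs, actions[s] the
    actions enabled in s (assumed non-empty for non-target states), and
    target a set of goal states."""
    L = {s: 1.0 if s in target else 0.0 for s in states}  # lower bounds
    U = {s: 1.0 for s in states}                          # upper bounds
    for _ in range(sweeps):
        for s in states:
            if s in target:
                continue
            L[s] = max(sum(p * L[t] for t, p in P[s][a]) for a in actions[s])
            U[s] = max(sum(p * U[t] for t, p in P[s][a]) for a in actions[s])
    return L, U  # the true value lies in [L[s], U[s]] for every state s
```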

    An Anytime Algorithm for Reachability on Uncountable MDP

    We provide an algorithm for reachability on Markov decision processes with uncountable state and action spaces which, under mild assumptions, approximates the optimal value to any desired precision. It is the first such anytime algorithm, meaning that at any point in time it can return the current approximation together with its precision. Moreover, it is the first algorithm able to utilize learning approaches without sacrificing guarantees, and it further allows for combination with existing heuristics.
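
    The sketch below illustrates, under assumptions made here for the sake of the example, how a learning heuristic can be plugged into such an anytime scheme without giving up guarantees: simulations are guided by the current upper bound, but the bounds themselves are only ever changed by sound Bellman backups, so the reported interval [L, U] always contains the true value. The function names and sampling strategy are not the paper's.

```python
# Hedged sketch (not the paper's algorithm): a BRTDP-style simulation loop in
# which exploration follows the optimistic upper bound, while L and U are only
# updated via sound Bellman backups.
import random

def bellman(s, actions, succ, V):
    # One backup of the bound V at s: maximise the expected bound over actions.
    return max(sum(p * V[t] for t, p in succ(s, a)) for a in actions(s))

def brtdp_like(s0, target, actions, succ, L, U, episodes=1000, horizon=100):
    """actions(s) lists enabled actions, succ(s, a) returns (successor, prob)
    pairs; L and U must be initialised with sound bounds (1.0 at targets)."""
    for _ in range(episodes):
        trace, s = [], s0
        for _ in range(horizon):
            if s in target or U[s] - L[s] < 1e-9:
                break
            # Learning heuristic: act greedily w.r.t. the upper bound ...
            a = max(actions(s), key=lambda b: sum(p * U[t] for t, p in succ(s, b)))
            trace.append(s)
            # ... and sample the successor with the largest weighted bound gap.
            pairs = succ(s, a)
            weights = [p * (U[t] - L[t]) for t, p in pairs]
            if sum(weights) == 0:
                break
            s = random.choices([t for t, _ in pairs], weights=weights)[0]
        for s in reversed(trace):  # only sound backups ever touch L and U
            L[s] = max(L[s], bellman(s, actions, succ, L))
            U[s] = min(U[s], bellman(s, actions, succ, U))
    return L, U
```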

    Efficient approximation of optimal control for continuous-time Markov games

    We study the time-bounded reachability problem for continuous-time Markov decision processes (CTMDPs) and games (CTMGs). Existing techniques for this problem use discretisation to partition time into intervals of size ε, and optimal control is approximated for each interval separately. Current techniques provide an accuracy of O(ε²) on each interval, which leads to an infeasibly large number of intervals. We propose a sequence of approximations that achieve accuracies of O(ε³), O(ε⁴), and O(ε⁵), allowing us to drastically reduce the number of intervals that need to be considered. For CTMDPs, the performance of the resulting algorithms is comparable to the heuristic approach given by Buchholz and Schulz, while also being theoretically justified. All of our results generalise to CTMGs, where they yield the first practically implementable algorithms for this problem. We also provide memoryless strategies for both players that achieve similar error bounds.
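
    For orientation, the sketch below shows the baseline that such work improves on, not the higher-order approximations proposed in the paper: a plain first-order discretisation of time-bounded maximum reachability for a CTMDP, whose per-interval error is on the order of ε². The rate-map data layout and parameter names are assumptions for illustration.

```python
# First-order (Euler) discretisation of time-bounded max reachability for a
# CTMDP.  Assumed layout: R[s][a] maps successor states to transition rates;
# eps must satisfy eps * max_exit_rate <= 1.
def ctmdp_reach_first_order(states, R, target, time_bound, eps):
    steps = int(time_bound / eps)
    V = {s: 1.0 if s in target else 0.0 for s in states}
    for _ in range(steps):                     # backward induction over intervals
        V_new = {}
        for s in states:
            if s in target:
                V_new[s] = 1.0
                continue
            best = 0.0
            for a, rates in R[s].items():
                exit_rate = sum(rates.values())
                # jump with probability ~ rate * eps within the interval,
                # otherwise stay put (first-order approximation)
                val = (1.0 - eps * exit_rate) * V[s] + eps * sum(
                    r * V[t] for t, r in rates.items())
                best = max(best, val)
            V_new[s] = best
        V = V_new
    return V  # V[s] approximates the max probability of reaching target in time
```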
