Search CORE

171 research outputs found

Stochastic Finite State Control of POMDPs with LTL Specifications

Author: Ahmadi Mohamadreza
Burdick Joel W.
Sharan Rangoli
Publication venue
Publication date: 21/01/2020
Field of study

Partially observable Markov decision processes (POMDPs) provide a modeling framework for autonomous decision making under uncertainty and imperfect sensing, e.g. robot manipulation and self-driving cars. However, optimal control of POMDPs is notoriously intractable. This paper considers the quantitative problem of synthesizing sub-optimal stochastic finite state controllers (sFSCs) for POMDPs such that the probability of satisfying a set of high-level specifications in terms of linear temporal logic (LTL) formulae is maximized. We begin by casting the latter problem into an optimization and use relaxations based on the Poisson equation and McCormick envelopes. Then, we propose an stochastic bounded policy iteration algorithm, leading to a controlled growth in sFSC size and an any time algorithm, where the performance of the controller improves with successive iterations, but can be stopped by the user based on time or memory considerations. We illustrate the proposed method by a robot navigation case study

Qualitative Analysis of POMDPs with Temporal Logic Specifications for Robotics Applications

Author: Chatterjee Krishnendu
Chmelík Martin
Gupta Raghav
Kanodia Ayush
Publication venue
Publication date: 01/01/2015
Field of study

We consider partially observable Markov decision processes (POMDPs), that are a standard framework for robotics applications to model uncertainties present in the real world, with temporal logic specifications. All temporal logic specifications in linear-time temporal logic (LTL) can be expressed as parity objectives. We study the qualitative analysis problem for POMDPs with parity objectives that asks whether there is a controller (policy) to ensure that the objective holds with probability 1 (almost-surely). While the qualitative analysis of POMDPs with parity objectives is undecidable, recent results show that when restricted to finite-memory policies the problem is EXPTIME-complete. While the problem is intractable in theory, we present a practical approach to solve the qualitative analysis problem. We designed several heuristics to deal with the exponential complexity, and have used our implementation on a number of well-known POMDP examples for robotics applications. Our results provide the first practical approach to solve the qualitative analysis of robot motion planning with LTL properties in the presence of uncertainty

arXiv.org e-Print Archive

Crossref

IST PubRep

IST Austria: PubRep (Institute of Science and Technology)

Barrier Functions for Multiagent-POMDPs with DTL Specifications

Author: Ahmadi Mohamadreza
Ames Aaron D.
Burdick Joel W.
Singletary Andrew
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 18/03/2020
Field of study

Multi-agent partially observable Markov decision processes (MPOMDPs) provide a framework to represent heterogeneous autonomous agents subject to uncertainty and partial observation. In this paper, given a nominal policy provided by a human operator or a conventional planning method, we propose a technique based on barrier functions to design a minimally interfering safety-shield ensuring satisfaction of high-level specifications in terms of linear distribution temporal logic (LDTL). To this end, we use sufficient and necessary conditions for the invariance of a given set based on discrete-time barrier functions (DTBFs) and formulate sufficient conditions for finite time DTBF to study finite time convergence to a set. We then show that different LDTL mission/safety specifications can be cast as a set of invariance or finite time reachability problems. We demonstrate that the proposed method for safety-shield synthesis can be implemented online by a sequence of one-step greedy algorithms. We demonstrate the efficacy of the proposed method using experiments involving a team of robots

arXiv.org e-Print Archive

Crossref

Caltech Authors

Control of Probabilistic Systems under Dynamic, Partially Known Environments with Temporal Logic Specifications

Author: Frazzoli Emilio
Wongpiromsarn Tichakorn
Publication venue
Publication date: 01/01/2012
Field of study

We consider the synthesis of control policies for probabilistic systems, modeled by Markov decision processes, operating in partially known environments with temporal logic specifications. The environment is modeled by a set of Markov chains. Each Markov chain describes the behavior of the environment in each mode. The mode of the environment, however, is not known to the system. Two control objectives are considered: maximizing the expected probability and maximizing the worst-case probability that the system satisfies a given specification

arXiv.org e-Print Archive

CiteSeerX

DSpace@MIT

Crossref

IST Austria Technical Report

Author: Chatterjee Krishnendu
Chmelik Martin
Gupta Raghav
Kanodia Ayush
Publication venue: IST Austria
Publication date: 01/01/2014
Field of study

IST Austria: PubRep (Institute of Science and Technology)

IST Austria Technical Report

Author: Chatterjee Krishnendu
Chmelik Martin
Gupta Raghav
Kanodia Ayush
Publication venue: IST Austria
Publication date: 01/01/2014
Field of study

IST Austria: PubRep (Institute of Science and Technology)

Point-Based Methods for Model Checking in Partially Observable Markov Decision Processes

Author: Bouton Maxime
Kochenderfer Mykel J.
Tumova Jana
Publication venue
Publication date: 11/01/2020
Field of study

Autonomous systems are often required to operate in partially observable environments. They must reliably execute a specified objective even with incomplete information about the state of the environment. We propose a methodology to synthesize policies that satisfy a linear temporal logic formula in a partially observable Markov decision process (POMDP). By formulating a planning problem, we show how to use point-based value iteration methods to efficiently approximate the maximum probability of satisfying a desired logical formula and compute the associated belief state policy. We demonstrate that our method scales to large POMDP domains and provides strong bounds on the performance of the resulting policy.Comment: 8 pages, 3 figures, AAAI 202

arXiv.org e-Print Archive

Association for the Advancement of Artificial Intelligence: AAAI Publications