
1,885 research outputs found

Planning for Decentralized Control of Multiple Robots Under Uncertainty

Authors: Amato, Christopher; Cruz, Gabriel; How, Jonathan P.; Kaelbling, Leslie P.; Konidaris, George D.; Maynor, Christopher A.
Publication date: 12/02/2014

We describe a probabilistic framework for synthesizing control policies for general multi-robot systems, given environment and sensor models and a cost function. Decentralized, partially observable Markov decision processes (Dec-POMDPs) are a general model of decision processes where a team of agents must cooperate to optimize some objective (specified by a shared reward or cost function) in the presence of uncertainty, but where communication limitations mean that the agents cannot share their state, so execution must proceed in a decentralized fashion. While Dec-POMDPs are typically intractable to solve for real-world problems, recent research on the use of macro-actions in Dec-POMDPs has significantly increased the size of problems that can be practically solved as a Dec-POMDP. We describe this general model, and show how, in contrast to most existing methods that are specialized to a particular problem class, it can synthesize control policies that use whatever opportunities for coordination are present in the problem, while balancing uncertainty in outcomes, sensor information, and information about other agents. We use three variations on a warehouse task to show that a single planner of this type can generate cooperative behavior using task allocation, direct communication, and signaling, as appropriate.

arXiv.org e-Print Archive

CiteSeerX

DSpace@MIT

Crossref

Qualitative Analysis of POMDPs with Temporal Logic Specifications for Robotics Applications

Authors: Chatterjee, Krishnendu; Chmelík, Martin; Gupta, Raghav; Kanodia, Ayush
Publication date: 01/01/2015

We consider partially observable Markov decision processes (POMDPs), which are a standard framework for robotics applications to model uncertainties present in the real world, with temporal logic specifications. All temporal logic specifications in linear-time temporal logic (LTL) can be expressed as parity objectives. We study the qualitative analysis problem for POMDPs with parity objectives, which asks whether there is a controller (policy) that ensures the objective holds with probability 1 (almost-surely). While the qualitative analysis of POMDPs with parity objectives is undecidable, recent results show that when restricted to finite-memory policies the problem is EXPTIME-complete. While the problem is intractable in theory, we present a practical approach to solve the qualitative analysis problem. We designed several heuristics to deal with the exponential complexity, and have used our implementation on a number of well-known POMDP examples for robotics applications. Our results provide the first practical approach to solve the qualitative analysis of robot motion planning with LTL properties in the presence of uncertainty.

arXiv.org e-Print Archive

Crossref

IST PubRep

IST Austria: PubRep (Institute of Science and Technology)

Differentiable Algorithm Networks for Composable Robot Learning

Authors: Hsu, David; Kaelbling, Leslie Pack; Karkus, Peter; Lee, Wee Sun; Lozano-Perez, Tomas; Ma, Xiao
Publication date: 28/05/2019

This paper introduces the Differentiable Algorithm Network (DAN), a composable architecture for robot learning systems. A DAN is composed of neural network modules, each encoding a differentiable robot algorithm and an associated model; and it is trained end-to-end from data. DAN combines the strengths of model-driven modular system design and data-driven end-to-end learning. The algorithms and models act as structural assumptions to reduce the data requirements for learning; end-to-end learning allows the modules to adapt to one another and compensate for imperfect models and algorithms, in order to achieve the best overall system performance. We illustrate the DAN methodology through a case study on a simulated robot system, which learns to navigate in complex 3-D environments with only local visual observations and an image of a partially correct 2-D floor map.

Comment: RSS 2019 camera ready. Video is available at https://youtu.be/4jcYlTSJF4

arXiv.org e-Print Archive

DSpace@MIT