Search CORE

5,813 research outputs found

Decentralized Control of Partially Observable Markov Decision Processes using Belief Space Macro-actions

Author: Agha-mohammadi Ali-akbar
Amato Christopher
How Jonathan P.
Omidshafiei Shayegan
Publication venue
Publication date: 20/02/2015
Field of study

The focus of this paper is on solving multi-robot planning problems in continuous spaces with partial observability. Decentralized partially observable Markov decision processes (Dec-POMDPs) are general models for multi-robot coordination problems, but representing and solving Dec-POMDPs is often intractable for large problems. To allow for a high-level representation that is natural for multi-robot problems and scalable to large discrete and continuous problems, this paper extends the Dec-POMDP model to the decentralized partially observable semi-Markov decision process (Dec-POSMDP). The Dec-POSMDP formulation allows asynchronous decision-making by the robots, which is crucial in multi-robot domains. We also present an algorithm for solving this Dec-POSMDP which is much more scalable than previous methods since it can incorporate closed-loop belief space macro-actions in planning. These macro-actions are automatically constructed to produce robust solutions. The proposed method's performance is evaluated on a complex multi-robot package delivery problem under uncertainty, showing that our approach can naturally represent multi-robot problems and provide high-quality solutions for large-scale problems

arXiv.org e-Print Archive

CiteSeerX

DSpace@MIT

Crossref

How to Deploy a Wire with a Robotic Platform: Learning from Human Visual Demonstrations

Author: Michieletto Stefano
Pagello Enrico
Stival Francesca
Publication venue: 'Elsevier BV'
Publication date: 01/01/2017
Field of study

In this paper, we address the problem of deploying a wire along a specific path selected by an unskilled user. The robot has to learn the selected path and pass a wire through the peg table by using the same tool. The main contribution regards the hybrid use of Cartesian positions provided by a learning procedure and joint positions obtained by inverse kinematics and motion planning. Some constraints are introduced to deal with non-rigid material without breaks or knots. We took into account a series of metrics to evaluate the robot learning capabilities, all of them over performed the targets

Archivio istituzionale della ricerca - Università di Padova

Anytime Point-Based Approximations for Large POMDPs

Author: Gordon G.
Pineau J.
Thrun S.
Publication venue: 'AI Access Foundation'
Publication date: 04/10/2011
Field of study

The Partially Observable Markov Decision Process has long been recognized as a rich framework for real-world planning and control problems, especially in robotics. However exact solutions in this framework are typically computationally intractable for all but the smallest problems. A well-known technique for speeding up POMDP solving involves performing value backups at specific belief points, rather than over the entire belief simplex. The efficiency of this approach, however, depends greatly on the selection of points. This paper presents a set of novel techniques for selecting informative belief points which work well in practice. The point selection procedure is combined with point-based value backups to form an effective anytime POMDP algorithm called Point-Based Value Iteration (PBVI). The first aim of this paper is to introduce this algorithm and present a theoretical analysis justifying the choice of belief selection technique. The second aim of this paper is to provide a thorough empirical comparison between PBVI and other state-of-the-art POMDP methods, in particular the Perseus algorithm, in an effort to highlight their similarities and differences. Evaluation is performed using both standard POMDP domains and realistic robotic tasks

arXiv.org e-Print Archive

Crossref