Search CORE

8 research outputs found

ConTaCT: Deciding to Communicate during Time-Critical Collaborative Tasks in Unknown, Deterministic Domains

Author: Shah Julie A.
Unhelkar Vaibhav Vasant
Publication venue: 'Association for the Advancement of Artificial Intelligence (AAAI)'
Publication date: 20/01/2016
Field of study

Communication between agents has the potential to improve team performance of collaborative tasks. However, communication is not free in most domains, requiring agents to reason about the costs and benefits of sharing information. In this work, we develop an online, decentralized communication policy, ConTaCT, that enables agents to decide whether or not to communicate during time-critical collaborative tasks in unknown, deterministic environments. Our approach is motivated by real-world applications, including the coordination of disaster response and search and rescue teams. These settings motivate a model structure that explicitly represents the world model as initially unknown but deterministic in nature, and that de-emphasizes uncertainty about action outcomes. Simulated experiments are conducted in which ConTaCT is compared to other multi-agent communication policies, and results indicate that ConTaCT achieves comparable task performance while substantially reducing communication overhead

CiteSeerX

DSpace@MIT

Association for the Advancement of Artificial Intelligence: AAAI Publications

Decentralized multi-robot cooperation with auctioned pomdps

Author: Aníbal Ollero
Jesús Capitán
Luis Merino
Matthijs T J Spaan
Publication venue
Publication date: 01/01/2013
Field of study

ABSTRACT Planning under uncertainty faces a scalability problem when considering multi-robot teams, as the information space scales exponentially with the number of robots. To address this issue, this paper proposes to decentralize multi-agent Partially Observable Markov Decision Process (POMDPs) while maintaining cooperation between robots by using POMDP policy auctions. Auctions provide a flexible way of coordinating individual policies modeled by POMDPs and have low communication requirements. Additionally, communication models in the multi-agent POMDP literature severely mismatch with real inter-robot communication. We address this issue by applying a decentralized data fusion method in order to efficiently maintain a joint belief state among the robots. The paper focuses on a cooperative tracking application, in which several robots have to jointly track a moving target of interest. The proposed ideas are illustrated in real multi-robot experiments, showcasing the flexible and robust coordination that our techniques can provide

CiteSeerX

Association for the Advancement of Artificial Intelligence: AAAI Publications

Winning back the CUP for Distributed POMDPs: Planning over continuous belief spaces

Author: Nair Ranjit
Tambe Milind
VARAKANTHAM Pradeep
Yokoo Makoto
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 01/01/2006
Field of study

continuous belief space

CiteSeerX

Crossref

Institutional Knowledge at Singapore Management University

Towards efficient planning for real world partially observable domains

Author: Makoto Yokoo
Manuela Veloso
Milind Tambe (chairperson
Pradeep Varakantham
Stacy Marsella
Sven Koenig
Publication venue: University of Southern California
Publication date: 01/01/2006
Field of study

In partial fulfillment of the degree of Doctor of Philosophy (Computer Science)</p

CiteSeerX

Institutional Knowledge at Singapore Management University

USC Digital Library

Recommended from our members

Decision-Theoretic Meta-reasoning in Partially Observable and Decentralized Settings

Author: Carlin Alan Scott
Publication venue: ScholarWorks@UMass Amherst
Publication date: 01/01/2012
Field of study

This thesis examines decentralized meta-reasoning. For a single agent or multiple agents, it may not be enough for agents to compute correct decisions if they do not do so in a timely or resource efficient fashion. The utility of agent decisions typically increases with decision quality, but decreases with computation time. The reasoning about one\u27s computation process is referred to as meta-reasoning. Aspects of meta-reasoning considered in this thesis include the reasoning about how to allocate computational resources, including when to stop one type of computation and begin another, and when to stop all computation and report an answer. Given a computational model, this translates into computing how to schedule the basic computations that solve a problem. This thesis constructs meta-reasoning strategies for the purposes of monitoring and control in multi-agent settings, specifically settings that can be modeled by the Decentralized Partially Observable Markov Decision Process (Dec-POMDP). It uses decision theory to optimize computation for efficiency in time and space in communicative and non-communicative decentralized settings. Whereas base-level reasoning describes the optimization of actual agent behaviors, the meta-reasoning strategies produced by this thesis dynamically optimize the computational resources which lead to the selection of base-level behaviors

ScholarWorks@UMass Amherst

Communication for Improving Policy Computation in Distributed POMDPs

Author: Maayan Roth
Makoto Yokoo
Milind Tambe
Ranjit Nair
Publication venue: ACM Press
Publication date: 01/01/2004
Field of study

Distributed Partially Observable Markov Decision Problems (POMDPs) are emerging as a popular approach for modeling multiagent teamwork where a group of agents work together to jointly maximize a reward function. Since the problem of finding the optimal joint policy for a distributed POMDP has been shown to be NEXP-Complete if no assumptions are made about the domain conditions, several locally optimal approaches have emerged as a viable solution. However, the use of communicative actions as part of these locally optimal algorithms has been largely ignored or has been applied only under restrictive assumptions about the domain. In this paper, we show how communicative acts can be explicitly introduced in order to find locally optimal joint policies that allow agents to coordinate better through synchronization achieved via communication. Furthermore, the introduction of communication allows us to develop a novel compact policy representation that results in savings of both space and time which are verified empirically. Finally, through the imposition of constraints on communication such as not going without communicating for more than K steps, even greater space and time savings can be obtained

CiteSeerX