4 research outputs found

Algorithms for distributed exploration

Author: Kudenko D.
Strens M.J.A.
Walker T.
Publication venue: 'IOS Press'

In this paper we propose algorithms for a set of problems in which a distributed team of agents tries to compile a global map of the environment from local observations. We focus on two approaches: one based on behavioural agent technology, where agents are pulled (or repelled) by various forces, and another where agents follow an approximate planning approach based on dynamic programming. We study these approaches under different conditions, such as different types of environments, varying sensor and communication ranges, and the availability of prior knowledge of the map. The results show that in most cases the simpler behavioural agent teams perform at least as well as, if not better than, the teams based on approximate planning and dynamic programming. The research has practical implications not only for distributed exploration tasks, but also for analogous distributed search or optimisation problems.

White Rose Research Online

Solving Optimization Problem Using Multi-agent Model Based on Belief Interaction

Author: A.S. Rao
C.J. Petrie
G. Weiss
H. Jing
J.D. Siirola
M.J.A. Strens
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2006

Crossref

Simple Model-Based Exploration and Exploitation of Markov Decision Processes Using the Elimination Algorithm

Author: C. Lusena
L.P. Kaelbling
M. Kearns
M. Mundhenk
M.J.A. Strens
R.I. Brafman
R.M. Neal
R.S. Sutton
S.J. Russell
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2007

Crossref

Combining Policy Search with Planning in Multi-agent Cooperation

Author: B. Bulka
G. Fraser
J. Baxter
J.R. Kok
L. Peshkin
M. Grounds
M.J.A. Strens
O. Buffet
O. Obst
P. Stone
R.J. Williams
T. Nakashima
Publication date: 01/01/2009

It is cooperation that essentially differentiates multi-agent systems (MASs) from single-agent intelligence. In realistic MAS applications such as RoboCup, repeated work has shown that traditional machine learning (ML) approaches have difficulty mapping directly from cooperative behaviours to actuator outputs. To overcome this problem, vertical layered architectures are commonly used to break cooperation down into behavioural layers; ML is then used to generate the different low-level skills, and a planning mechanism is added to create high-level cooperation. We propose a novel method called Policy Search Planning (PSP), in which policy search is used to find an optimal policy for selecting plans from a plan pool. PSP extends an existing gradient-search method (GPOMDP) to a MAS domain. We demonstrate how PSP can be used in RoboCup Simulation, and our experimental results show that it is robust and adaptive and that it outperforms other methods. © 2009 Springer Berlin Heidelberg

Crossref

Oxford University Research Archive