4,205 research outputs found

Task-Based Information Compression for Multi-Agent Communication Problems with Channel Rate Constraints

Author: Chatzinotas Symeon
Mostaani Arsham
Ottersten Björn
Vu Thang X.
Publication date: 27/07/2021

A collaborative task is assigned to a multiagent system (MAS) in which agents are allowed to communicate. The MAS runs over an underlying Markov decision process, and its task is to maximize the averaged sum of discounted one-stage rewards. Although knowing the global state of the environment is necessary for optimal action selection by the MAS, agents are limited to individual observations. Inter-agent communication can address the issue of local observability; however, the limited rate of inter-agent communication prevents the agents from acquiring precise global state information. To overcome this challenge, agents need to communicate their observations in a compact way, such that the MAS gives up as little of the sum of rewards as possible. We show that this problem is equivalent to a form of rate-distortion problem, which we call task-based information compression. We introduce a scheme for task-based information compression titled State Aggregation for Information Compression (SAIC), for which a state aggregation algorithm is analytically designed. SAIC is shown to be capable of achieving near-optimal performance in terms of the achieved sum of discounted rewards. The proposed algorithm is applied to a rendezvous problem and its performance is compared with several benchmarks. Numerical experiments confirm the superiority of the proposed algorithm. Comment: 13 pages, 9 figures

arXiv.org e-Print Archive

Open Repository and Bibliography - Luxembourg

FigShare

Multiagent Bidirectionally-Coordinated Nets: Emergence of Human-level Coordination in Learning to Play StarCraft Combat Games

Author: Long Haitao
Peng Peng
Tang Zhenkun
Wang Jun
Wen Ying
Yang Yaodong
Yuan Quan
Publication date: 29/03/2017

Many artificial intelligence (AI) applications require multiple intelligent agents to work in a collaborative effort. Efficient learning for intra-agent communication and coordination is an indispensable step towards general AI. In this paper, we take the StarCraft combat game as a case study, where the task is to coordinate multiple agents as a team to defeat their enemies. To maintain a scalable yet effective communication protocol, we introduce a Multiagent Bidirectionally-Coordinated Network (BiCNet ['bIknet]) with a vectorised extension of the actor-critic formulation. We show that BiCNet can handle different types of combat with arbitrary numbers of AI agents on both sides. Our analysis demonstrates that, without any supervision such as human demonstrations or labelled data, BiCNet can learn various types of advanced coordination strategies that are commonly used by experienced game players. In our experiments, we evaluate our approach against multiple baselines under different scenarios; it shows state-of-the-art performance and has potential value for large-scale real-world applications. Comment: 10 pages, 10 figures. Previously as title: "Multiagent Bidirectionally-Coordinated Nets for Learning to Play StarCraft Combat Games", Mar 201

arXiv.org e-Print Archive

UCL Discovery

Analysis and design of multiagent systems using MAS-CommonKADS

Author: Garijo Ayestaran Mercedes
González Cristóbal José Carlos
Iglesias Fernandez Carlos Angel
Velasco Pérez Juan Ramón
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/1998

This article proposes an agent-oriented methodology called MAS-CommonKADS and develops a case study. This methodology extends the knowledge engineering methodology CommonKADS with techniques from object-oriented and protocol engineering methodologies. The methodology consists of the development of seven models: Agent Model, that describes the characteristics of each agent; Task Model, that describes the tasks that the agents carry out; Expertise Model, that describes the knowledge needed by the agents to achieve their goals; Organisation Model, that describes the structural relationships between agents (software agents and/or human agents); Coordination Model, that describes the dynamic relationships between software agents; Communication Model, that describes the dynamic relationships between human agents and their respective personal assistant software agents; and Design Model, that refines the previous models and determines the most suitable agent architecture for each agent, and the requirements of the agent network.

CiteSeerX

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

Archivo Digital UPM

Multiagent cooperation for solving global optimization problems: an extendible framework with example cooperation strategies

Author: A. Günay
A. Neumaier
Akın Günay
D. Barbucha
D.J. Wales
D.S. Siirola
E. Kaszkurewicz
E.-G. Talbi
F. Schoen
Fatma Başak Aydemir
Figen Öztoprak
J. Nocedal
J.J. Moré
M. Gendreau
M. Hapner
M. Winikoff
M. Yokoo
M.J. North
M.P. Singh
N. Melab
O. Shehory
P.J. Modi
P.M. Pardalos
Pınar Yolum
R. Östermark
R. Ötermark
R.H. Bordini
S. Kraus
S. Luke
S. Staab
S. Talukdar
T.G. Crainic
Ş. I. Birbil
Ş. İlker Birbil
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 06/11/2012

This paper proposes the use of multiagent cooperation for solving global optimization problems through the introduction of a new multiagent environment, MANGO. The strength of the environment lies in its flexible structure, based on communicating software agents that attempt to solve a problem cooperatively. This structure allows the execution of a wide range of global optimization algorithms described as a set of interacting operations. At one extreme, MANGO welcomes an individual non-cooperating agent, which is basically the traditional way of solving a global optimization problem. At the other extreme, autonomous agents existing in the environment cooperate as they see fit during run time. We explain the development and communication tools provided in the environment, as well as examples of agent realizations and cooperation scenarios. We also show how the multiagent structure is more effective than having a single nonlinear optimization algorithm with randomly selected initial points.

Crossref

Sabanci University Research Database