Search CORE

6,160 research outputs found

Bearing-based formation control with second-order agent dynamics

Author: Tron Roberto
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2018
Field of study

We consider the distributed formation control problem for a network of agents using visual measurements. We propose solutions that are based on bearing (and optionally distance) measurements, and agents with double integrator dynamics. We assume that a subset of the agents can track, in addition to their neighbors, a set of static features in the environment. These features are not considered to be part of the formation, but they are used to asymptotically control the velocity of the agents. We analyze the convergence properties of the proposed protocols analytically and through simulations.Published versionSupporting documentatio

Crossref

Boston University Institutional Repository (OpenBU)

Deep Reinforcement Learning for Swarm Systems

Author: Hüttenrauch Maximilian
Neumann Gerhard
Šošić Adrian
Publication venue
Publication date: 01/01/2019
Field of study

Recently, deep reinforcement learning (RL) methods have been applied successfully to multi-agent scenarios. Typically, these methods rely on a concatenation of agent states to represent the information content required for decentralized decision making. However, concatenation scales poorly to swarm systems with a large number of homogeneous agents as it does not exploit the fundamental properties inherent to these systems: (i) the agents in the swarm are interchangeable and (ii) the exact number of agents in the swarm is irrelevant. Therefore, we propose a new state representation for deep multi-agent RL based on mean embeddings of distributions. We treat the agents as samples of a distribution and use the empirical mean embedding as input for a decentralized policy. We define different feature spaces of the mean embedding using histograms, radial basis functions and a neural network learned end-to-end. We evaluate the representation on two well known problems from the swarm literature (rendezvous and pursuit evasion), in a globally and locally observable setup. For the local setup we furthermore introduce simple communication protocols. Of all approaches, the mean embedding representation using neural network features enables the richest information exchange between neighboring agents facilitating the development of more complex collective strategies.Comment: 31 pages, 12 figures, version 3 (published in JMLR Volume 20

arXiv.org e-Print Archive

TUbiblio