2,440 research outputs found
Optimal output consensus for linear systems: A topology free approach
In this paper, for any homogeneous system of agents with linear continuous
time dynamics, we formulate an optimal control problem. In this problem a
convex cost functional of the control signals of the agents shall be minimized,
while the outputs of the agents shall coincide at some given finite time. This
is an instance of the rendezvous or finite time consensus problem. We solve
this problem without any constraints on the communication topology and provide
a solution as an explicit feedback control law for the case when the dynamics
of the agents is output controllable. It turns out that the communication graph
topology induced by the solution is complete. Based on this solution for the
finite time consensus problem, we provide a solution to the case of infinite
time horizon. Furthermore, we investigate under what circumstances it is
possible to express the controller as a feedback control law of the output
instead of the states.Comment: 8 page
Deep Reinforcement Learning for Swarm Systems
Recently, deep reinforcement learning (RL) methods have been applied
successfully to multi-agent scenarios. Typically, these methods rely on a
concatenation of agent states to represent the information content required for
decentralized decision making. However, concatenation scales poorly to swarm
systems with a large number of homogeneous agents as it does not exploit the
fundamental properties inherent to these systems: (i) the agents in the swarm
are interchangeable and (ii) the exact number of agents in the swarm is
irrelevant. Therefore, we propose a new state representation for deep
multi-agent RL based on mean embeddings of distributions. We treat the agents
as samples of a distribution and use the empirical mean embedding as input for
a decentralized policy. We define different feature spaces of the mean
embedding using histograms, radial basis functions and a neural network learned
end-to-end. We evaluate the representation on two well known problems from the
swarm literature (rendezvous and pursuit evasion), in a globally and locally
observable setup. For the local setup we furthermore introduce simple
communication protocols. Of all approaches, the mean embedding representation
using neural network features enables the richest information exchange between
neighboring agents facilitating the development of more complex collective
strategies.Comment: 31 pages, 12 figures, version 3 (published in JMLR Volume 20
Fuzzy Distributed Cooperative Tracking For A Swarm Of Unmanned Aerial Vehicles With Heterogeneous Goals
Copyright © 2015 Taylor & Francis This is an Accepted Manuscript of an article published by Taylor & Francis in International Journal of Systems Science on 29 December 2015, available online: http://www.tandfonline.com/10.1080/00207721.2015.1126380This article proposes a systematic analysis for a tracking problem which ensures cooperation amongst a swarm of UAVs, modelled as nonlinear systems with linear and angular velocity constraints, in order to achieve different goals. A distributed Takagi-Sugeno (TS) framework design is adopted for the representation of the nonlinear model of the dynamics of the UAVs. The distributed control law which is introduced is composed of both node and network level information. Firstly feedback gains are synthesised using a Parallel Distributed Compensation (PDC) control law structure, for a collection of isolated UAVs; ignoring communications among the swarm. Then secondly, based on an alternation-like procedure, the resulting feedback gains are used to determine Lyapunov matrices which are utilised at network level to incorporate into the control law the relative differences in the states of the vehicles, and to induce cooperative behaviour. Eventually stability is guaranteed for the entire swarm. The control synthesis is performed using tools from linear control theory: in particular the design criteria are posed as Linear Matrix Inequalities (LMIs). An example based on a UAV tracking scenario is included to outline the efficacy of the approach.Engineering and Physical Sciences Research Council (EPSRC
- …