2,440 research outputs found

    Optimal output consensus for linear systems: A topology free approach

    Get PDF
    In this paper, for any homogeneous system of agents with linear continuous time dynamics, we formulate an optimal control problem. In this problem a convex cost functional of the control signals of the agents shall be minimized, while the outputs of the agents shall coincide at some given finite time. This is an instance of the rendezvous or finite time consensus problem. We solve this problem without any constraints on the communication topology and provide a solution as an explicit feedback control law for the case when the dynamics of the agents is output controllable. It turns out that the communication graph topology induced by the solution is complete. Based on this solution for the finite time consensus problem, we provide a solution to the case of infinite time horizon. Furthermore, we investigate under what circumstances it is possible to express the controller as a feedback control law of the output instead of the states.Comment: 8 page

    Deep Reinforcement Learning for Swarm Systems

    Full text link
    Recently, deep reinforcement learning (RL) methods have been applied successfully to multi-agent scenarios. Typically, these methods rely on a concatenation of agent states to represent the information content required for decentralized decision making. However, concatenation scales poorly to swarm systems with a large number of homogeneous agents as it does not exploit the fundamental properties inherent to these systems: (i) the agents in the swarm are interchangeable and (ii) the exact number of agents in the swarm is irrelevant. Therefore, we propose a new state representation for deep multi-agent RL based on mean embeddings of distributions. We treat the agents as samples of a distribution and use the empirical mean embedding as input for a decentralized policy. We define different feature spaces of the mean embedding using histograms, radial basis functions and a neural network learned end-to-end. We evaluate the representation on two well known problems from the swarm literature (rendezvous and pursuit evasion), in a globally and locally observable setup. For the local setup we furthermore introduce simple communication protocols. Of all approaches, the mean embedding representation using neural network features enables the richest information exchange between neighboring agents facilitating the development of more complex collective strategies.Comment: 31 pages, 12 figures, version 3 (published in JMLR Volume 20

    Fuzzy Distributed Cooperative Tracking For A Swarm Of Unmanned Aerial Vehicles With Heterogeneous Goals

    Get PDF
    Copyright © 2015 Taylor & Francis This is an Accepted Manuscript of an article published by Taylor & Francis in International Journal of Systems Science on 29 December 2015, available online: http://www.tandfonline.com/10.1080/00207721.2015.1126380This article proposes a systematic analysis for a tracking problem which ensures cooperation amongst a swarm of UAVs, modelled as nonlinear systems with linear and angular velocity constraints, in order to achieve different goals. A distributed Takagi-Sugeno (TS) framework design is adopted for the representation of the nonlinear model of the dynamics of the UAVs. The distributed control law which is introduced is composed of both node and network level information. Firstly feedback gains are synthesised using a Parallel Distributed Compensation (PDC) control law structure, for a collection of isolated UAVs; ignoring communications among the swarm. Then secondly, based on an alternation-like procedure, the resulting feedback gains are used to determine Lyapunov matrices which are utilised at network level to incorporate into the control law the relative differences in the states of the vehicles, and to induce cooperative behaviour. Eventually stability is guaranteed for the entire swarm. The control synthesis is performed using tools from linear control theory: in particular the design criteria are posed as Linear Matrix Inequalities (LMIs). An example based on a UAV tracking scenario is included to outline the efficacy of the approach.Engineering and Physical Sciences Research Council (EPSRC
    corecore