52,473 research outputs found

    CONTROL AND ESTIMATION ALGORITHMS FOR MULTIPLE-AGENT SYSTEMS

    Get PDF
    Tese arquivada ao abrigo da Portaria nº 227/2017 de 25 de julhoIn this thesis we study crucial problems within complex, large scale, networked control systems and mobile sensor networks. The ¯rst one is the problem of decomposition of a large-scale system into several interconnected subsystems, based on the imposed information structure constraints. After associating an intelligent agent with each subsystem, we face with a problem of formulating their local estimation and control laws and designing inter-agent communication strategies which ensure stability, desired performance, scalability and robustness of the overall system. Another problem addressed in this thesis, which is critical in mobile sensor networks paradigm, is the problem of searching positions for mobile nodes in order to achieve optimal overall sensing capabilities. Novel, overlapping decentralized state and parameter estimation schemes based on the consensus strategy have been proposed, in both continuous-time and discrete-time. The algorithms are proposed in the form of a multi-agent network based on a combination of local estimators and a dynamic consensus strategy, assuming possible intermittent observations and communication faults. Under general conditions concerning the agent resources and the network topology, conditions are derived for the stability and convergence of the algorithms. For the state estimation schemes, a strategy based on minimization of the steady-state mean-square estimation error is proposed for selection of the consensus gains; these gains can also be adjusted by local adaptation schemes. It is also demonstrated that there exists a connection between the network complexity and e±ciency of denoising, i.e., of suppression of the measurement noise in°uence. Several numerical examples serve to illustrate characteristic properties of the proposed algorithm and to demonstrate its applicability to real problems. Furthermore, several structures and algorithms for multi-agent control based on a dynamic consensus strategy have been proposed. Two novel classes of structured, overlapping decentralized control algorithms are presented. For the ¯rst class, an agreement between the agents is implemented at the level of control inputs, while the second class is based on the agreement at the state estimation level. The proposed control algorithms have been illustrated by several examples. Also, the second class of the proposed consensus based control scheme has been applied to decentralized overlapping tracking control of planar formations of UAVs. A comparison is given with the proposed novel design methodology based on the expansion/contraction paradigm and the inclusion principle. Motivated by the applications to the optimal mobile sensor positioning within mobile sensor networks, the perturbation-based extremum seeking algorithm has been modifed and extended. It has been assumed that the integrator gain and the perturbation amplitude are time varying (decreasing in time with a proper rate) and that the output is corrupted with measurement noise. The proposed basic, one dimensional, algorithm has been extended to two dimensional, hybrid schemes and directly applied to the planar optimal mobile sensor positioning, where the vehicles can be modeled as velocity actuated point masses, force actuated point masses, or nonholonomic unicycles. The convergence of all the proposed algorithms, with probability one and in the mean square sense, has been proved. Also, the problem of target assignment in multi-agent systems using multi-variable extremum seeking algorithm has been addressed. An algorithm which e®ectively solves the problem has been proposed, based on the local extremum seeking of the specially designed global utility functions which capture the dependance among di®erent, possibly con°icting objectives of the agents. It has been demonstrated how the utility function parameters and agents' initial conditions impact the trajectories and destinations of the agents. All the proposed extremum seeking based algorithms have been illustrated with several simulations

    Information Theory and Cooperative Control in Networked Multi-Agent Systems with Applications to Smart Grid

    Get PDF
    This dissertation focuses on information theoretic aspects of and cooperative control techniques in networked multi-agent systems (NMAS) with communication constraints. In the first part of the dissertation, information theoretic limitations of tracking problems in networked control systems, especially leader-follower systems with communication constraints, are studied. Necessary conditions on the data rate of each communication link for tracking of the leader-follower systems are provided. By considering the forward and feedback channels as one cascade channel, we also provide a lower bound for the data rate of the cascade channel for the system to track a reference signal such that the tracking error has finite second moment. Finally, the aforementioned results are extended to the case in which the leader system and follower system have different system models. In the second part, we propose an easily scalable hierarchical decision-making and control architecture for smart grid with communication constraints in which distributed customers equipped with renewable distributed generation (RDG) interact and trade energy in the grid. We introduce the key components and their interactions in the proposed control architecture and discuss the design of distributed controllers which deal with short-term and long-term grid stability, power load balancing and energy routing. At microgrid level, under the assumption of user cooperation and inter-user communications, we propose a distributed networked control strategy to solve the demand-side management problem in microgrids. Moreover, by considering communication delays between users and microgrid central controller, we propose a distributed networked control strategy with prediction to solve the demand-side management problem with communication delays. In the third part, we consider the disturbance attenuation and stabilization problem in networked control systems. To be specific, we consider the string stability in a large group of interconnected systems over a communication network. Its potential applications could be found in formation tracking control in groups of robots, as well as uncertainty reduction and disturbance attenuation in smart grid. We propose a leader-following consensus protocol for such interconnected systems and derive the sufficient conditions, in terms of communication topology and control parameters, for string stability. Simulation results and performance in terms of disturbance propagation are also given. In the fourth part, we consider distributed tracking and consensus in networked multi-agent systems with noisy time-varying graphs and incomplete data. In particular, a distributed tracking with consensus algorithm is developed for the space-object tracking with a satellite surveillance network. We also intend to investigate the possible application of such methods in smart grid networks. Later, conditions for achieving distributed consensus are discussed and the rate of convergence is quantified for noisy time-varying graphs with incomplete data. We also provide detailed simulation results and performance comparison of the proposed distributed tracking with consensus algorithm in the case of space-object tracking problem and that of distributed local Kalman filtering with centralized fusion and centralized Kalman filter. The information theoretic limitations developed in the first part of this dissertation provide guildlines for design and analysis of tracking problems in networked control systems. The results reveal the mutual interaction and joint application of information theory and control theory in networked control systems. Second, the proposed architectures and approaches enable scalability in smart grid design and allow resource pooling among distributed energy resources (DER) so that the grid stability and optimality is maintained. The proposed distributed networked control strategy with prediction provides an approach for cooperative control at RDG-equipped customers within a self-contained microgrid with different feedback delays. Our string stability analysis in the third part of this dissertation allows a single networked control system to be extended to a large group of interconnected subsystems while system stability is still maintained. It also reveals the disturbance propagation through the network and the effect of disturbance in one subsystem on other subsystems. The proposed leader-following consensus protocol in the constrained communication among users reveals the effect of communication in stabilization of networked control systems and the interaction between communication and control over a network. Finally, the distributed tracking and consensus in networked multi-agent systems problem shows that information sharing among users improves the quality of local estimates and helps avoid conflicting and inefficient distributed decisions. It also reveals the effect of the graph topologies and incomplete node measurements on the speed of achieving distributed decision and final consensus accuracy

    Deep Reinforcement Learning for Swarm Systems

    Full text link
    Recently, deep reinforcement learning (RL) methods have been applied successfully to multi-agent scenarios. Typically, these methods rely on a concatenation of agent states to represent the information content required for decentralized decision making. However, concatenation scales poorly to swarm systems with a large number of homogeneous agents as it does not exploit the fundamental properties inherent to these systems: (i) the agents in the swarm are interchangeable and (ii) the exact number of agents in the swarm is irrelevant. Therefore, we propose a new state representation for deep multi-agent RL based on mean embeddings of distributions. We treat the agents as samples of a distribution and use the empirical mean embedding as input for a decentralized policy. We define different feature spaces of the mean embedding using histograms, radial basis functions and a neural network learned end-to-end. We evaluate the representation on two well known problems from the swarm literature (rendezvous and pursuit evasion), in a globally and locally observable setup. For the local setup we furthermore introduce simple communication protocols. Of all approaches, the mean embedding representation using neural network features enables the richest information exchange between neighboring agents facilitating the development of more complex collective strategies.Comment: 31 pages, 12 figures, version 3 (published in JMLR Volume 20

    Differential Inequalities in Multi-Agent Coordination and Opinion Dynamics Modeling

    Get PDF
    Distributed algorithms of multi-agent coordination have attracted substantial attention from the research community; the simplest and most thoroughly studied of them are consensus protocols in the form of differential or difference equations over general time-varying weighted graphs. These graphs are usually characterized algebraically by their associated Laplacian matrices. Network algorithms with similar algebraic graph theoretic structures, called being of Laplacian-type in this paper, also arise in other related multi-agent control problems, such as aggregation and containment control, target surrounding, distributed optimization and modeling of opinion evolution in social groups. In spite of their similarities, each of such algorithms has often been studied using separate mathematical techniques. In this paper, a novel approach is offered, allowing a unified and elegant way to examine many Laplacian-type algorithms for multi-agent coordination. This approach is based on the analysis of some differential or difference inequalities that have to be satisfied by the some "outputs" of the agents (e.g. the distances to the desired set in aggregation problems). Although such inequalities may have many unbounded solutions, under natural graphic connectivity conditions all their bounded solutions converge (and even reach consensus), entailing the convergence of the corresponding distributed algorithms. In the theory of differential equations the absence of bounded non-convergent solutions is referred to as the equation's dichotomy. In this paper, we establish the dichotomy criteria of Laplacian-type differential and difference inequalities and show that these criteria enable one to extend a number of recent results, concerned with Laplacian-type algorithms for multi-agent coordination and modeling opinion formation in social groups.Comment: accepted to Automatic
    corecore