610 research outputs found
Distributed formation control of multiple unmanned aerial vehicles over time-varying graphs using population games
© 2016 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.This paper presents a control technique based on distributed population dynamics under time-varying communication graphs for a multi-agent system structured in a leader-follower fashion. Here, the leader agent follows a particular trajectory and the follower agents should track it in a certain organized formation manner. The tracking of the leader can be performed in the position coordinates x; y; and z, and in the yaw angle phi. Additional features are performed with this method: each agent has only partial knowledge of the position of other agents and not necessarily all agents should communicate to the leader. Moreover, it is possible to integrate a new agent into the formation (or for an agent to leave the formation task) in a dynamical manner. In addition, the formation configuration can be changed along the time, and the distributed population-games-based controller achieves the new organization goal accommodating conveniently the information-sharing graph in function of the communication range capabilities of each UAV. Finally, several simulations are presented to illustrate different scenarios, e.g., formation with time-varying communication network, and time-varying formationPeer ReviewedPostprint (author's final draft
Distributed population dynamics : optimization and control applications
© 2017 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.Population dynamics have been widely used in the design of learning and control systems for networked engineering applications, where the information dependency among elements of the network has become a relevant issue. Classic population dynamics (e.g., replicator, logit choice, Smith, and projection) require full information to evolve to the solution (Nash equilibrium). The main reason is that classic population dynamics are deduced by assuming well-mixed populations, which limits the applications where this theory can be implemented. In this paper, we extend the concept of population dynamics for nonwell-mixed populations in order to deal with distributed information structures that are characterized by noncomplete graphs. Although the distributed population dynamics proposed in this paper use partial information, they preserve similar characteristics and properties of their classic counterpart. Specifically, we prove mass conservation and convergence to Nash equilibrium. To illustrate the performance of the proposed dynamics, we show some applications in the solution of optimization problems, classic games, and the design of distributed controllers.Peer ReviewedPostprint (author's final draft
Using a theory of mind to find best responses to memory-one strategies
Memory-one strategies are a set of Iterated Prisoner's Dilemma strategies
that have been praised for their mathematical tractability and performance
against single opponents. This manuscript investigates best response memory-one
strategies with a theory of mind for their opponents. The results add to the
literature that has shown that extortionate play is not always optimal by
showing that optimal play is often not extortionate. They also provide evidence
that memory-one strategies suffer from their limited memory in multi agent
interactions and can be out performed by optimised strategies with longer
memory. We have developed a theory that has allowed to explore the entire space
of memory-one strategies. The framework presented is suitable to study
memory-one strategies in the Prisoner's Dilemma, but also in evolutionary
processes such as the Moran process, Furthermore, results on the stability of
defection in populations of memory-one strategies are also obtained
Distributed population dynamics: Optimization and control applications
Population dynamics have been widely used in the design of learning and control systems for networked engineering applications, where the information dependency among elements of the network has become a relevant issue. Classic population dynamics (e.g., replicator, logit choice, Smith, and projection) require full information to evolve to the solution (Nash equilibrium). The main reason is that classic population dynamics are deduced by assuming well-mixed populations, which limits the applications where this theory can be implemented. In this paper, we extend the concept of population dynamics for nonwell-mixed populations in order to deal with distributed information structures that are characterized by noncomplete graphs. Although the distributed population dynamics proposed in this paper use partial information, they preserve similar characteristics and properties of their classic counterpart. Specifically, we prove mass conservation and convergence to Nash equilibrium. To illustrate the performance of the proposed dynamics, we show some applications in the solution of optimization problems, classic games, and the design of distributed controllers.This work has been supported by COLCIENCIAS–COLFUTURO, grants No: 528 and 6172; and by Project ALTERNAR, Acuerdo 005, 07/19/13 CTeI–SGR–Narino, Colombia.Peer reviewe
Recommended from our members
Robust Hybrid Systems for Control, Learning, and Optimization in Networked Dynamical Systems
The deployment of advanced real-time control and optimization strategies in socially-integratedengineering systems could significantly improve our quality of life whilecreating jobs and economic opportunity. However, in cyber-physical systems such assmart grids, transportation networks, healthcare, and robotic systems, there still existseveral challenges that prevent the implementation of intelligent control strategies.These challenges include the existence of limited communication networks, dynamicand stochastic environments, multiple decision makers interacting with the system,and complex hybrid dynamics emerging from the feedback interconnection of physicalprocesses and computational devices.In this dissertation, we study the problem of designing robust control and optimizationalgorithms for cyber-physical systems using the framework of hybrid dynamicalsystems. We propose different theoretical frameworks for the design and analysis offeedback mechanisms that optimize the performance of dynamical systems without requiringan explicit characterization of their mathematical model, i.e., in a model-freeway. The closed-loop system that emerges of the interconnection of the plant with thefeedback mechanism describes, in general, a set-valued hybrid dynamical system. Thesetypes of systems combine continuous-time and discrete-time dynamics, and they usuallylack the uniqueness of solutions property. The framework of set-valued hybriddynamical systems allows us to study many complex dynamical systems that emerge indifferent engineering applications, such as networked multi-agent systems with switching graphs, non-smooth mechanical systems, dynamic pricing mechanisms in transportationsystems, autonomous robots with logic-based controllers, etc. We proposea step-by-step approach to the design of different types of discrete-time, continuous-time,hybrid, and stochastic controllers for different types of applications, extendingand generalizing different results in the literature in the area of extremum seeking control,sampled-data extremization, robust synchronization, and stochastic learning innetworked systems. Our theoretical results are illustrated via different simulations andnumerical examples
Model-free Optimization of Trajectory And Impedance Parameters on Exercise Robots With Applications To Human Performance And Rehabilitation
This dissertation focuses on the study and optimization of human training and its physiological effects through the use of advanced exercise machines (AEMs). These machines provide an invaluable contribution to advanced training by combining exercise physiology with technology. Unlike conventional exercise machines (CEMs), AEMs provide controllable trajectories and impedances by using electric motors and control systems. Therefore, they can produce various patterns even in the absence of gravity. Moreover, the ability of the AEMs to target multiple physiological systems makes them the best available option to improve human performance and rehabilitation. During the early stage of the research, the physiological effects produced under training by the manual regulation of the trajectory and impedance parameters of the AEMs were studied. Human dynamics appear as not only complex but also unique and time-varying due to the particular features of each person such as its musculoskeletal distribution, level of fatigue,fitness condition, hydration, etc. However, the possibility of the optimization of the AEM training parameters by using physiological effects was likely, thus the optimization objective started to be formulated. Some previous research suggests that a model-based optimization of advanced training is complicated for real-time environments as a consequence of the high level of v complexity, computational cost, and especially the many unidentifiable parameters. Moreover, a model-based method differs from person to person and it would require periodic updates based on physical and psychological variations in the user. Consequently, we aimed to develop a model-free optimization framework based on the use of Extremum Seeking Control (ESC). ESC is a non-model based controller for real-time optimization which its main advantage over similar controllers is its ability to deal with unknown plants. This framework uses a physiological effect of training as bio-feedback. Three different frameworks were performed for single-variable and multi-variable optimization of trajectory and impedance parameters. Based on the framework, the objective is achieved by seeking the optimal trajectory and/or impedance parameters associated with the orientation of the ellipsoidal path to be tracked by the user and the stiffness property of the resistance by using weighted measures of muscle activations
- …