Search CORE

748 research outputs found

Resilient Autonomous Control of Distributed Multi-agent Systems in Contested Environments

Author: Modares Hamidreza
Moghadam Rohollah
Publication venue
Publication date: 08/04/2018
Field of study

An autonomous and resilient controller is proposed for leader-follower multi-agent systems under uncertainties and cyber-physical attacks. The leader is assumed non-autonomous with a nonzero control input, which allows changing the team behavior or mission in response to environmental changes. A resilient learning-based control protocol is presented to find optimal solutions to the synchronization problem in the presence of attacks and system dynamic uncertainties. An observer-based distributed H_infinity controller is first designed to prevent propagating the effects of attacks on sensors and actuators throughout the network, as well as to attenuate the effect of these attacks on the compromised agent itself. Non-homogeneous game algebraic Riccati equations are derived to solve the H_infinity optimal synchronization problem and off-policy reinforcement learning is utilized to learn their solution without requiring any knowledge of the agent's dynamics. A trust-confidence based distributed control protocol is then proposed to mitigate attacks that hijack the entire node and attacks on communication links. A confidence value is defined for each agent based solely on its local evidence. The proposed resilient reinforcement learning algorithm employs the confidence value of each agent to indicate the trustworthiness of its own information and broadcast it to its neighbors to put weights on the data they receive from it during and after learning. If the confidence value of an agent is low, it employs a trust mechanism to identify compromised agents and remove the data it receives from them from the learning process. Simulation results are provided to show the effectiveness of the proposed approach

arXiv.org e-Print Archive

Missouri University of Science and Technology (Missouri S&T): Scholars' Mine

Online Research Commons @ATU (Arkansas Tech University)

Data-Driven Integral Reinforcement Learning for Continuous-Time Non-Zero-Sum Games

Author: Ding Dawei
Modares Hamidreza
Wang Liming
Wunsch Donald C.
Yang Yongliang
Yin Yixin
Publication venue: Scholars\u27 Mine
Publication date: 01/06/2019
Field of study

This paper develops an integral value iteration (VI) method to efficiently find online the Nash equilibrium solution of two-player non-zero-sum (NZS) differential games for linear systems with partially unknown dynamics. To guarantee the closed-loop stability about the Nash equilibrium, the explicit upper bound for the discounted factor is given. To show the efficacy of the presented online model-free solution, the integral VI method is compared with the model-based off-line policy iteration method. Moreover, the theoretical analysis of the integral VI algorithm in terms of three aspects, i.e., positive definiteness properties of the updated cost functions, the stability of the closed-loop systems, and the conditions that guarantee the monotone convergence, is provided in detail. Finally, the simulation results demonstrate the efficacy of the presented algorithms

Missouri University of Science and Technology (Missouri S&T): Scholars' Mine

Integral Reinforcement Learning for Finding Online the Feedback Nash Equilibrium of Nonzero-Sum Differential Games

Author: Draguna Vrabie
Frank L. Lewis
Publication venue: 'IntechOpen'
Publication date: 14/01/2011
Field of study

IntechOpen

Output-feedback online optimal control for a class of nonlinear systems

Author: Harlan Michael
Kamalapurkar Rushikesh
Self Ryan
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 10/07/2019
Field of study

In this paper an output-feedback model-based reinforcement learning (MBRL) method for a class of second-order nonlinear systems is developed. The control technique uses exact model knowledge and integrates a dynamic state estimator within the model-based reinforcement learning framework to achieve output-feedback MBRL. Simulation results demonstrate the efficacy of the developed method

arXiv.org e-Print Archive

Crossref