
A new iterative algorithm for solving H∞ control problem of continuous-time Markovian jumping linear systems based on online implementation

Authors: Ding Zhengtao, He Shuping, Liu Fei, Song Jun
Publication date: 01/01/2016

The University of Manchester - Institutional Repository

Data-Driven Integral Reinforcement Learning for Continuous-Time Non-Zero-Sum Games

Authors: Ding Dawei, Modares Hamidreza, Wang Liming, Wunsch Donald C., Yang Yongliang, Yin Yixin
Publication venue: Scholars' Mine
Publication date: 01/06/2019

This paper develops an integral value iteration (VI) method to efficiently find, online, the Nash equilibrium solution of two-player non-zero-sum (NZS) differential games for linear systems with partially unknown dynamics. To guarantee closed-loop stability about the Nash equilibrium, an explicit upper bound for the discount factor is given. To show the efficacy of the presented online model-free solution, the integral VI method is compared with the model-based offline policy iteration method. Moreover, a detailed theoretical analysis of the integral VI algorithm is provided, covering three aspects: the positive definiteness of the updated cost functions, the stability of the closed-loop systems, and the conditions that guarantee monotone convergence. Finally, simulation results demonstrate the efficacy of the presented algorithms.

Missouri University of Science and Technology (Missouri S&T): Scholars' Mine

Integral Reinforcement Learning for Finding Online the Feedback Nash Equilibrium of Nonzero-Sum Differential Games

Authors: Draguna Vrabie, Frank L. Lewis
Publication venue: 'IntechOpen'
Publication date: 14/01/2011

IntechOpen

Neural network optimal control for nonlinear system based on zero-sum differential game

Authors: Xingjian Fu, Zizheng Li
Publication venue: 'Proceedings of the National Academy of Sciences'
Publication date: 01/01/2021

In this paper, for a class of complex nonlinear system control problems, the constrained optimal control problem is solved for nonlinear systems with unknown system functions and unknown time-varying disturbances, based on two-person zero-sum game theory combined with the idea of approximate dynamic programming (ADP). To obtain the approximate optimal solution of the zero-sum game, multilayer neural networks are used to fit the evaluation network, the execution network, and the disturbance network of ADP, respectively. Lyapunov stability theory is used to prove the uniform convergence, and the system control output converges to a neighborhood of the target reference value. Finally, a simulation example verifies the effectiveness of the algorithm.

Institute of Mathematics AS CR, v. v. i.