Neural Network iLQR: A New Reinforcement Learning Architecture

Cheng, Zilong; Lee, Tong Heng; Lewis, Frank L.; Ma, Jun; Zhang, Xiaoxue

Publication date: 21 November 2020

Abstract

As a notable machine learning paradigm, reinforcement learning has progressed by leaps and bounds. Compared with reinforcement learning methods that assume a given system model, methods that treat the model as unknown generally exhibit significantly broader universality and applicability. In this work, a new reinforcement learning architecture, termed the "neural network iterative linear quadratic regulator (NNiLQR)", is developed and presented without requiring any prior knowledge of the system model. Depending solely on measurement data, this method yields a completely new non-parametric routine for establishing the optimal policy (without the necessity of system modeling) through iterative refinements of the neural network system. Importantly, this approach significantly outperforms the classical iterative linear quadratic regulator (iLQR) method in terms of the given objective function because of the innovative utilization of further exploration in the methodology. These merits of the NNiLQR method are demonstrated by the results attained in two illustrative examples.

Comment: 13 pages, 9 figures

Available Versions

arXiv.org e-Print Archive

oai:arXiv.org:2011.10737

Last time updated on 02/03/2021