Search CORE

2 research outputs found

Training Reinforcement Neurocontrollers Using The Polytope Algorithm

Author: Aristidis Likas
Correspondence A. Likas
Isaac Lagaris
Publication venue
Publication date
Field of study

A new training algorithm is presented for delayed reinforcement learning problems that does not assume the existence of a critic model and employs the polytope optimization algorithm to adjust the weights of the action network so that a simple direct measure of the training performance is maximized. Experimental results from the application of the method to the pole balancing problem indicate improved training performance compared with critic-based and genetic reinforcement approaches. Keywords: reinforcement learning, neurocontrol, optimization, polytope algorithm, pole balancing, genetic reinforcement. TRAINING REINFORCEMENT NEUROCONTROLLERS USING THE POLYTOPE ALGORITHM Abstract A new training algorithm is presented for delayed reinforcement learning problems that does not assume the existence of a critic model and employs the polytope optimization algorithm to adjust the weights of the action network so that a simple direct measure of the training performance is maximized. Exper..

CiteSeerX