Search CORE

12,838 research outputs found

Adaptive Critic Designs

Author: Prokhorov Danil V.
Wunsch Donald C.
Publication venue: Scholars\u27 Mine
Publication date: 01/09/1997
Field of study

We discuss a variety of adaptive critic designs (ACDs) for neurocontrol. These are suitable for learning in noisy, nonlinear, and nonstationary environments. They have common roots as generalizations of dynamic programming for neural reinforcement learning approaches. Our discussion of these origins leads to an explanation of three design families: heuristic dynamic programming, dual heuristic programming, and globalized dual heuristic programming (GDHP). The main emphasis is on DHP and GDHP as advanced ACDs. We suggest two new modifications of the original GDHP design that are currently the only working implementations of GDHP. They promise to be useful for many engineering applications in the areas of optimization and optimal control. Based on one of these modifications, we present a unified approach to all ACDs. This leads to a generalized training procedure for ACD

Missouri University of Science and Technology (Missouri S&T): Scholars' Mine

Stable Adaptive Control Using New Critic Designs

Author: Werbos Paul J.
Publication venue: 'SPIE-Intl Soc Optical Eng'
Publication date: 20/11/2012
Field of study

Classical adaptive control proves total-system stability for control of linear plants, but only for plants meeting very restrictive assumptions. Approximate Dynamic Programming (ADP) has the potential, in principle, to ensure stability without such tight restrictions. It also offers nonlinear and neural extensions for optimal control, with empirically supported links to what is seen in the brain. However, the relevant ADP methods in use today -- TD, HDP, DHP, GDHP -- and the Galerkin-based versions of these all have serious limitations when used here as parallel distributed real-time learning systems; either they do not possess quadratic unconditional stability (to be defined) or they lead to incorrect results in the stochastic case. (ADAC or Q-learning designs do not help.) After explaining these conclusions, this paper describes new ADP designs which overcome these limitations. It also addresses the Generalized Moving Target problem, a common family of static optimization problems, and describes a way to stabilize large-scale economic equilibrium models, such as the old long-term energy model of DOE.Comment: Includes general reviews of alternative control technologies and reinforcement learning. 4 figs, >70p., >200 eqs. Implementation details, stability analysis. Included in 9/24/98 patent disclosure. pdf version uploaded 2012, based on direct conversion of the original word/html file, because of issues of format compatabilit

arXiv.org e-Print Archive

Crossref

Dynamic Re-Optimization of a Fed-Batch Fermentor using Adaptive Critic Designs

Author: Iyer M. S.
Wunsch Donald C.
Publication venue: Scholars\u27 Mine
Publication date: 01/01/2001
Field of study

Traditionally, fed-batch biochemical process optimization and control uses complicated off-line optimizers, with no online model adaptation or re-optimization. This study demonstrates the applicability of a class of adaptive critic designs for online re-optimization and control of an aerobic fed-batch fermentor. Specifically, the performance of an entire class of adaptive critic designs, viz., heuristic dynamic programming, dual heuristic programming and generalized dual heuristic programming, was demonstrated to be superior to that of a heuristic random optimizer, on optimization of a fed-batch fermentor operation producing monoclonal antibodie

Missouri University of Science and Technology (Missouri S&T): Scholars' Mine

Choice of Utility Functions for Adaptive Critic Designs

Author: Venayagamoorthy Ganesh K.
Publication venue: Scholars\u27 Mine
Publication date: 01/01/2004
Field of study

This paper first presents a general overview of Adaptive Critic Designs (ACDs) and their existing control applications. It describes the importance of the right choice of utility functions for the development of critic networks and their convergence to the cost-to-go function J. A closer look into the step by step derivation of a utility function for the design of an ACD nonlinear optimal neurocontroller to replace/augment the conventional controllers, the automatic voltage regulator and governor, in a power system consisting of a generator connected to the power grid is described and some results are presented

Missouri University of Science and Technology (Missouri S&T): Scholars' Mine

Online Reinforcement Learning-Based Neural Network Controller Design for Affine Nonlinear Discrete-Time Systems

Author: Sarangapani Jagannathan
Yang Qinmin
Publication venue: Scholars\u27 Mine
Publication date: 01/07/2007
Field of study

In this paper, a novel reinforcement learning neural network (NN)-based controller, referred to adaptive critic controller, is proposed for general multi-input and multi- output affine unknown nonlinear discrete-time systems in the presence of bounded disturbances. Adaptive critic designs consist of two entities, an action network that produces optimal solution and a critic that evaluates the performance of the action network. The critic is termed adaptive as it adapts itself to output the optimal cost-to-go function and the action network is adapted simultaneously based on the information from the critic. In our online learning method, one NN is designated as the critic NN, which approximates the Bellman equation. An action NN is employed to derive the control signal to track a desired system trajectory while minimizing the cost function. Online updating weight tuning schemes for these two NNs are also derived and uniformly ultimate boundedness (UUB) of the tracking error and weight estimates is shown. The effectiveness of the controller is evaluated on a two-link robotic arm system

Missouri University of Science and Technology (Missouri S&T): Scholars' Mine

Adaptive Critic Designs for Optimal Control of Power Systems

Author: Harley Ronald G.
Venayagamoorthy Ganesh K.
Publication venue: Scholars\u27 Mine
Publication date: 01/01/2005
Field of study

The increasing complexity of the modern power grid highlights the need for advanced modeling and control techniques for effective control of excitation, turbine and flexible AC transmission systems (FACTS). The crucial factors affecting the modern power systems today is voltage and load flow control. Simulation studies in the PSCAD/EMTDC environment and realtime laboratory experimental studies carried out are described and the results show the successful control of the power system elements and the entire power system with adaptive and optimal neurocontrol schemes. Performances of the neurocontrollers are compared with the conventional PI controllers for damping under different operating conditions for small and large disturbances

Missouri University of Science and Technology (Missouri S&T): Scholars' Mine

Fully Evolvable Optimal Neurofuzzy Controller Using Adaptive Critic Designs

Author: Harley Ronald G.
Mohagheghi Salman
Venayagamoorthy Ganesh K.
Publication venue: Scholars\u27 Mine
Publication date: 01/12/2008
Field of study

A near-optimal neurofuzzy external controller is designed in this paper for a static compensator (STATCOM) in a multimachine power system. The controller provides an auxiliary reference signal for the STATCOM in such a way that it improves the damping of the rotor speed deviations of its neighboring generators. A zero-order Takagi-Sugeno fuzzy rule base constitutes the core of the controller. A heuristic dynamic programming (HDP) based approach is used to further train the controller and enable it to provide nonlinear near-optimal control at different operating conditions of the power system. Based on the connectionist systems theory, the parameters of the neurofuzzy controller, including the membership functions, undergo training. Simulation results are provided that compare the performance of the neurofuzzy controller with and without updating the fuzzy set parameters. Simulation results indicate that updating the membership functions can noticeably improve the performance of the controller and reduce the size of the STATCOM, which leads to lower capital investment

Missouri University of Science and Technology (Missouri S&T): Scholars' Mine

Adaptive Critic Designs Based Coupled Neurocontrollers for a Static Compensator

Author: Harley Ronald G.
Mohagheghi Salman
Venayagamoorthy Ganesh K.
Publication venue: Scholars\u27 Mine
Publication date: 01/01/2006
Field of study

A novel nonlinear optimal neurocontroller for a static compensator (STATCOM) connected to a power system, using artificial neural networks, is presented in this paper. The heuristic dynamic programming (HDP) method, a member of the adaptive critic designs (ACD) family, is used for the design of the STATCOM neurocontroller. The proposed controller is a nonlinear optimal controller that provides coupled control for the line voltage and the dc link voltage regulation loops of the STATCOM. An action dependent approach is used, in which the controller is independent of a model of the network. Moreover, a proportional-integrator approach allows the neurocontroller to deal with the actual signals rather than the deviations. Simulation results are provided to show that the proposed ACD based neurocontroller is more effective in controlling the STATCOM compared to finely tuned conventional PI controllers

Crossref

Missouri University of Science and Technology (Missouri S&T): Scholars' Mine

Adaptive Critic Designs and Their Implementations on Different Neural Network Architectures

Author: Harley Ronald G.
Park Jung-Wook
Venayagamoorthy Ganesh K.
Publication venue: Scholars\u27 Mine
Publication date: 01/01/2003
Field of study

The design of nonlinear optimal neurocontrollers based on the Adaptive Critic Designs (ACDs) family of algorithms has recently attracted interest. This paper presents a summary of these algorithms, and compares their performance when implemented on two different types of artificial neural networks, namely the multilayer perceptron neural network (MLPNN) and the radial basis function neural network (RBFNN). As an example for the application of the ACDs, the control of synchronous generator on an electric power grid is considered and results are presented to compare the different ACD family members and their implementations on different neural network architectures

Missouri University of Science and Technology (Missouri S&T): Scholars' Mine

Optimal Control Synthesis of a Class of Nonlinear Systems Using Single Network Adaptive Critics

Author: Balakrishnan S. N.
Padhi Radhakant
Unnikrishnan Nishant
Publication venue: Scholars\u27 Mine
Publication date: 01/01/2004
Field of study

Adaptive critic (AC) neural network solutions to optimal control designs using dynamic programming has reduced the need of complex computations and storage requirements that typical dynamic programming requires. In this paper, a single network adaptive critic (SNAC) is presented. This approach is applicable to a class of nonlinear systems where the optimal control (stationary) equation is explicitly solvable for control in terms of state and costate variables. The SNAC architecture offers three potential advantages; a simpler architecture, significant savings of computational load and reduction in approximation errors. In order to demonstrate these benefits, a real-life micro-electro-mechanical-system (MEMS) problem has been solved. This demonstrates that the SNAC technique is applicable for complex engineering systems. Both AC and SNAC approaches are compared in terms of some metrics

Missouri University of Science and Technology (Missouri S&T): Scholars' Mine