Search CORE

61 research outputs found

Efficiently Solving Repeated Integer Linear Programming Problems by Learning Solutions of Similar Linear Programming Problems using Boosting Trees

Author: Banerjee Ashis Gopal
Roy Nicholas
Publication venue
Publication date: 01/01/2015
Field of study

It is challenging to obtain online solutions of large-scale integer linear programming (ILP) problems that occur frequently in slightly different forms during planning for autonomous systems. We refer to such ILP problems as repeated ILP problems. The branch-and-bound (BAB) algorithm is commonly used to solve ILP problems, and a significant amount of computation time is expended in solving numerous relaxed linear programming (LP) problems at the nodes of the BAB trees. We observe that the relaxed LP problems, both within a particular BAB tree and across multiple trees for repeated ILP problems, are similar to each other in the sense that they contain almost the same number of constraints, similar objective function and constraint coefficients, and an identical number of decision variables. We present a boosting tree-based regression technique for learning a set of functions that map the objective function and the constraints to the decision variables of such a system of similar LP problems; this enables us to efficiently infer approximately optimal solutions of the repeated ILP problems. We provide theoretical performance guarantees on the predicted values and demonstrate the effectiveness of the algorithm in four representative domains involving a library of benchmark ILP problems, aircraft carrier deck scheduling, vehicle routing, and vehicle control

CiteSeerX

DSpace@MIT

Tailored Presolve Techniques in Branch-and-Bound Method for Fast Mixed-Integer Optimal Control Applications

Author: Di Cairano Stefano
Quirynen Rien
Publication venue
Publication date: 22/11/2022
Field of study

Mixed-integer model predictive control (MI-MPC) can be a powerful tool for modeling hybrid control systems. In case of a linear-quadratic objective in combination with linear or piecewise-linear system dynamics and inequality constraints, MI-MPC needs to solve a mixed-integer quadratic program (MIQP) at each sampling time step. This paper presents a collection of block-sparse presolve techniques to efficiently remove decision variables, and to remove or tighten inequality constraints, tailored to mixed-integer optimal control problems (MIOCP). In addition, we describe a novel heuristic approach based on an iterative presolve algorithm to compute a feasible but possibly suboptimal MIQP solution. We present benchmarking results for a C code implementation of the proposed BB-ASIPM solver, including a branch-and-bound (B&B) method with the proposed tailored presolve techniques and an active-set based interior point method (ASIPM), compared against multiple state-of-the-art MIQP solvers on a case study of motion planning with obstacle avoidance constraints. Finally, we demonstrate the computational performance of the BB-ASIPM solver on the dSPACE Scalexio real-time embedded hardware using a second case study of stabilization for an underactuated cart-pole with soft contacts.Comment: 27 pages, 7 figures, 2 tables, submitted to journal of Optimal Control Applications and Method

arXiv.org e-Print Archive

Neural Networks for Fast Optimisation in Model Predictive Control: A Review

Author: Asadi Houshyar
Gonzalez Camilo
Kooijman Lars
Lim Chee Peng
Publication venue
Publication date: 05/09/2023
Field of study

Model Predictive Control (MPC) is an optimal control algorithm with strong stability and robustness guarantees. Despite its popularity in robotics and industrial applications, the main challenge in deploying MPC is its high computation cost, stemming from the need to solve an optimisation problem at each control interval. There are several methods to reduce this cost. This survey focusses on approaches where a neural network is used to approximate an existing controller. Herein, relevant and unique neural approximation methods for linear, nonlinear, and robust MPC are presented and compared. Comparisons are based on the theoretical guarantees that are preserved, the factor by which the original controller is sped up, and the size of problem that a framework is applicable to. Research contributions include: a taxonomy that organises existing knowledge, a summary of literary gaps, discussion on promising research directions, and simple guidelines for choosing an approximation framework. The main conclusions are that (1) new benchmarking tools are needed to help prove the generalisability and scalability of approximation frameworks, (2) future breakthroughs most likely lie in the development of ties between control and learning, and (3) the potential and applicability of recently developed neural architectures and tools remains unexplored in this field.Comment: 34 pages, 6 figures 3 tables. Submitted to ACM Computing Survey

arXiv.org e-Print Archive

Warm Start of Mixed-Integer Programs for Model Predictive Control of Hybrid Systems

Author: Marcucci Tobia
Tedrake Russ
Publication venue
Publication date: 30/03/2020
Field of study

In hybrid Model Predictive Control (MPC), a Mixed-Integer Quadratic Program (MIQP) is solved at each sampling time to compute the optimal control action. Although these optimizations are generally very demanding, in MPC we expect consecutive problem instances to be nearly identical. This paper addresses the question of how computations performed at one time step can be reused to accelerate (warm start) the solution of subsequent MIQPs. Reoptimization is not a rare practice in integer programming: for small variations of certain problem data, the branch-and-bound algorithm allows an efficient reuse of its search tree and the dual bounds of its leaf nodes. In this paper we extend these ideas to the receding-horizon settings of MPC. The warm-start algorithm we propose copes naturally with arbitrary model errors, has a negligible computational cost, and frequently enables an a-priori pruning of most of the search space. Theoretical considerations and experimental evidence show that the proposed method tends to reduce the combinatorial complexity of the hybrid MPC problem to that of a one-step look-ahead optimization, greatly easing the online computation burden

arXiv.org e-Print Archive

DSpace@MIT

Advances in Polynomial Optimization

Author: González Rodríguez Brais
Publication venue
Publication date: 01/01/2022
Field of study

Polynomial optimization has a wide range of practical applications in fields such as optimal control, energy and water networks, facility location, management science, and finance. It also generalizes relevant optimization problems thoroughly studied in the literature, such as mixed-binary linear optimization, quadratic optimization, and complementarity problems. As finding globally optimal solutions is an extremely challenging task, the development of efficient techniques for solving polynomial optimization problems is of particular relevance. In this thesis we provide a detailed study of different techniques to solve this kind of problems and we introduce some nobel approaches in this field, including the use of statistical learning techniques. Furthermore, we also present a practical application of polynomial optimization to finance and more specifically, portfolio design

Repositorio Institucional da Universidade de Santiago de Compostela

이동블록 및 잔류편차 제거 모델예측제어 기법의 최적성 향상

Author: 손상환
Publication venue: 서울대학교 대학원
Publication date: 01/02/2020
Field of study

학위논문(박사)--서울대학교 대학원 :공과대학 화학생물공학부,2020. 2. 이종민.Model predictive control (MPC) is a receding horizon control which derives finite-horizon optimal solution for current state on-line by solving an optimal control problem. MPC has had a tremendous impact on both industrial and control research areas. There are several outstanding issues in MPC. MPC has to solve the optimization problem within a sampling period so that the reduction of on-line computational complexity is a one of the main research subject in MPC. Another major issue is model-plant mismatch due to the model based predictive approach so that offset-free tracking schemes by compensating model-plant mismatch or unmeasured disturbance has been developed. In this thesis, we focused on the optimality performance of move blocking which fixes the decision variables over arbitrary time intervals to reduce computational load for on-line optimization in MPC and disturbance estimator approach based offset-free MPC which is the most standardly used method to accomplish offset-free tracking in MPC. We improve the optimality performance of move blocked MPC in two ways. The first scheme provides a superior base sequence by linearly interpolating complementary base sequences, and the second scheme provides a proper time-varying blocking structure with semi-explicit approach. Moreover, we improve the optimality performance of offset-free MPC by exploiting learned model-plant mismatch compensating signal from estimated disturbance data. With the proposed schemes, we efficiently improve the optimality performance while guaranteeing the recursive feasibility and closed-loop stability.모델예측제어는 현재 시스템 상태에 대한 유한 구간 최적해를 도출하는 온라인 이동 구간 제어 방식이다. 모델예측제어는 피드백을 통한 공정 동특성과 제약 조건을 효과적으로 반영하는 장점으로 인해 산업 및 제어 연구 분야에 큰 영향을 미쳤다. 이러한 모델예측제어에는 몇 가지 해결되어야 할 문제가 있다. 모델예측제어에서는 샘플링 기간 내에 최적화 문제를 풀어내야 하기 때문에, 온라인 계산 복잡성의 감소가 주요 연구 주제 중 하나로 활발히 연구되고 있다. 또 다른 주요 문제는 모델에 기반한 예측을 이용하는 접근 방식으로 인해 모델-플랜트 불일치로 인한 오차를 해결해야 한다는 점이며, 모델 플랜트 불일치 또는 측정되지 않은 외란을 보상하여 잔류편차 없이 참조신호를 추적하는 연구가 활발히 이루어지고 있다. 이 논문에서는 모델예측제어에서의 온라인 최적화를 위한 계산 부하를 줄이기 위해 임의의 시간 간격에 걸쳐 결정 변수를 고정시키는 이동 블록 전략의 최적성 향상에 중점을 두었으며, 또한 잔류편차를 제거하기 위해 가장 표준적으로 사용되는 외란 추정기를 이용한 잔류편차-제거 모델예측제어 기법의 최적성 향상에 중점을 두었다. 이 논문에서는 이동 블록 모델예측제어의 최적 성능을 향상시키기 위한 두 가지 전략을 제시한다. 첫 번째 전략은 이동 블록 전략에서 일반적으로 고정된 채로 사용되는 기반 시퀀스를 상호 보완적인 두 기반 시퀀스의 선형 보간으로 대체함으로써 보다 우수한 기반 시퀀스를 제공하며, 두 번째 전략은 준-명시적 접근법을 활용하여 현재 시스템 상태에 적절한 시변 블록 구조를 온라인에서 제공한다. 또한, 잔류편차-제거 모델예측제어 기법의 최적 성능을 향상시키기 위해 추정 외란 데이터로부터 학습된 모델-플랜트 불일치 보상 신호를 온라인에서 이용하는 전략을 제안하였다. 제안된 세 가지 기법을 통해 모델예측제어의 반복적 실현가능성과 폐쇄-루프 안정성을 보장하면서 최적 성능을 효율적으로 개선 하였다.1. Introduction 1 2. Move-blocked model predictive control with linear interpolation of base sequences 5 2.1 Introduction 5 2.2 Preliminaries 9 2.2.1 MPC formulation 9 2.2.2 Move blocking 12 2.2.3 Move blocked MPC (MBMPC) 15 2.3 Move blocking schemes 16 2.3.1 Previous solution based offset blocking 17 2.3.2 LQR solution based offset blocking 18 2.4 Interpolated solution based move blocking 20 2.4.1 Interpolated solution based MBMPC 20 2.4.2 QP formulation 26 2.5 Numerical examples 29 2.5.1 Example 1 (Feasible region) 30 2.5.2 Example 2 (Performance in regulation problem) 33 2.5.3 Example 3 (Performance in tracking problem) 36 3. Move-blocked model predictive control with time-varying blocking structure by semi-explicit approach 43 3.1 Introduction 43 3.2 Problem formulation 46 3.3 Move blocked MPC 48 3.3.1 Move blocking scheme 48 3.3.2 Implementation of move blocking 51 3.4 Semi-explicit approach for move blocked MPC 53 3.4.1 Off-line generation of critical region 56 3.4.2 On-line MPC scheme with critical region search 60 3.4.3 Property of semi-explicit move blocked MPC 62 3.5 Numerical examples 70 3.5.1 Example 1 (Regulation problem) 71 3.5.2 Example 2 (Tracking problem) 77 4. Model-plant mismatch learning offset-free model predictive control 83 4.1 Introduction 83 4.2 Offset-free MPC: Disturbance estimator approach 86 4.2.1 Preliminaries 86 4.2.2 Disturbance estimator and controller design 87 4.2.3 Offset-free tracking condition 89 4.3 Model-plant mismatch learning offset-free MPC 91 4.3.1 Model-plant mismatch learning 92 4.3.2 Application of learned model-plant mismatch 97 4.3.3 Robust asymptotic stability of model-plant mismatch learning offset-free MPC 102 4.4 Numerical example 117 4.4.1 System with random set-point 120 4.4.2 Transformed system 125 4.4.3 System with multiple random set-points 128 5. Concluding remarks 134 5.1 Move-blocked model predictive control with linear interpolation of base sequences 134 5.2 Move-blocked model predictive control with time-varying blocking structure by semi-explicit approach 135 5.3 Model-plant mismatch learning offset-free model predictive control 136 5.4 Conclusions 138 5.5 Future work 139 Bibliography 145Docto

SNU Open Repository and Archive

Advances in the Optimization of Energy Systems and Machine Learning Hyperparameters

Author: Tso William Weikang
Publication venue
Publication date: 07/01/2021
Field of study

Intensifying public concern about climate change risks has accelerated the push for more tangible action in the transition toward low-carbon or carbon-neutral energy. Concurrently, the energy industry is also undergoing a digital transformation with the explosion in available data and computational power. To address these challenges, systematic decision-making strategies are necessary to analyze the vast array of technology options and information sources while navigating this energy transition. In this work, mathematical optimization is utilized to answer some of the outstanding issues around designing cleaner processes from resources such as natural gas and renewables, operating the logistics of these energy systems, and statistical modeling from data. First, exploiting natural gas to produce lower emission liquid transportation fuels is investigated through an optimization-based process synthesis. This extends previous studies by incorporating chemical looping as an alternative syngas production method for the first time. Second, a similar process synthesis approach is implemented for the optimal design of a novel biomass-based process that coproduces ammonia and methanol, improving their production flexibility and profit margins. Next, operational difficulties with solar and wind energies due to their temporal intermittency and uneven geographical distribution are tackled with a supply chain optimization model and a clustering decomposition algorithm. The former describes power generation through energy carriers (hydrogen-rich chemicals) connecting resource-dense rural areas to resource-deficient urban centers. Results show the potential of energy carriers for long-term storage. The latter is developed to identify the appropriate number of representative time periods for approximating an optimization problem with time series data, instead of using a full time horizon. This algorithm is applied to the simultaneous design and scheduling of a renewable power system with battery storage. Finally, building machine learning models from data is commonly performed through k-fold cross-validation. From recasting this as a bilevel optimization, the exact solution to hyperparameter optimization is obtainable through parametric programming for machine learning models that are LP/QP. This extends previous results in statistics to a broader class of machine learning models

Texas A&M Repository

Design of multi-parametric NCO-tracking controllers for linear continuous-time systems

Author: Sun Muxin
Publication venue: Chemical Engineering, Imperial College London
Publication date: 01/02/2018
Field of study

Process optimization for industrial applications aims to achieve performance enhancements while satisfying system constraints. A major challenge for any such method lies in the problem of uncertainty stemming from model mismatch and process disturbances. Classical approaches such as model predictive control usually handle the uncertainty by repeatedly solving the optimization problem on-line, which may prove a rather computationally demanding task nonetheless and cause serious delays for fast dynamic systems. Existing approaches for mitigating the on-line computational burden via off-line optimization include multi-parametric programming and NCO-tracking. Multi-parametric programming aims to generate a mapping of control strategies as a function of given parameters; whereas NCO-tracking involves tracking the necessary conditions of optimality (NCOs) based on a precomputed control switching structure, which enables a dynamic real-time optimization problem to be transferred into an on-line tracking problem using a feedback controller. A methodology, called multi-parametric (mp-)NCO-tracking is developed in this thesis, whereby multi-parametric dynamic optimization and NCO-tracking methods are combined into a unified framework. An algorithm for the design of mp-NCO-tracking controllers for continuous-time, linear-quadratic optimal control problems is presented in Chapter 2. The off-line step defines the multi-parametric control structure mapped to given uncertain (measurable) parameters in terms of so-called critical regions and feedback laws. Specifically, each critical region corresponds to a unique control switching structure in terms of the sequence of active constraints. The on-line step involves determining the current critical region once the parameter value has been revealed, and then applying the corresponding feedback control laws in a receding horizon manner. The mp-NCO-tracking approach provides a means for relaxing the invariant switching structure assumption in NCO-tracking by constructing critical regions for various switching structures. Moreover, addressing the problem directly in continuous-time can potentially reduce the number of critical regions compared with standard multi-parametric programming based on a time discretization and a control vector parameterization. The methodology and its benefits are illustrated for a number of simple case studies. To obtain the mathematical representation of the generally nonlinear critical regions, Chapter 3 investigates a machine learning model as a classifier, based on deep neural network. This feed-forward network is selected for its representational power as a universal approximator for arbitrary continuous functions. Here, the classifier takes the unknown parameter as input and maps the corresponding critical regions in terms of their switching structures. An algorithm for training the classifier is presented, which involves generating the training data set, setting up a neural network architecture, and applying optimization based training. By using a Softmax classifier in the output layer of the network, a normalized probability distribution is obtained, which consist of a vector with as many elements as the total number of critical regions, and each element representing the likelihood for a region to be the correct one. The classifier is conveniently embedded into the multi-parametric NCO-tracking controller for choosing the real-time switching structure in on-line control. Lastly, a robustification of the mp-NCO-tracking methodology is developed in Chapter 4, where constraints are guaranteed to be satisfied under all possible uncertainty scenarios, which leads to a min-max formulation. A robust counterpart formulation of the multi-parametric dynamic optimization problem is presented, which considers both additive or multiplicative time-varying disturbances. The approach involves backing-off the path and terminal constraints of the linear-quadratic optimal control problem based on a worst-case uncertainty propagation computed using either interval or ellipsoidal reachability tubes. The uncertain system state is decomposed into a nominal reference and a perturbed component, and a convex enclosure of the reachable set for the perturbed component is precomputed via some auxiliary differential equations. Conservative constraint back-offs are obtained from the precomputed reachability tubes, which enables the controller design procedure in the nominal case to be directly applied for the robust control problem, and to retain the same computational effort as in the nominal case. These developments are demonstrated by numerical case studies, and ways of extending this approach to more general, nonlinear optimal control problems are discussed in Chapter 5.Open Acces

Spiral - Imperial College Digital Repository

모델기반강화학습을이용한공정제어및최적화

Author: 김종우
Publication venue: 서울대학교 대학원
Publication date: 01/02/2020
Field of study

학위논문(박사)--서울대학교 대학원 :공과대학 화학생물공학부,2020. 2. 이종민.순차적 의사결정 문제는 공정 최적화의 핵심 분야 중 하나이다. 이 문제의 수치적 해법 중 가장 많이 사용되는 것은 순방향으로 작동하는 직접법 (direct optimization) 방법이지만, 몇가지 한계점을 지니고 있다. 최적해는 open-loop의 형태를 지니고 있으며, 불확정성이 존재할때 방법론의 수치적 복잡도가 증가한다는 것이다. 동적 계획법 (dynamic programming) 은 이러한 한계점을 근원적으로 해결할 수 있지만, 그동안 공정 최적화에 적극적으로 고려되지 않았던 이유는 동적 계획법의 결과로 얻어진 편미분 방정식 문제가 유한차원 벡터공간이 아닌 무한차원의 함수공간에서 다루어지기 때문이다. 소위 차원의 저주라고 불리는 이 문제를 해결하기 위한 한가지 방법으로서, 샘플을 이용한 근사적 해법에 초점을 둔 강화학습 방법론이 연구되어 왔다. 본 학위논문에서는 강화학습 방법론 중, 공정 최적화에 적합한 모델 기반 강화학습에 대해 연구하고, 이를 공정 최적화의 대표적인 세가지 순차적 의사결정 문제인 스케줄링, 상위단계 최적화, 하위단계 제어에 적용하는 것을 목표로 한다. 이 문제들은 각각 부분관측 마르코프 결정 과정 (partially observable Markov decision process), 제어-아핀 상태공간 모델 (control-affine state space model), 일반적 상태공간 모델 (general state space model)로 모델링된다. 또한 각 수치적 모델들을 해결하기 위해 point based value iteration (PBVI), globalized dual heuristic programming (GDHP), and differential dynamic programming (DDP)로 불리는 방법들을 도입하였다. 이 세가지 문제와 방법론에서 제시된 특징들을 다음과 같이 요약할 수 있다: 첫번째로, 스케줄링 문제에서 closed-loop 피드백 형태의 해를 제시할 수 있었다. 이는 기존 직접법에서 얻을 수 없었던 형태로서, 강화학습의 강점을 부각할 수 있는 측면이라 생각할 수 있다. 두번째로 고려한 하위단계 제어 문제에서, 동적 계획법의 무한차원 함수공간 최적화 문제를 함수 근사 방법을 통해 유한차원 벡터공간 최적화 문제로 완화할 수 있는 방법을 도입하였다. 특히, 심층 신경망을 이용하여 함수 근사를 하였고, 이때 발생하는 여러가지 장점과 수렴 해석 결과를 본 학위논문에 실었다. 마지막 문제는 상위 단계 동적 최적화 문제이다. 동적 최적화 문제에서 발생하는 제약 조건하에서 강화학습을 수행하기 위해, 원-쌍대 미분동적 계획법 (primal-dual DDP) 방법론을 새로 제안하였다. 앞서 설명한 세가지 문제에 적용된 방법론을 검증하고, 동적 계획법이 직접법에 비견될 수 있는 방법론이라는 주장을 실증하기 위해 여러가지 공정 예제를 실었다.Sequential decision making problem is a crucial technology for plant-wide process optimization. While the dominant numerical method is the forward-in-time direct optimization, it is limited to the open-loop solution and has difficulty in considering the uncertainty. Dynamic programming method complements the limitations, nonetheless associated functional optimization suffers from the curse-of-dimensionality. The sample-based approach for approximating the dynamic programming, referred to as reinforcement learning (RL) can resolve the issue and investigated throughout this thesis. The method that accounts for the system model explicitly is in particular interest. The model-based RL is exploited to solve the three representative sequential decision making problems; scheduling, supervisory optimization, and regulatory control. The problems are formulated with partially observable Markov decision process, control-affine state space model, and general state space model, and associated model-based RL algorithms are point based value iteration (PBVI), globalized dual heuristic programming (GDHP), and differential dynamic programming (DDP), respectively. The contribution for each problem can be written as follows: First, for the scheduling problem, we developed the closed-loop feedback scheme which highlights the strength compared to the direct optimization method. In the second case, the regulatory control problem is tackled by the function approximation method which relaxes the functional optimization to the finite dimensional vector space optimization. Deep neural networks (DNNs) is utilized as the approximator, and the advantages as well as the convergence analysis is performed in the thesis. Finally, for the supervisory optimization problem, we developed the novel constraint RL framework that uses the primal-dual DDP method. Various illustrative examples are demonstrated to validate the developed model-based RL algorithms and to support the thesis statement on which the dynamic programming method can be considered as a complementary method for direct optimization method.1. Introduction 1 1.1 Motivation and previous work 1 1.2 Statement of contributions 9 1.3 Outline of the thesis 11 2. Background and preliminaries 13 2.1 Optimization problem formulation and the principle of optimality 13 2.1.1 Markov decision process 15 2.1.2 State space model 19 2.2 Overview of the developed RL algorithms 28 2.2.1 Point based value iteration 28 2.2.2 Globalized dual heuristic programming 29 2.2.3 Differential dynamic programming 32 3. A POMDP framework for integrated scheduling of infrastructure maintenance and inspection 35 3.1 Introduction 35 3.2 POMDP solution algorithm 38 3.2.1 General point based value iteration 38 3.2.2 GapMin algorithm 46 3.2.3 Receding horizon POMDP 49 3.3 Problem formulation for infrastructure scheduling 54 3.3.1 State 56 3.3.2 Maintenance and inspection actions 57 3.3.3 State transition function 61 3.3.4 Cost function 67 3.3.5 Observation set and observation function 68 3.3.6 State augmentation 69 3.4 Illustrative example and simulation result 69 3.4.1 Structural point for the analysis of a high dimensional belief space 72 3.4.2 Infinite horizon policy under the natural deterioration process 72 3.4.3 Receding horizon POMDP 79 3.4.4 Validation of POMDP policy via Monte Carlo simulation 83 4. A model-based deep reinforcement learning method applied to finite-horizon optimal control of nonlinear control-affine system 88 4.1 Introduction 88 4.2 Function approximation and learning with deep neural networks 91 4.2.1 GDHP with a function approximator 91 4.2.2 Stable learning of DNNs 96 4.2.3 Overall algorithm 103 4.3 Results and discussions 107 4.3.1 Example 1: Semi-batch reactor 107 4.3.2 Example 2: Diffusion-Convection-Reaction (DCR) process 120 5. Convergence analysis of the model-based deep reinforcement learning for optimal control of nonlinear control-affine system 126 5.1 Introduction 126 5.2 Convergence proof of globalized dual heuristic programming (GDHP) 128 5.3 Function approximation with deep neural networks 137 5.3.1 Function approximation and gradient descent learning 137 5.3.2 Forward and backward propagations of DNNs 139 5.4 Convergence analysis in the deep neural networks space 141 5.4.1 Lyapunov analysis of the neural network parameter errors 141 5.4.2 Lyapunov analysis of the closed-loop stability 150 5.4.3 Overall Lyapunov function 152 5.5 Simulation results and discussions 157 5.5.1 System description 158 5.5.2 Algorithmic settings 160 5.5.3 Control result 161 6. Primal-dual differential dynamic programming for constrained dynamic optimization of continuous system 170 6.1 Introduction 170 6.2 Primal-dual differential dynamic programming for constrained dynamic optimization 172 6.2.1 Augmented Lagrangian method 172 6.2.2 Primal-dual differential dynamic programming algorithm 175 6.2.3 Overall algorithm 179 6.3 Results and discussions 179 7. Concluding remarks 186 7.1 Summary of the contributions 187 7.2 Future works 189 Bibliography 192Docto

SNU Open Repository and Archive