Search CORE

150 research outputs found

Stability Verification of Neural Network Controllers using Mixed-Integer Programming

Author: Jones Colin N.
Kuhn Daniel
Schwan Roland
Publication venue
Publication date: 31/05/2023
Field of study

We propose a framework for the stability verification of Mixed-Integer Linear Programming (MILP) representable control policies. This framework compares a fixed candidate policy, which admits an efficient parameterization and can be evaluated at a low computational cost, against a fixed baseline policy, which is known to be stable but expensive to evaluate. We provide sufficient conditions for the closed-loop stability of the candidate policy in terms of the worst-case approximation error with respect to the baseline policy, and we show that these conditions can be checked by solving a Mixed-Integer Quadratic Program (MIQP). Additionally, we demonstrate that an outer and inner approximation of the stability region of the candidate policy can be computed by solving an MILP. The proposed framework is sufficiently general to accommodate a broad range of candidate policies including ReLU Neural Networks (NNs), optimal solution maps of parametric quadratic programs, and Model Predictive Control (MPC) policies. We also present an open-source toolbox in Python based on the proposed framework, which allows for the easy verification of custom NN architectures and MPC formulations. We showcase the flexibility and reliability of our framework in the context of a DC-DC power converter case study and investigate its computational complexity

arXiv.org e-Print Archive

Robust online motion planning with reachable sets

Author: Anirudha Majumdar
Anirudha Majumdar
Russ Tedrake
Publication venue: Massachusetts Institute of Technology
Publication date: 01/01/2013
Field of study

Thesis (S.M.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer Science, 2013.Cataloged from PDF version of thesis.Includes bibliographical references (p. 51-55).In this thesis we consider the problem of generating motion plans for a nonlinear dynamical system that are guaranteed to succeed despite uncertainty in the environment, parametric model uncertainty, disturbances, and/or errors in state estimation. Furthermore, we consider the case where these plans must be generated online, because constraints such as obstacles in the environment may not be known until they are perceived (with a noisy sensor) at runtime. Previous work on feedback motion planning for nonlinear systems was limited to offline planning due to the computational cost of safety verification. Here we augment the traditional trajectory library approach by designing locally stabilizing controllers for each nominal trajectory in the library and providing guarantees on the resulting closed loop systems. We leverage sums-of-squares programming to design these locally stabilizing controllers by explicitly attempting to minimize the size of the worst case reachable set of the closed-loop system subjected to bounded disturbances and uncertainty. The reachable sets associated with each trajectory in the library can be thought of as "funnels" that the system is guaranteed to remain within. The resulting funnel library is then used to sequentially compose motion plans at runtime while ensuring the safety of the robot. A major advantage of the work presented here is that by explicitly taking into account the effect of uncertainty, the robot can evaluate motion plans based on how vulnerable they are to disturbances. We demonstrate our method on a simulation of a plane flying through a two dimensional forest of polygonal trees with parametric uncertainty and disturbances in the form of a bounded "cross-wind". We further validate our approach by carefully evaluating the guarantees on invariance provided by funnels on two challenging underactuated systems (the "Acrobot" and a small-sized airplane).by Anirudha Majumdar.S.M

CiteSeerX

DSpace@MIT

Correct-By-Construction Control Synthesis for Systems with Disturbance and Uncertainty

Author: Chen Yuxiao
Publication venue
Publication date: 01/01/2018
Field of study

This dissertation focuses on correct-by-construction control synthesis for Cyber-Physical Systems (CPS) under model uncertainty and disturbance. CPSs are systems that interact with the physical world and perform complicated dynamic tasks where safety is often the overriding factor. Correct-by-construction control synthesis is a concept that provides formal performance guarantees to closed-loop systems by rigorous mathematic reasoning. Since CPSs interact with the environment, disturbance and modeling uncertainty are critical to the success of the control synthesis. Disturbance and uncertainty may come from a variety of sources, such as exogenous disturbance, the disturbance caused by co-existing controllers and modeling uncertainty. To better accommodate the different types of disturbance and uncertainty, the verification and control synthesis methods must be chosen accordingly. Four approaches are included in this dissertation. First, to deal with exogenous disturbance, a polar algorithm is developed to compute an avoidable set for obstacle avoidance. Second, a supervised learning based method is proposed to design a good student controller that has safety built-in and rarely triggers the intervention of the supervisory controller, thus targeting the design of the student controller. Third, to deal with the disturbance caused by co-existing controllers, a Lyapunov verification method is proposed to formally verify the safety of coexisting controllers while respecting the confidentiality requirement. Finally, a data-driven approach is proposed to deal with model uncertainty. A minimal robust control invariant set is computed for an uncertain dynamic system without a given model by first identifying the set of admissible models and then simultaneously computing the invariant set while selecting the optimal model. The proposed methods are applicable to many real-world applications and reflect the notion of using the structure of the system to achieve performance guarantees without being overly conservative.PHDMechanical EngineeringUniversity of Michigan, Horace H. Rackham School of Graduate Studieshttps://deepblue.lib.umich.edu/bitstream/2027.42/145933/1/chenyx_1.pd

Deep Blue Documents at the University of Michigan

Learning a feasible and stabilizing explicit model predictive control law by robust optimization

Author: Domahidi Alex
Jones Colin
Morari Manfred
Zeilinger Melanie Nicole
Publication venue
Publication date: 24/10/2011
Field of study

Fast model predictive control on embedded sys- tems has been successfully applied to plants with microsecond sampling times employing a precomputed state-to-input map. However, the complexity of this so-called explicit MPC can be prohibitive even for low-dimensional systems. In this pa- per, we introduce a new synthesis method for low-complexity suboptimal MPC controllers based on function approximation from randomly chosen point-wise sample values. In addition to standard machine learning algorithms formulated as convex programs, we provide sufficient conditions on the learning algo- rithm in the form of tractable convex constraints that guarantee input and state constraint satisfaction, recursive feasibility and stability of the closed loop system. The resulting control law can be fully parallelized, which renders the approach particularly suitable for highly concurrent embedded platforms such as FPGAs. A numerical example shows the effectiveness of the proposed method

Infoscience - École polytechnique fédérale de Lausanne

Crossref

Approximate Dynamic Programming for Constrained Piecewise Affine Systems with Stability and Safety Guarantees

Author: Boom Ton van den
De Schutter Bart
He Kanghui
Shi Shengling
Publication venue
Publication date: 27/06/2023
Field of study

Infinite-horizon optimal control of constrained piecewise affine (PWA) systems has been approximately addressed by hybrid model predictive control (MPC), which, however, has computational limitations, both in offline design and online implementation. In this paper, we consider an alternative approach based on approximate dynamic programming (ADP), an important class of methods in reinforcement learning. We accommodate non-convex union-of-polyhedra state constraints and linear input constraints into ADP by designing PWA penalty functions. PWA function approximation is used, which allows for a mixed-integer encoding to implement ADP. The main advantage of the proposed ADP method is its online computational efficiency. Particularly, we propose two control policies, which lead to solving a smaller-scale mixed-integer linear program than conventional hybrid MPC, or a single convex quadratic program, depending on whether the policy is implicitly determined online or explicitly computed offline. We characterize the stability and safety properties of the closed-loop systems, as well as the sub-optimality of the proposed policies, by quantifying the approximation errors of value functions and policies. We also develop an offline mixed-integer linear programming-based method to certify the reliability of the proposed method. Simulation results on an inverted pendulum with elastic walls and on an adaptive cruise control problem validate the control performance in terms of constraint satisfaction and CPU time

arXiv.org e-Print Archive

Recommended from our members

Towards a safe and responsive control framework for human-centered robots

Author: He Binghan
Publication venue
Publication date: 25/06/2021
Field of study

Human-centered robots are a specific kind of service robot, which interact with humans physically or cognitively and help humans with tasks in uncertain environments. They can be humanoid robots, exoskeletons, or manipulators and mobile platforms that provide us good services. However, human-centered robots are still not perfect enough for us to use nowadays. On the one hand, human-centered robots are still slow and inefficient for their tasks because the human inputs and dynamics that they react to are uncertain, immeasurable, or even completely unknown. On the other hand, human-centered robots face much more complicated safety requirements than other kinds of robots because humans are dynamic and vulnerable during physical human-robot interaction. To resolve these issues of human-centered robots, the work in this dissertation explores new models for reducing human uncertainty and new control algorithms for improving safety warranty. The first half of this dissertation introduces a complex stiffness model for describing the uncertain human impedance. The discovery of this new model is motivated to explain the observation of a consistent damping ratio of a human versus different environmental dynamics. It replaces the linear damping term in a conventional mass-spring-damping model with a hysteretic damping term, an imaginary value in the frequency domain. Because of the correlation between the stiffness term and the newly discovered hysteretic damping term in the complex stiffness model, we can significantly reduce the human impedance uncertainty. Based on the complex stiffness model, we can adopt nonlinear control strategies for improving the responsiveness and the human-friendliness of human-centered robots. The second half of this dissertation introduces the concept of a barrier pair, which consists of a barrier function and a controller for the safety verification and warranty of a human-centered robot. We obtain a barrier pair by solving an optimization problem subject to a series of linear matrix inequalities representing the state-space, input, and stability constraints of a human-centered robot. By incorporating sampling-based methods into the synthesis of barrier pairs, human-centered robots can guarantee safe operation with non-convex state-space constraints. The sampling-based barrier pair method helps us construct a control framework of human-robot shared autonomy. A human-centered robot in this control framework uses an inference of a human's objective to figure out how to assist the human and prevent the human from potential accidents.Mechanical Engineerin

Texas ScholarWorks