67,904 research outputs found
Chance-Constrained Trajectory Optimization for Safe Exploration and Learning of Nonlinear Systems
Learning-based control algorithms require data collection with abundant
supervision for training. Safe exploration algorithms ensure the safety of this
data collection process even when only partial knowledge is available. We
present a new approach for optimal motion planning with safe exploration that
integrates chance-constrained stochastic optimal control with dynamics learning
and feedback control. We derive an iterative convex optimization algorithm that
solves an \underline{Info}rmation-cost \underline{S}tochastic
\underline{N}onlinear \underline{O}ptimal \underline{C}ontrol problem
(Info-SNOC). The optimization objective encodes both optimal performance and
exploration for learning, and the safety is incorporated as distributionally
robust chance constraints. The dynamics are predicted from a robust regression
model that is learned from data. The Info-SNOC algorithm is used to compute a
sub-optimal pool of safe motion plans that aid in exploration for learning
unknown residual dynamics under safety constraints. A stable feedback
controller is used to execute the motion plan and collect data for model
learning. We prove the safety of rollout from our exploration method and
reduction in uncertainty over epochs, thereby guaranteeing the consistency of
our learning method. We validate the effectiveness of Info-SNOC by designing
and implementing a pool of safe trajectories for a planar robot. We demonstrate
that our approach has higher success rate in ensuring safety when compared to a
deterministic trajectory optimization approach.Comment: Submitted to RA-L 2020, review-
Chance-Constrained Trajectory Optimization for Safe Exploration and Learning of Nonlinear Systems
Learning-based control algorithms require data collection with abundant supervision for training. Safe exploration algorithms ensure the safety of this data collection process even when only partial knowledge is available. We present a new approach for optimal motion planning with safe exploration that integrates chance-constrained stochastic optimal control with dynamics learning and feedback control. We derive an iterative convex optimization algorithm that solves an Information-cost Stochastic Nonlinear Optimal Control problem (Info-SNOC). The optimization objective encodes control cost for performance and exploration cost for learning, and the safety is incorporated as distributionally robust chance constraints. The dynamics are predicted from a robust regression model that is learned from data. The Info-SNOC algorithm is used to compute a sub-optimal pool of safe motion plans that aid in exploration for learning unknown residual dynamics under safety constraints. A stable feedback controller is used to execute the motion plan and collect data for model learning. We prove the safety of rollout from our exploration method and reduction in uncertainty over epochs, thereby guaranteeing the consistency of our learning method. We validate the effectiveness of Info-SNOC by designing and implementing a pool of safe trajectories for a planar robot. We demonstrate that our approach has higher success rate in ensuring safety when compared to a deterministic trajectory optimization approach
Robust Prediction Error Estimation with Monte-Carlo Methodology
In predictive modeling with simulation or machine learning, it is critical to
assess the quality of estimated values through output analysis accurately. In
recent decades output analysis has become enriched with methods that quantify
the impact of input data uncertainty in the model outputs to increase
robustness. However, most developments apply when the input data can be
parametrically parameterized. We propose a unified output analysis framework
for simulation and machine learning outputs through the lens of Monte Carlo
sampling. This framework provides nonparametric quantification of the variance
and bias induced in the outputs with higher-order accuracy. Our new
bias-corrected estimation from the model outputs leverages the extension of
fast iterative bootstrap sampling and higher-order influence functions. For the
scalability of the proposed estimation methods, we devise budget-optimal rules
and leverage control variates for variance reduction. Our numerical results
demonstrate a clear advantage in building better and more robust confidence
intervals for both simulation and machine learning frameworks
Learning Robustness with Bounded Failure: An Iterative MPC Approach
We propose an approach to design a Model Predictive Controller (MPC) for
constrained Linear Time Invariant systems performing an iterative task. The
system is subject to an additive disturbance, and the goal is to learn to
satisfy state and input constraints robustly. Using disturbance measurements
after each iteration, we construct Confidence Support sets, which contain the
true support of the disturbance distribution with a given probability. As more
data is collected, the Confidence Supports converge to the true support of the
disturbance. This enables design of an MPC controller that avoids conservative
estimate of the disturbance support, while simultaneously bounding the
probability of constraint violation. The efficacy of the proposed approach is
then demonstrated with a detailed numerical example.Comment: Added GitHub link to all source code
Robust gradient-based discrete-time iterative learning control algorithms
This paper considers the use of matrix models and the robustness of a gradient-based Iterative Learning Control (ILC) algorithm using both fixed learning gains and gains derived from parameter optimization. The philosophy of the paper is to ensure monotonic convergence with respect to the mean square value of the error time series. The paper provides a complete and rigorous analysis for the systematic use of matrix models in ILC. Matrix models make analysis clearer and provide necessary and sufficient conditions for robust monotonic convergence. They
also permit the construction of sufficient frequency domain conditions for robust monotonic convergence on finite time intervals for both causal and non-causal controller dynamics. The results are compared with recent results for robust inverse-model based ILC algorithms and it is seen
that the algorithm has the potential to improve robustness to high frequency modelling errors provided that resonances within the plant bandwidth have been suppressed by feedback or series compensation
Comparison of different repetitive control architectures: synthesis and comparison. Application to VSI Converters
Repetitive control is one of the most used control approaches to deal with periodic references/disturbances. It owes its properties to the inclusion of an internal model in the controller that corresponds to a periodic signal generator. However, there exist many different ways to include this internal model. This work presents a description of the different schemes by means of which repetitive control can be implemented. A complete analytic analysis and comparison is performed together with controller synthesis guidance. The voltage source inverter controller experimental results are included to illustrative conceptual developmentsPeer ReviewedPostprint (published version
- …