36,115 research outputs found
Chance-Constrained Trajectory Optimization for Safe Exploration and Learning of Nonlinear Systems
Learning-based control algorithms require data collection with abundant
supervision for training. Safe exploration algorithms ensure the safety of this
data collection process even when only partial knowledge is available. We
present a new approach for optimal motion planning with safe exploration that
integrates chance-constrained stochastic optimal control with dynamics learning
and feedback control. We derive an iterative convex optimization algorithm that
solves an \underline{Info}rmation-cost \underline{S}tochastic
\underline{N}onlinear \underline{O}ptimal \underline{C}ontrol problem
(Info-SNOC). The optimization objective encodes both optimal performance and
exploration for learning, and the safety is incorporated as distributionally
robust chance constraints. The dynamics are predicted from a robust regression
model that is learned from data. The Info-SNOC algorithm is used to compute a
sub-optimal pool of safe motion plans that aid in exploration for learning
unknown residual dynamics under safety constraints. A stable feedback
controller is used to execute the motion plan and collect data for model
learning. We prove the safety of rollout from our exploration method and
reduction in uncertainty over epochs, thereby guaranteeing the consistency of
our learning method. We validate the effectiveness of Info-SNOC by designing
and implementing a pool of safe trajectories for a planar robot. We demonstrate
that our approach has higher success rate in ensuring safety when compared to a
deterministic trajectory optimization approach.Comment: Submitted to RA-L 2020, review-
A Distributed Model Predictive Control Framework for Road-Following Formation Control of Car-like Vehicles (Extended Version)
This work presents a novel framework for the formation control of multiple
autonomous ground vehicles in an on-road environment. Unique challenges of this
problem lie in 1) the design of collision avoidance strategies with obstacles
and with other vehicles in a highly structured environment, 2) dynamic
reconfiguration of the formation to handle different task specifications. In
this paper, we design a local MPC-based tracking controller for each individual
vehicle to follow a reference trajectory while satisfying various constraints
(kinematics and dynamics, collision avoidance, \textit{etc.}). The reference
trajectory of a vehicle is computed from its leader's trajectory, based on a
pre-defined formation tree. We use logic rules to organize the collision
avoidance behaviors of member vehicles. Moreover, we propose a methodology to
safely reconfigure the formation on-the-fly. The proposed framework has been
validated using high-fidelity simulations.Comment: Extended version of the conference paper submission on ICARCV'1
Bayesian model predictive control: Efficient model exploration and regret bounds using posterior sampling
Tight performance specifications in combination with operational constraints
make model predictive control (MPC) the method of choice in various industries.
As the performance of an MPC controller depends on a sufficiently accurate
objective and prediction model of the process, a significant effort in the MPC
design procedure is dedicated to modeling and identification. Driven by the
increasing amount of available system data and advances in the field of machine
learning, data-driven MPC techniques have been developed to facilitate the MPC
controller design. While these methods are able to leverage available data,
they typically do not provide principled mechanisms to automatically trade off
exploitation of available data and exploration to improve and update the
objective and prediction model. To this end, we present a learning-based MPC
formulation using posterior sampling techniques, which provides finite-time
regret bounds on the learning performance while being simple to implement using
off-the-shelf MPC software and algorithms. The performance analysis of the
method is based on posterior sampling theory and its practical efficiency is
illustrated using a numerical example of a highly nonlinear dynamical
car-trailer system
Decentralized MPC based Obstacle Avoidance for Multi-Robot Target Tracking Scenarios
In this work, we consider the problem of decentralized multi-robot target
tracking and obstacle avoidance in dynamic environments. Each robot executes a
local motion planning algorithm which is based on model predictive control
(MPC). The planner is designed as a quadratic program, subject to constraints
on robot dynamics and obstacle avoidance. Repulsive potential field functions
are employed to avoid obstacles. The novelty of our approach lies in embedding
these non-linear potential field functions as constraints within a convex
optimization framework. Our method convexifies non-convex constraints and
dependencies, by replacing them as pre-computed external input forces in robot
dynamics. The proposed algorithm additionally incorporates different methods to
avoid field local minima problems associated with using potential field
functions in planning. The motion planner does not enforce predefined
trajectories or any formation geometry on the robots and is a comprehensive
solution for cooperative obstacle avoidance in the context of multi-robot
target tracking. We perform simulation studies in different environmental
scenarios to showcase the convergence and efficacy of the proposed algorithm.
Video of simulation studies: \url{https://youtu.be/umkdm82Tt0M
Sparse and Constrained Stochastic Predictive Control for Networked Systems
This article presents a novel class of control policies for networked control
of Lyapunov-stable linear systems with bounded inputs. The control channel is
assumed to have i.i.d. Bernoulli packet dropouts and the system is assumed to
be affected by additive stochastic noise. Our proposed class of policies is
affine in the past dropouts and saturated values of the past disturbances. We
further consider a regularization term in a quadratic performance index to
promote sparsity in control. We demonstrate how to augment the underlying
optimization problem with a constant negative drift constraint to ensure
mean-square boundedness of the closed-loop states, yielding a convex quadratic
program to be solved periodically online. The states of the closed-loop plant
under the receding horizon implementation of the proposed class of policies are
mean square bounded for any positive bound on the control and any non-zero
probability of successful transmission
- …