77 research outputs found
Safe Controller Optimization for Quadrotors with Gaussian Processes
One of the most fundamental problems when designing controllers for dynamic
systems is the tuning of the controller parameters. Typically, a model of the
system is used to obtain an initial controller, but ultimately the controller
parameters must be tuned manually on the real system to achieve the best
performance. To avoid this manual tuning step, methods from machine learning,
such as Bayesian optimization, have been used. However, as these methods
evaluate different controller parameters on the real system, safety-critical
system failures may happen. In this paper, we overcome this problem by
applying, for the first time, a recently developed safe optimization algorithm,
SafeOpt, to the problem of automatic controller parameter tuning. Given an
initial, low-performance controller, SafeOpt automatically optimizes the
parameters of a control law while guaranteeing safety. It models the underlying
performance measure as a Gaussian process and only explores new controller
parameters whose performance lies above a safe performance threshold with high
probability. Experimental results on a quadrotor vehicle indicate that the
proposed method enables fast, automatic, and safe optimization of controller
parameters without human intervention.Comment: IEEE International Conference on Robotics and Automation, 2016. 6
pages, 4 figures. A video of the experiments can be found at
http://tiny.cc/icra16_video . A Python implementation of the algorithm is
available at https://github.com/befelix/SafeOp
On the Construction of Safe Controllable Regions for Affine Systems with Applications to Robotics
This paper studies the problem of constructing in-block controllable (IBC)
regions for affine systems. That is, we are concerned with constructing regions
in the state space of affine systems such that all the states in the interior
of the region are mutually accessible through the region's interior by applying
uniformly bounded inputs. We first show that existing results for checking
in-block controllability on given polytopic regions cannot be easily extended
to address the question of constructing IBC regions. We then explore the
geometry of the problem to provide a computationally efficient algorithm for
constructing IBC regions. We also prove the soundness of the algorithm. We then
use the proposed algorithm to construct safe speed profiles for different
robotic systems, including fully-actuated robots, ground robots modeled as
unicycles with acceleration limits, and unmanned aerial vehicles (UAVs).
Finally, we present several experimental results on UAVs to verify the
effectiveness of the proposed algorithm. For instance, we use the proposed
algorithm for real-time collision avoidance for UAVs.Comment: 17 pages, 18 figures, under review for publication in Automatic
Multi-Robot Transfer Learning: A Dynamical System Perspective
Multi-robot transfer learning allows a robot to use data generated by a
second, similar robot to improve its own behavior. The potential advantages are
reducing the time of training and the unavoidable risks that exist during the
training phase. Transfer learning algorithms aim to find an optimal transfer
map between different robots. In this paper, we investigate, through a
theoretical study of single-input single-output (SISO) systems, the properties
of such optimal transfer maps. We first show that the optimal transfer learning
map is, in general, a dynamic system. The main contribution of the paper is to
provide an algorithm for determining the properties of this optimal dynamic map
including its order and regressors (i.e., the variables it depends on). The
proposed algorithm does not require detailed knowledge of the robots' dynamics,
but relies on basic system properties easily obtainable through simple
experimental tests. We validate the proposed algorithm experimentally through
an example of transfer learning between two different quadrotor platforms.
Experimental results show that an optimal dynamic map, with correct properties
obtained from our proposed algorithm, achieves 60-70% reduction of transfer
learning error compared to the cases when the data is directly transferred or
transferred using an optimal static map.Comment: 7 pages, 6 figures, accepted at the 2017 IEEE/RSJ International
Conference on Intelligent Robots and System
- …