36,257 research outputs found
Hybrid control trajectory optimization under uncertainty
Trajectory optimization is a fundamental problem in robotics. While optimization of continuous control trajectories is well developed, many applications require both discrete and continuous, i.e. hybrid controls. Finding an optimal sequence of hybrid controls is challenging due to the exponential explosion of discrete control combinations. Our method, based on Differential Dynamic Programming (DDP), circumvents this problem by incorporating discrete actions inside DDP: we first optimize continuous mixtures of discrete actions, and, subsequently force the mixtures into fully discrete actions. Moreover, we show how our approach can be extended to partially observable Markov decision processes (POMDPs) for trajectory planning under uncertainty. We validate the approach in a car driving problem where the robot has to switch discrete gears and in a box pushing application where the robot can switch the side of the box to push. The pose and the friction parameters of the pushed box are initially unknown and only indirectly observable
Motion Planning of Uncertain Ordinary Differential Equation Systems
This work presents a novel motion planning framework, rooted in nonlinear programming theory, that treats uncertain fully and under-actuated dynamical systems described by ordinary differential equations. Uncertainty in multibody dynamical systems comes from various sources, such as: system parameters, initial conditions, sensor and actuator noise, and external forcing. Treatment of uncertainty in design is of paramount practical importance because all real-life systems are affected by it, and poor robustness and suboptimal performance result if it’s not accounted for in a given design. In this work uncertainties are modeled using Generalized Polynomial Chaos and are solved quantitatively using a least-square collocation method. The computational efficiency of this approach enables the inclusion of uncertainty statistics in the nonlinear programming optimization process. As such, the proposed framework allows the user to pose, and answer, new design questions related to uncertain dynamical systems.
Specifically, the new framework is explained in the context of forward, inverse, and hybrid dynamics formulations. The forward dynamics formulation, applicable to both fully and under-actuated systems, prescribes deterministic actuator inputs which yield uncertain state trajectories. The inverse dynamics formulation is the dual to the forward dynamic, and is only applicable to fully-actuated systems; deterministic state trajectories are prescribed and yield uncertain actuator inputs. The inverse dynamics formulation is more computationally efficient as it requires only algebraic evaluations and completely avoids numerical integration. Finally, the hybrid dynamics formulation is applicable to under-actuated systems where it leverages the benefits of inverse dynamics for actuated joints and forward dynamics for unactuated joints; it prescribes actuated state and unactuated input trajectories which yield uncertain unactuated states and actuated inputs.
The benefits of the ability to quantify uncertainty when planning the motion of multibody dynamic systems are illustrated through several case-studies. The resulting designs determine optimal motion plans—subject to deterministic and statistical constraints—for all possible systems within the probability space
Optimal Event-Driven Multi-Agent Persistent Monitoring of a Finite Set of Targets
We consider the problem of controlling the movement of multiple cooperating
agents so as to minimize an uncertainty metric associated with a finite number
of targets. In a one-dimensional mission space, we adopt an optimal control
framework and show that the solution is reduced to a simpler parametric
optimization problem: determining a sequence of locations where each agent may
dwell for a finite amount of time and then switch direction. This amounts to a
hybrid system which we analyze using Infinitesimal Perturbation Analysis (IPA)
to obtain a complete on-line solution through an event-driven gradient-based
algorithm which is also robust with respect to the uncertainty model used. The
resulting controller depends on observing the events required to excite the
gradient-based algorithm, which cannot be guaranteed. We solve this problem by
proposing a new metric for the objective function which creates a potential
field guaranteeing that gradient values are non-zero. This approach is compared
to an alternative graph-based task scheduling algorithm for determining an
optimal sequence of target visits. Simulation examples are included to
demonstrate the proposed methods.Comment: 12 pages full version, IEEE Conference on Decision and Control, 201
Reactive Planar Manipulation with Convex Hybrid MPC
This paper presents a reactive controller for planar manipulation tasks that
leverages machine learning to achieve real-time performance. The approach is
based on a Model Predictive Control (MPC) formulation, where the goal is to
find an optimal sequence of robot motions to achieve a desired object motion.
Due to the multiple contact modes associated with frictional interactions, the
resulting optimization program suffers from combinatorial complexity when
tasked with determining the optimal sequence of modes.
To overcome this difficulty, we formulate the search for the optimal mode
sequences offline, separately from the search for optimal control inputs
online. Using tools from machine learning, this leads to a convex hybrid MPC
program that can be solved in real-time. We validate our algorithm on a planar
manipulation experimental setup where results show that the convex hybrid MPC
formulation with learned modes achieves good closed-loop performance on a
trajectory tracking problem
Online Learning for Ground Trajectory Prediction
This paper presents a model based on an hybrid system to numerically simulate
the climbing phase of an aircraft. This model is then used within a trajectory
prediction tool. Finally, the Covariance Matrix Adaptation Evolution Strategy
(CMA-ES) optimization algorithm is used to tune five selected parameters, and
thus improve the accuracy of the model. Incorporated within a trajectory
prediction tool, this model can be used to derive the order of magnitude of the
prediction error over time, and thus the domain of validity of the trajectory
prediction. A first validation experiment of the proposed model is based on the
errors along time for a one-time trajectory prediction at the take off of the
flight with respect to the default values of the theoretical BADA model. This
experiment, assuming complete information, also shows the limit of the model. A
second experiment part presents an on-line trajectory prediction, in which the
prediction is continuously updated based on the current aircraft position. This
approach raises several issues, for which improvements of the basic model are
proposed, and the resulting trajectory prediction tool shows statistically
significantly more accurate results than those of the default model.Comment: SESAR 2nd Innovation Days (2012
From Uncertainty Data to Robust Policies for Temporal Logic Planning
We consider the problem of synthesizing robust disturbance feedback policies
for systems performing complex tasks. We formulate the tasks as linear temporal
logic specifications and encode them into an optimization framework via
mixed-integer constraints. Both the system dynamics and the specifications are
known but affected by uncertainty. The distribution of the uncertainty is
unknown, however realizations can be obtained. We introduce a data-driven
approach where the constraints are fulfilled for a set of realizations and
provide probabilistic generalization guarantees as a function of the number of
considered realizations. We use separate chance constraints for the
satisfaction of the specification and operational constraints. This allows us
to quantify their violation probabilities independently. We compute disturbance
feedback policies as solutions of mixed-integer linear or quadratic
optimization problems. By using feedback we can exploit information of past
realizations and provide feasibility for a wider range of situations compared
to static input sequences. We demonstrate the proposed method on two robust
motion-planning case studies for autonomous driving
- …