39 research outputs found
Task-Driven Estimation and Control via Information Bottlenecks
Our goal is to develop a principled and general algorithmic framework for
task-driven estimation and control for robotic systems. State-of-the-art
approaches for controlling robotic systems typically rely heavily on accurately
estimating the full state of the robot (e.g., a running robot might estimate
joint angles and velocities, torso state, and position relative to a goal).
However, full state representations are often excessively rich for the specific
task at hand and can lead to significant computational inefficiency and
brittleness to errors in state estimation. In contrast, we present an approach
that eschews such rich representations and seeks to create task-driven
representations. The key technical insight is to leverage the theory of
information bottlenecks}to formalize the notion of a "task-driven
representation" in terms of information theoretic quantities that measure the
minimality of a representation. We propose novel iterative algorithms for
automatically synthesizing (offline) a task-driven representation (given in
terms of a set of task-relevant variables (TRVs)) and a performant control
policy that is a function of the TRVs. We present online algorithms for
estimating the TRVs in order to apply the control policy. We demonstrate that
our approach results in significant robustness to unmodeled measurement
uncertainty both theoretically and via thorough simulation experiments
including a spring-loaded inverted pendulum running to a goal location.Comment: 9 pages, 4 figures, abridged version accepted to ICRA2019;
Incorporates changes in final conference submissio
Robust online motion planning with reachable sets
Thesis (S.M.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer Science, 2013.Cataloged from PDF version of thesis.Includes bibliographical references (p. 51-55).In this thesis we consider the problem of generating motion plans for a nonlinear dynamical system that are guaranteed to succeed despite uncertainty in the environment, parametric model uncertainty, disturbances, and/or errors in state estimation. Furthermore, we consider the case where these plans must be generated online, because constraints such as obstacles in the environment may not be known until they are perceived (with a noisy sensor) at runtime. Previous work on feedback motion planning for nonlinear systems was limited to offline planning due to the computational cost of safety verification. Here we augment the traditional trajectory library approach by designing locally stabilizing controllers for each nominal trajectory in the library and providing guarantees on the resulting closed loop systems. We leverage sums-of-squares programming to design these locally stabilizing controllers by explicitly attempting to minimize the size of the worst case reachable set of the closed-loop system subjected to bounded disturbances and uncertainty. The reachable sets associated with each trajectory in the library can be thought of as "funnels" that the system is guaranteed to remain within. The resulting funnel library is then used to sequentially compose motion plans at runtime while ensuring the safety of the robot. A major advantage of the work presented here is that by explicitly taking into account the effect of uncertainty, the robot can evaluate motion plans based on how vulnerable they are to disturbances. We demonstrate our method on a simulation of a plane flying through a two dimensional forest of polygonal trees with parametric uncertainty and disturbances in the form of a bounded "cross-wind". We further validate our approach by carefully evaluating the guarantees on invariance provided by funnels on two challenging underactuated systems (the "Acrobot" and a small-sized airplane).by Anirudha Majumdar.S.M
Some Applications of Polynomial Optimization in Operations Research and Real-Time Decision Making
We demonstrate applications of algebraic techniques that optimize and certify
polynomial inequalities to problems of interest in the operations research and
transportation engineering communities. Three problems are considered: (i)
wireless coverage of targeted geographical regions with guaranteed signal
quality and minimum transmission power, (ii) computing real-time certificates
of collision avoidance for a simple model of an unmanned vehicle (UV)
navigating through a cluttered environment, and (iii) designing a nonlinear
hovering controller for a quadrotor UV, which has recently been used for load
transportation. On our smaller-scale applications, we apply the sum of squares
(SOS) relaxation and solve the underlying problems with semidefinite
programming. On the larger-scale or real-time applications, we use our recently
introduced "SDSOS Optimization" techniques which result in second order cone
programs. To the best of our knowledge, this is the first study of real-time
applications of sum of squares techniques in optimization and control. No
knowledge in dynamics and control is assumed from the reader
Fundamental Tradeoffs in Learning with Prior Information
We seek to understand fundamental tradeoffs between the accuracy of prior
information that a learner has on a given problem and its learning performance.
We introduce the notion of prioritized risk, which differs from traditional
notions of minimax and Bayes risk by allowing us to study such fundamental
tradeoffs in settings where reality does not necessarily conform to the
learner's prior. We present a general reduction-based approach for extending
classical minimax lower-bound techniques in order to lower bound the
prioritized risk for statistical estimation problems. We also introduce a novel
generalization of Fano's inequality (which may be of independent interest) for
lower bounding the prioritized risk in more general settings involving
unbounded losses. We illustrate the ability of our framework to provide
insights into tradeoffs between prior information and learning performance for
problems in estimation, regression, and reinforcement learning.Comment: Proceedings of the 40th International Conference on Machine Learning,
Honolulu, Hawaii, USA. PMLR 202, 202