13,666 research outputs found
Task-Driven Estimation and Control via Information Bottlenecks
Our goal is to develop a principled and general algorithmic framework for
task-driven estimation and control for robotic systems. State-of-the-art
approaches for controlling robotic systems typically rely heavily on accurately
estimating the full state of the robot (e.g., a running robot might estimate
joint angles and velocities, torso state, and position relative to a goal).
However, full state representations are often excessively rich for the specific
task at hand and can lead to significant computational inefficiency and
brittleness to errors in state estimation. In contrast, we present an approach
that eschews such rich representations and seeks to create task-driven
representations. The key technical insight is to leverage the theory of
information bottlenecks}to formalize the notion of a "task-driven
representation" in terms of information theoretic quantities that measure the
minimality of a representation. We propose novel iterative algorithms for
automatically synthesizing (offline) a task-driven representation (given in
terms of a set of task-relevant variables (TRVs)) and a performant control
policy that is a function of the TRVs. We present online algorithms for
estimating the TRVs in order to apply the control policy. We demonstrate that
our approach results in significant robustness to unmodeled measurement
uncertainty both theoretically and via thorough simulation experiments
including a spring-loaded inverted pendulum running to a goal location.Comment: 9 pages, 4 figures, abridged version accepted to ICRA2019;
Incorporates changes in final conference submissio
Machine Learning for Fluid Mechanics
The field of fluid mechanics is rapidly advancing, driven by unprecedented
volumes of data from field measurements, experiments and large-scale
simulations at multiple spatiotemporal scales. Machine learning offers a wealth
of techniques to extract information from data that could be translated into
knowledge about the underlying fluid mechanics. Moreover, machine learning
algorithms can augment domain knowledge and automate tasks related to flow
control and optimization. This article presents an overview of past history,
current developments, and emerging opportunities of machine learning for fluid
mechanics. It outlines fundamental machine learning methodologies and discusses
their uses for understanding, modeling, optimizing, and controlling fluid
flows. The strengths and limitations of these methods are addressed from the
perspective of scientific inquiry that considers data as an inherent part of
modeling, experimentation, and simulation. Machine learning provides a powerful
information processing framework that can enrich, and possibly even transform,
current lines of fluid mechanics research and industrial applications.Comment: To appear in the Annual Reviews of Fluid Mechanics, 202
Reinforcement learning or active inference?
This paper questions the need for reinforcement learning or control theory when optimising behaviour. We show that it is fairly simple to teach an agent complicated and adaptive behaviours using a free-energy formulation of perception. In this formulation, agents adjust their internal states and sampling of the environment to minimize their free-energy. Such agents learn causal structure in the environment and sample it in an adaptive and self-supervised fashion. This results in behavioural policies that reproduce those optimised by reinforcement learning and dynamic programming. Critically, we do not need to invoke the notion of reward, value or utility. We illustrate these points by solving a benchmark problem in dynamic programming; namely the mountain-car problem, using active perception or inference under the free-energy principle. The ensuing proof-of-concept may be important because the free-energy formulation furnishes a unified account of both action and perception and may speak to a reappraisal of the role of dopamine in the brain
Using humanoid robots to study human behavior
Our understanding of human behavior advances as our humanoid robotics work progresses-and vice versa. This team's work focuses on trajectory formation and planning, learning from demonstration, oculomotor control and interactive behaviors. They are programming robotic behavior based on how we humans “program” behavior in-or train-each other
Recurrence Plots in Nonlinear Time Series Analysis: Free Software
Recurrence plots are graphical devices specially suited to detect hidden dynamical patterns and nonlinearities in data. However, there are few programs available to apply such a mehodology. This paper reviews one of the best free programs to apply nonlinear time series analysis: Visual Recurrence Analysis (VRA). This program is targeted to recurrence analysis and the so-called Recurrence Quantitative Analysis (RQA, the quantitative counterpart of recurrence plots), although it includes many procedures in a friendly visual environment. Comparisons with alternative programs are performed.
- …