27,963 research outputs found
Differentiable Optimal Control via Differential Dynamic Programming
Robot design optimization, imitation learning and system identification share
a common problem which requires optimization over robot or task parameters at
the same time as optimizing the robot motion. To solve these problems, we can
use differentiable optimal control for which the gradients of the robot's
motion with respect to the parameters are required. We propose a method to
efficiently compute these gradients analytically via the differential dynamic
programming (DDP) algorithm using sensitivity analysis (SA). We show that we
must include second-order dynamics terms when computing the gradients. However,
we do not need to include them when computing the motion. We validate our
approach on the pendulum and double pendulum systems. Furthermore, we compare
against using the derivatives of the iterative linear quadratic regulator
(iLQR), which ignores these second-order terms everywhere, on a co-design task
for the Kinova arm, where we optimize the link lengths of the robot for a
target reaching task. We show that optimizing using iLQR gradients diverges as
ignoring the second-order dynamics affects the computation of the derivatives.
Instead, optimizing using DDP gradients converges to the same optimum for a
range of initial designs allowing our formulation to scale to complex systems
Robot training using system identification
This paper focuses on developing a formal, theory-based design methodology to generate transparent robot control programs using mathematical functions. The research finds its theoretical roots in robot training and system identification techniques such as Armax (Auto-Regressive Moving Average models with eXogenous inputs) and Narmax (Non-linear Armax). These techniques produce linear and non-linear polynomial functions that model the relationship between a robot’s sensor perception and motor response.
The main benefits of the proposed design methodology, compared to the traditional robot programming techniques are: (i) It is a fast and efficient way of generating robot control code, (ii) The generated robot control programs are transparent mathematical functions that can be used to form hypotheses and theoretical analyses of robot behaviour, and (iii) It requires very little explicit knowledge of robot programming where end-users/programmers who do not have any specialised robot programming skills can nevertheless generate task-achieving sensor-motor couplings.
The nature of this research is concerned with obtaining sensor-motor couplings, be it through human demonstration via the robot, direct human demonstration, or other means. The viability of our methodology has been demonstrated by teaching various mobile robots different sensor-motor tasks such as wall following, corridor passing, door traversal and route learning
Comparative evaluation of approaches in T.4.1-4.3 and working definition of adaptive module
The goal of this deliverable is two-fold: (1) to present and compare different approaches towards learning and encoding movements us- ing dynamical systems that have been developed by the AMARSi partners (in the past during the first 6 months of the project), and (2) to analyze their suitability to be used as adaptive modules, i.e. as building blocks for the complete architecture that will be devel- oped in the project. The document presents a total of eight approaches, in two groups: modules for discrete movements (i.e. with a clear goal where the movement stops) and for rhythmic movements (i.e. which exhibit periodicity). The basic formulation of each approach is presented together with some illustrative simulation results. Key character- istics such as the type of dynamical behavior, learning algorithm, generalization properties, stability analysis are then discussed for each approach. We then make a comparative analysis of the different approaches by comparing these characteristics and discussing their suitability for the AMARSi project
- …