303 research outputs found
Minimax Iterative Dynamic Game: Application to Nonlinear Robot Control Tasks
Multistage decision policies provide useful control strategies in
high-dimensional state spaces, particularly in complex control tasks. However,
they exhibit weak performance guarantees in the presence of disturbance, model
mismatch, or model uncertainties. This brittleness limits their use in
high-risk scenarios. We present how to quantify the sensitivity of such
policies in order to inform of their robustness capacity. We also propose a
minimax iterative dynamic game framework for designing robust policies in the
presence of disturbance/uncertainties. We test the quantification hypothesis on
a carefully designed deep neural network policy; we then pose a minimax
iterative dynamic game (iDG) framework for improving policy robustness in the
presence of adversarial disturbances. We evaluate our iDG framework on a
mecanum-wheeled robot, whose goal is to find a ocally robust optimal multistage
policy that achieve a given goal-reaching task. The algorithm is simple and
adaptable for designing meta-learning/deep policies that are robust against
disturbances, model mismatch, or model uncertainties, up to a disturbance
bound. Videos of the results are on the author's website,
http://ecs.utdallas.edu/~opo140030/iros18/iros2018.html, while the codes for
reproducing our experiments are on github,
https://github.com/lakehanne/youbot/tree/rilqg. A self-contained environment
for reproducing our results is on docker,
https://hub.docker.com/r/lakehanne/youbotbuntu14/Comment: 2018 International Conference on Intelligent Robots and System
Humanoid Robot Soccer Locomotion and Kick Dynamics: Open Loop Walking, Kicking and Morphing into Special Motions on the Nao Robot
Striker speed and accuracy in the RoboCup (SPL) international robot soccer league is becoming
increasingly important as the level of play rises. Competition around the ball is now decided in a
matter of seconds. Therefore, eliminating any wasted actions or motions is crucial when attempting to
kick the ball.
It is common to see a discontinuity between walking and kicking where a robot will return to an
initial pose in preparation for the kick action. In this thesis we explore the removal of this behaviour
by developing a transition gait that morphs the walk directly into the kick back swing pose. The
solution presented here is targeted towards the use of the Aldebaran walk for the Nao robot.
The solution we develop involves the design of a central pattern generator to allow for controlled
steps with realtime accuracy, and a phase locked loop method to synchronise with the Aldebaran walk
so that precise step length control can be activated when required. An open loop trajectory mapping
approach is taken to the walk that is stabilized statically through the use of a phase varying joint
holding torque technique. We also examine the basic princples of open loop walking, focussing on the
commonly overlooked frontal plane motion.
The act of kicking itself is explored both analytically and empirically, and solutions are provided
that are versatile and powerful. Included as an appendix, the broader matter of striker behaviour
(process of goal scoring) is reviewed and we present a velocity control algorithm that is very accurate
and efficient in terms of speed of execution
Locomoção de humanoides robusta e versátil baseada em controlo analĂtico e fĂsica residual
Humanoid robots are made to resemble humans but their locomotion
abilities are far from ours in terms of agility and versatility. When humans
walk on complex terrains or face external disturbances, they
combine a set of strategies, unconsciously and efficiently, to regain
stability. This thesis tackles the problem of developing a robust omnidirectional
walking framework, which is able to generate versatile
and agile locomotion on complex terrains. We designed and developed
model-based and model-free walk engines and formulated the
controllers using different approaches including classical and optimal
control schemes and validated their performance through simulations
and experiments. These frameworks have hierarchical structures that
are composed of several layers. These layers are composed of several
modules that are connected together to fade the complexity and
increase the flexibility of the proposed frameworks. Additionally, they
can be easily and quickly deployed on different platforms.
Besides, we believe that using machine learning on top of analytical approaches
is a key to open doors for humanoid robots to step out of laboratories.
We proposed a tight coupling between analytical control and
deep reinforcement learning. We augmented our analytical controller
with reinforcement learning modules to learn how to regulate the walk
engine parameters (planners and controllers) adaptively and generate
residuals to adjust the robot’s target joint positions (residual physics).
The effectiveness of the proposed frameworks was demonstrated and
evaluated across a set of challenging simulation scenarios. The robot
was able to generalize what it learned in one scenario, by displaying
human-like locomotion skills in unforeseen circumstances, even in the
presence of noise and external pushes.Os robĂ´s humanoides sĂŁo feitos para se parecerem com humanos,
mas suas habilidades de locomoção estão longe das nossas em termos
de agilidade e versatilidade. Quando os humanos caminham em
terrenos complexos ou enfrentam distĂşrbios externos combinam diferentes
estratégias, de forma inconsciente e eficiente, para recuperar a
estabilidade. Esta tese aborda o problema de desenvolver um sistema
robusto para andar de forma omnidirecional, capaz de gerar uma locomoção
para robôs humanoides versátil e ágil em terrenos complexos.
Projetámos e desenvolvemos motores de locomoção sem modelos e
baseados em modelos. Formulámos os controladores usando diferentes
abordagens, incluindo esquemas de controlo clássicos e ideais,
e validámos o seu desempenho por meio de simulações e experiências
reais. Estes frameworks têm estruturas hierárquicas compostas por
várias camadas. Essas camadas são compostas por vários módulos
que sĂŁo conectados entre si para diminuir a complexidade e aumentar
a flexibilidade dos frameworks propostos. Adicionalmente, o sistema
pode ser implementado em diferentes plataformas de forma fácil.
Acreditamos que o uso de aprendizagem automática sobre abordagens
analĂticas Ă© a chave para abrir as portas para robĂ´s humanoides
saĂrem dos laboratĂłrios. Propusemos um forte acoplamento entre controlo
analĂtico e aprendizagem profunda por reforço. Expandimos o
nosso controlador analĂtico com mĂłdulos de aprendizagem por reforço
para aprender como regular os parâmetros do motor de caminhada
(planeadores e controladores) de forma adaptativa e gerar resĂduos
para ajustar as posições das juntas alvo do robĂ´ (fĂsica residual). A
eficácia das estruturas propostas foi demonstrada e avaliada em um
conjunto de cenários de simulação desafiadores. O robô foi capaz de
generalizar o que aprendeu em um cenário, exibindo habilidades de
locomoção humanas em circunstâncias imprevistas, mesmo na presença
de ruĂdo e impulsos externos.Programa Doutoral em Informátic
Motion Planning and Control for the Locomotion of Humanoid Robot
This thesis aims to contribute on the motion planning and control problem of the locomotion
of humanoid robots. For the motion planning, various methods were proposed
in different levels of model dependence. First, a model free approach was proposed
which utilizes linear regression to estimate the relationship between foot placement
and moving velocity. The data-based feature makes it quite robust to handle modeling
error and external disturbance. As a generic control philosophy, it can be applied to
various robots with different gaits. To reduce the risk of collecting experimental data
of model-free method, based on the simplified linear inverted pendulum model, the
classic planning method of model predictive control was explored to optimize CoM
trajectory with predefined foot placements or optimize them two together with respect
to the ZMP constraint. Along with elaborately designed re-planning algorithm and
sparse discretization of trajectories, it is fast enough to run in real time and robust
enough to resist external disturbance. Thereafter, nonlinear models are utilized for
motion planning by performing forward simulation iteratively following the multiple
shooting method. A walking pattern is predefined to fix most of the degrees of the
robot, and only one decision variable, foot placement, is left in one motion plane and
therefore able to be solved in milliseconds which is sufficient to run in real time. In
order to track the planned trajectories and prevent the robot from falling over, diverse
control strategies were proposed according to the types of joint actuators. CoM stabilizer
was designed for the robots with position-controlled joints while quasi-static
Cartesian impedance control and optimization-based full body torque control were
implemented for the robots with torque-controlled joints. Various scenarios were set
up to demonstrate the feasibility and robustness of the proposed approaches, like
walking on uneven terrain, walking with narrow feet or straight leg, push recovery
and so on
- …