39 research outputs found

    Nonlinear Model Predictive Control for Motion Generation of Humanoids

    The aim of this work is to investigate and develop numerical methods for the motion generation of humanoid robots based on nonlinear model predictive control. Starting from the modeling of humanoids as complex multibody models that are both constrained by unilateral contact conditions and underactuated by formulation, motion generation is formulated as an optimal control problem. In this work, numerical extensions based on the principles of automatic differentiation are derived for recursive algorithms that allow efficient evaluation of the dynamic quantities of the above multibody formulation, so that both the nominal quantities and their first derivatives can be evaluated efficiently. Based on these ideas, extensions for evaluating the contact dynamics and computing the contact impulse are proposed. The real-time feasibility of computing control responses depends strongly on the complexity of the multibody formulation chosen for motion generation and on the available computing power. To enable an optimal trade-off, this work investigates possible reductions of the multibody dynamics on the one hand and develops tailored numerical methods to achieve real-time control on the other. To this end, two reduced models are derived: a nonlinear extension of the linear inverted pendulum model and a reduced model variant based on centroidal multibody dynamics. Furthermore, a control architecture for whole-body motion generation is presented, whose main components are, in each case, a specially discretized nonlinear model predictive control problem and a tailored optimization method. The real-time feasibility of the approach is verified through experiments with the robots HRP-2 and HeiCub.
This work proposes a nonlinear model predictive control method that, despite the complexity of the full multibody formulation, allows the control response to be computed in real time. This is achieved by combining linear and nonlinear model predictive control, operating on the current and the previous linearization of the problem respectively, in a parallel control strategy. Experiments with the humanoid robot Leo show that, compared with the nominal strategy, motion generation on the robot only becomes possible through the use of this method. In addition to model-based optimal control methods, model-free reinforcement learning methods for motion generation are also investigated, with a focus on the robots' model uncertainties, which are difficult to model. Within this work, a general comparative study and performance metrics are developed that allow model-based and model-free methods to be compared quantitatively with respect to their solution behavior. Applying the study to an academic example reveals differences and trade-offs as well as break-even points between the problem formulations. On this basis, this work proposes two possible combinations, whose properties are proven and investigated in simulation. In addition, the better-performing variant is implemented on the humanoid robot Leo and compared with a nominal model-based controller.

    System Identification of Bipedal Locomotion in Robots and Humans

    The ability to walk with a healthy gait can be impaired by numerous gait-disorder-related pathologies. These can lead to partial or complete loss of mobility, which affects patients’ quality of life. Wearable exoskeletons and active prosthetics have been considered key components in remedying this mobility loss. The control of such devices faces numerous challenges that are yet to be addressed. As opposed to fixed-trajectory control, real-time adaptive reference generation is likely to provide the wearer with more intent control over the powered device. We propose a novel gait pattern generator for the control of such devices, taking advantage of the inter-joint coordination in the human gait. Our proposed method puts the user in the control loop as it maps the motion of healthy limbs to that of the affected one. To design such a control strategy, it is critical to understand the dynamics behind bipedal walking. We begin by studying the simple compass gait walker. We examine the well-known Virtual Constraints method of controlling bipedal robots as applied to the compass gait. In addition, we provide both the mechanical and control design of an affordable research platform for bipedal dynamic walking. We then extend the concept of virtual constraints to human locomotion, where we investigate the accuracy of predicting the angular position and velocity of lower-limb joints from the motion of the other limbs. Data from nine healthy subjects performing specific locomotion tasks were collected and are made available online. A successful prediction of the hip, knee, and ankle joints was achieved in different scenarios. It was also found that the motion of the cane alone carries sufficient information to help predict good trajectories for the lower limb in stair ascent. Better estimates were obtained using additional information from arm joints. We also explored the prediction of knee and ankle trajectories from the motion of the hip joints.

    Quantization, Calibration and Planning for Euclidean Motions in Robotic Systems

    The properties of Euclidean motions are fundamental in all areas of robotics research. Over the past several decades, investigations of low-level tasks such as parameterizing specific movements and generating effective motion plans have fostered high-level operations in autonomous robotic systems. In typical applications, before robot motions are executed, a proper quantization of basic motion primitives can simplify online computations, and a precise calibration of sensor readings can improve the accuracy of the system controls. Of particular importance to the overall autonomous robotic task, a safe and efficient motion planning framework makes the whole system operate in a well-organized and effective way. All these modules have motivated considerable effort toward solving various fundamental problems, such as the uniformity of quantization in non-Euclidean manifolds, the calibration errors on unknown rigid transformations due to noise and the lack of data correspondence, and the narrow-passage and curse-of-dimensionality bottlenecks in developing motion planning algorithms. Therefore, the goal of this dissertation is to tackle these challenges in the topics of quantization, calibration and planning for Euclidean motions.

    Robot Assisted Shoulder Rehabilitation: Biomechanical Modelling, Design and Performance Evaluation

    Upper limb rehabilitation robots have made it possible to improve motor recovery in stroke survivors while reducing the burden on physical therapists. Compared to manual arm training, robot-supported training can be more intensive, of longer duration, repetitive and task-oriented. To align with the shoulder, the most biomechanically complex joint of the human body, specific considerations have to be made in the design of robotic shoulder exoskeletons. When implementing robotic exoskeletons for rehabilitation purposes, it is important to assist all shoulder degrees of freedom (DOFs) in order to increase the range of motion (ROM) and to avoid any joint axis misalignments between the robot and the human shoulder, which cause undesirable interaction forces and discomfort to the user. The main objective of this work is to design a safe robotic exoskeleton for shoulder rehabilitation with physiologically correct movements, lightweight modules, self-alignment characteristics and a large workspace. To achieve this goal, a comprehensive review of the existing shoulder rehabilitation exoskeletons is conducted first to outline their main advantages, drawbacks and limitations. The research then focuses on the biomechanics of the human shoulder, which is studied in detail using robotic analysis techniques, i.e. the human shoulder is modelled as a mechanism. The coupled constrained structure of the robotic exoskeleton connected to a human shoulder is considered as a hybrid human-robot mechanism to solve the problem of joint axis misalignments. Finally, a real-scale prototype of the robotic shoulder rehabilitation exoskeleton was built to test its operation and its suitability for shoulder rehabilitation.

    Descriptive and explanatory tools for human movement and state estimation in humanoid robotics

    The main subject of this thesis is the motion of anthropomorphic systems, and more particularly the bipedal locomotion of humans and humanoid robots. To characterize and understand bipedal locomotion, it is instructive to study its motor causes and its resulting physical consequences, namely, the interactions with the environment. Concerning the causes, for instance, what are the principles that govern the organization of motor orders in humans for elaborating a specific displacement strategy? And then, which physical quantities can we compute to best describe the motion resulting from these motor orders? These questions are in part addressed by the proposal of a mathematical extension of the Uncontrolled Manifold approach for the motor control of dynamic tasks and through the presentation of a new descriptor of anthropomorphic locomotion. Connected with this analytical work is the problem of state estimation in anthropomorphic systems. The difficulty of such a problem comes from the fact that the measurements carry noise which is not always separable from the informative data, and that the state of the system is not necessarily observable. To get rid of the noise, classical filtering techniques can be employed, but they are likely to distort the signals of interest.
To cope with this issue, we present a recursive method, based on complementary filtering, to estimate the position of the center of mass and the angular momentum variation of the human body, two central quantities of human locomotion. Another idea for getting rid of the measurement noise is to acknowledge the fact that it results in an unrealistic estimation of the motion dynamics. By exploiting the equations of motion, which dictate the temporal dynamics of the system, and by estimating a trajectory rather than a single point, we then present a maximum likelihood estimation using the differential dynamic programming algorithm to perform optimal centroidal state estimation for systems in contact. Finally, a multidisciplinary reflection on the functional and computational role played by the head in animals is presented. The relevance of using this solution in mobile robotics is discussed, particularly for state estimation and multisensory perception.

    Parametric Human Movements: Learning, Synthesis, Recognition, and Tracking


    Incorporating Human Expertise in Robot Motion Learning and Synthesis

    With the exponential growth of robotics and the fast development of robots' advanced cognitive and motor capabilities, one can start to envision humans and robots working together in unstructured environments. Yet, for that to be possible, robots need to be programmed for such complex scenarios, which demands significant domain knowledge in robotics and control. One viable approach to enable robots to acquire skills in a more flexible and efficient way is to give them the capability to learn autonomously from human demonstrations and expertise through interaction. Such a framework helps to make the creation of skills in robots more social and less demanding on programming and robotics expertise. Yet, current imitation learning approaches suffer from significant limitations, mainly in the flexibility and efficiency of representing, learning and reasoning about motor tasks. This thesis addresses this problem by exploring cost-function-based approaches to learning robot motion control, perception and the interplay between them. To begin with, the thesis proposes an efficient probabilistic algorithm to learn an impedance controller to accommodate motion contacts. The learning algorithm is able to incorporate important domain constraints, e.g., about force representation and decomposition, which are nontrivial to handle with standard techniques. Compliant handwriting motions are developed on an articulated robot arm and a multi-fingered hand. This work provides a flexible approach to learning robot motion that conforms to both task and domain constraints. Furthermore, the thesis also contributes techniques to learn from and reason about demonstrations with partial observability. The proposed approach combines inverse optimal control and ensemble methods, yielding tractable learning of cost functions with latent variables. Two task priors are further incorporated.
The first, a human kinematics prior, results in a model which synthesizes rich and believable dynamical handwriting. The second prior enforces dynamics on the latent variable and facilitates real-time recognition of human intention and on-line motion adaptation in collaborative robot tasks. Finally, the thesis establishes a link between control and perception modalities. This work offers an analysis that bridges inverse optimal control and deep generative models, as well as a novel algorithm that learns cost features and embeds the modal coupling prior. This work contributes an end-to-end system for synthesizing arm joint motion from letter image pixels. The results highlight its robustness against noisy and out-of-sample sensory inputs. Overall, the proposed approach endows robots with the potential to reason about diverse unstructured data, which is nowadays pervasive but hard for current imitation learning to process.

    Machine Learning through Exploration for Perception-Driven Robotics

    The ability of robots to perform tasks in human environments has largely been limited to rather simple and specific tasks, such as lawn mowing and vacuum cleaning. As such, current robots are far away from the robot butlers, assistants, and housekeepers that are depicted in science fiction movies. Part of this gap can be explained by the fact that human environments are hugely varied, complex and unstructured. For example, the homes that a domestic robot might end up in differ widely. Since every home has a different layout with different objects and furniture, it is impossible for a human designer to anticipate all challenges a robot might face, and equip the robot a priori with all the necessary perceptual and manipulation skills. Instead, robots could be programmed in a way that allows them to adapt to any environment that they are in. In that case, the robot designer would not need to precisely anticipate such environments. The ability to adapt can be provided by robot learning techniques, which can be applied to learn skills for perception and manipulation. Many of the current robot learning techniques, however, rely on human supervisors to provide annotations or demonstrations, and to fine-tune the methods' parameters and heuristics. As such, it can require a significant amount of human time investment to make a robot perform a task in a novel environment, even if statistical learning techniques are used. In this thesis, I focus on another way of obtaining the data a robot needs to learn about the environment and how to successfully perform skills in it. By exploring the environment using its own sensors and actuators, rather than passively waiting for annotations or demonstrations, a robot can obtain this data by itself. I investigate multiple approaches that allow a robot to explore its environment autonomously, while trying to minimize the design effort required to deploy such algorithms in different situations.
First, I consider an unsupervised robot with minimal prior knowledge about its environment. It can only learn through observed sensory feedback obtained through interactive exploration of its environment. In a bottom-up, probabilistic approach, the robot tries to segment the objects in its environment through clustering with minimal prior knowledge. This clustering is based on static visual scene features and observed movement. Information-theoretic principles are used to autonomously select actions that maximize the expected information gain, and thus learning speed. Our evaluations on a real robot system equipped with an on-board camera show that the proposed method handles noisy inputs better than previous methods, and that action selection according to the information gain criterion does increase the learning speed. Often, however, the goal of a robot is not just to learn the structure of the environment, but to learn how to perform a task encoded by a reward signal. In addition to the weak feedback provided by reward signals, the robot has access to rich sensory data that, even for simple tasks, is often non-linear and high-dimensional. Sensory data can be leveraged to learn a system model, but in high-dimensional sensory spaces this step often requires manually designing features. I propose a robot reinforcement learning algorithm with learned non-parametric models, value functions, and policies that can deal with high-dimensional state representations. As such, the proposed algorithm is well-suited to deal with high-dimensional signals such as camera images. To prevent the robot from converging prematurely to a sub-optimal solution, the information loss of policy updates is limited. This constraint makes sure the robot keeps exploring the effects of its behavior on the environment.
The experiments show that the proposed non-parametric relative entropy policy search algorithm performs better than prior methods that either do not employ bounded updates, or that try to cover the state-space with general-purpose radial basis functions. Furthermore, the method is validated on a real-robot setup with high-dimensional camera image inputs. One problem with typical exploration strategies is that the behavior is perturbed independently in each time step, for example through selecting a random action or random policy parameters. As such, the resulting exploration behavior might be incoherent. Incoherence causes inefficient random walk behavior, makes the system less robust, and causes wear and tear on the robot. A typical solution is to perturb the policy parameters directly, and use the same perturbation for an entire episode. However, this strategy tends to increase the number of episodes needed, since only a single perturbation can be evaluated per episode. I introduce a strategy that can make a more balanced trade-off between the advantages of these two approaches. The experiments show that intermediate trade-offs, rather than independent or episode-based exploration, are beneficial across different tasks and learning algorithms. This thesis thus addresses how robots can learn autonomously by exploring the world through unsupervised learning and reinforcement learning. Throughout the thesis, new approaches and algorithms are introduced: a probabilistic interactive segmentation approach, the non-parametric relative entropy policy search algorithm, and a framework for generalized exploration. To allow the learning algorithms to be applied in different and unknown environments, the design effort and supervision required from human designers or users are minimized. These approaches and algorithms contribute towards the capability of robots to autonomously learn useful skills in human environments in a practical manner.