Search CORE

270 research outputs found

Reset-free Trial-and-Error Learning for Robot Damage Recovery

Author: Baranes
Blanke
Bongard
Browne
Calandra
Carlson
Corbato
Cully
DeDonato
Deisenroth
Deisenroth
Deisenroth
Droniou
Durrant-Whyte
Guizzo
Hester
Isermann
Jean-Baptiste Mouret
Kavraki
Kober
Konstantinos Chatzilygeroudis
Koos
LaValle
LaValle
Lengagne
Mnih
Mostafa
Mouret
Nguyen
Nguyen-Tuong
Nori
Peters
Pugh
Quiñonero-Candela
Rasmussen
Ren
Shahriari
Silver
Stulp
Sutton
Vassilis Vassiliades
Verma
Publication venue: 'Elsevier BV'
Publication date: 01/01/2017
Field of study

The high probability of hardware failures prevents many advanced robots (e.g., legged robots) from being confidently deployed in real-world situations (e.g., post-disaster rescue). Instead of attempting to diagnose the failures, robots could adapt by trial-and-error in order to be able to complete their tasks. In this situation, damage recovery can be seen as a Reinforcement Learning (RL) problem. However, the best RL algorithms for robotics require the robot and the environment to be reset to an initial state after each episode, that is, the robot is not learning autonomously. In addition, most of the RL methods for robotics do not scale well with complex robots (e.g., walking robots) and either cannot be used at all or take too long to converge to a solution (e.g., hours of learning). In this paper, we introduce a novel learning algorithm called "Reset-free Trial-and-Error" (RTE) that (1) breaks the complexity by pre-generating hundreds of possible behaviors with a dynamics simulator of the intact robot, and (2) allows complex robots to quickly recover from damage while completing their tasks and taking the environment into account. We evaluate our algorithm on a simulated wheeled robot, a simulated six-legged robot, and a real six-legged walking robot that are damaged in several ways (e.g., a missing leg, a shortened leg, faulty motor, etc.) and whose objective is to reach a sequence of targets in an arena. Our experiments show that the robots can recover most of their locomotion abilities in an environment with obstacles, and without any human intervention.Comment: 18 pages, 16 figures, 3 tables, 6 pseudocodes/algorithms, video at https://youtu.be/IqtyHFrb3BU, code at https://github.com/resibots/chatzilygeroudis_2018_rt

arXiv.org e-Print Archive

Crossref

INRIA a CCSD electronic archive server

Using Centroidal Voronoi Tessellations to Scale Up the Multi-dimensional Archive of Phenotypic Elites Algorithm

Author: Chatzilygeroudis Konstantinos
Mouret Jean-Baptiste
Vassiliades Vassilis
Publication venue
Publication date: 01/01/2017
Field of study

The recently introduced Multi-dimensional Archive of Phenotypic Elites (MAP-Elites) is an evolutionary algorithm capable of producing a large archive of diverse, high-performing solutions in a single run. It works by discretizing a continuous feature space into unique regions according to the desired discretization per dimension. While simple, this algorithm has a main drawback: it cannot scale to high-dimensional feature spaces since the number of regions increase exponentially with the number of dimensions. In this paper, we address this limitation by introducing a simple extension of MAP-Elites that has a constant, pre-defined number of regions irrespective of the dimensionality of the feature space. Our main insight is that methods from computational geometry could partition a high-dimensional space into well-spread geometric regions. In particular, our algorithm uses a centroidal Voronoi tessellation (CVT) to divide the feature space into a desired number of regions; it then places every generated individual in its closest region, replacing a less fit one if the region is already occupied. We demonstrate the effectiveness of the new "CVT-MAP-Elites" algorithm in high-dimensional feature spaces through comparisons against MAP-Elites in maze navigation and hexapod locomotion tasks

arXiv.org e-Print Archive

INRIA a CCSD electronic archive server

Intelligent approaches in locomotion - a review

Author: Jordanov Ivan
Wright Jonathan
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/11/2015
Field of study

Portsmouth University Research Portal (Pure)

Climbing and Walking Robots

Author
Publication venue: 'IntechOpen'
Publication date: 20/04/2021
Field of study

Nowadays robotics is one of the most dynamic fields of scientific researches. The shift of robotics researches from manufacturing to services applications is clear. During the last decades interest in studying climbing and walking robots has been increased. This increasing interest has been in many areas that most important ones of them are: mechanics, electronics, medical engineering, cybernetics, controls, and computers. Today’s climbing and walking robots are a combination of manipulative, perceptive, communicative, and cognitive abilities and they are capable of performing many tasks in industrial and non- industrial environments. Surveillance, planetary exploration, emergence rescue operations, reconnaissance, petrochemical applications, construction, entertainment, personal services, intervention in severe environments, transportation, medical and etc are some applications from a very diverse application fields of climbing and walking robots. By great progress in this area of robotics it is anticipated that next generation climbing and walking robots will enhance lives and will change the way the human works, thinks and makes decisions. This book presents the state of the art achievments, recent developments, applications and future challenges of climbing and walking robots. These are presented in 24 chapters by authors throughtot the world The book serves as a reference especially for the researchers who are interested in mobile robots. It also is useful for industrial engineers and graduate students in advanced study

Directory of Open Access Books (DOAB)

Analytical Workspace, Kinematics, and Foot Force Based Stability of Hexapod Walking Robots

Author: Agheli Hajiabadi Mohammad Mahdi
Publication venue: Digital WPI
Publication date: 24/04/2013
Field of study

Many environments are inaccessible or hazardous for humans. Remaining debris after earthquake and fire, ship hulls, bridge installations, and oil rigs are some examples. For these environments, major effort is being placed into replacing humans with robots for manipulation purposes such as search and rescue, inspection, repair, and maintenance. Mobility, manipulability, and stability are the basic needs for a robot to traverse, maneuver, and manipulate in such irregular and highly obstructed terrain. Hexapod walking robots are as a salient solution because of their extra degrees of mobility, compared to mobile wheeled robots. However, it is essential for any multi-legged walking robot to maintain its stability over the terrain or under external stimuli. For manipulation purposes, the robot must also have a sufficient workspace to satisfy the required manipulability. Therefore, analysis of both workspace and stability becomes very important. An accurate and concise inverse kinematic solution for multi-legged robots is developed and validated. The closed-form solution of lateral and spatial reachable workspace of axially symmetric hexapod walking robots are derived and validated through simulation which aid in the design and optimization of the robot parameters and workspace. To control the stability of the robot, a novel stability margin based on the normal contact forces of the robot is developed and then modified to account for the geometrical and physical attributes of the robot. The margin and its modified version are validated by comparison with a widely known stability criterion through simulated and physical experiments. A control scheme is developed to integrate the workspace and stability of multi-legged walking robots resulting in a bio-inspired reactive control strategy which is validated experimentally

DigitalCommons@WPI

Temporal information processing and memory guided behaviors with recurrent neural networks

Author: Dasgupta Sakyasingha
Publication venue
Publication date: 28/01/2015
Field of study

Georg-August-University Göttingen

A Bioinspired Dynamical Vertical Climbing Robot

Author: Clark Jonathan E
Koditschek Daniel E
Lei Pei-Chun
Lynch Goran A
Publication venue: ScholarlyCommons
Publication date: 01/01/2012
Field of study

This paper describes the inspiration, design, analysis, implementation of and experimentation with the first dynamical vertical climbing robot. Biologists have proposed a pendulous climbing model that abstracts remarkable similarities in dynamic wall scaling behavior exhibited by radically different animal species. We study numerically a version of that pendulous climbing template dynamically re-scaled for applicability to utilitarian payloads with conventional electronics and actuation. This simulation study reveals that the incorporation of passive compliance can compensate for an artifact’s poorer power density and scale disadvantages relative to biology. However the introduction of additional dynamical elements raises new concerns about stability regarding both the power stroke and limb coordination that we allay via mathematical analysis of further simplified models. Combining these numerical and analytical insights into a series of design prototypes, we document the correspondence of the various models to the variously scaled platforms and report that our approximately two kilogram platform climbs dynamically at vertical speeds up to 1.5 bodylengths per second. In particular, the final 2.6 kg final prototype climbs at an average steady state speed of 0.66 m/s against gravity on a carpeted vertical wall, in rough agreement with our various models’ predictions

ScholarlyCommons@Penn

Bio-inspired Dynamic Control Systems with Time Delays

Author: DERRICK BENJAMIN,JOHN
Publication venue
Publication date: 01/01/2014
Field of study

The world around us exhibits a rich and ever changing environment of startling, bewildering and fascinating complexity. Almost everything is never as simple as it seems, but through the chaos we may catch fleeting glimpses of the mechanisms within. Throughout the history of human endeavour we have mimicked nature to harness it for our own ends. Our attempts to develop truly autonomous and intelligent machines have however struggled with the limitations of our human ability. This has encouraged some to shirk this responsibility and instead model biological processes and systems to do it for us. This Thesis explores the introduction of continuous time delays into biologically inspired dynamic control systems. We seek to exploit rich temporal dynamics found in physical and biological systems for modelling complex or adaptive behaviour through the artificial evolution of networks to control robots. Throughout, arguments have been presented for the modelling of delays not only to better represent key facets of physical and biological systems, but to increase the computational potential of such systems for the synthesis of control. The thorough investigation of the dynamics of small delayed networks with a wide range of time delays has been undertaken, with a detailed mathematical description of the fixed points of the system and possible oscillatory modes developed to fully describe the behaviour of a single node. Exploration of the behaviour for even small delayed networks illustrates the range of complex behaviour possible and guides the development of interesting solutions. To further exploit the potential of the rich dynamics in such systems, a novel approach to the 3D simulation of locomotory robots has been developed focussing on minimising the computational cost. To verify this simulation tool a simple quadruped robot was developed and the motion of the robot when undergoing a manually designed gait evaluated. The results displayed a high degree of agreement between the simulation and laser tracker data, verifying the accuracy of the model developed. A new model of a dynamic system which includes continuous time delays has been introduced, and its utility demonstrated in the evolution of networks for the solution of simple learning behaviours. A range of methods has been developed for determining the time delays, including the novel concept of representing the time delays as related to the distance between nodes in a spatial representation of the network. The application of these tools to a range of examples has been explored, from Gene Regulatory Networks (GRNs) to robot control and neural networks. The performance of these systems has been compared and contrasted with the efficacy of evolutionary runs for the same task over the whole range of network and delay types. It has been shown that delayed dynamic neural systems are at least as capable as traditional Continuous Time Recurrent Neural Networks (CTRNNs) and show significant performance improvements in the control of robot gaits. Experiments in adaptive behaviour, where there is not such a direct link between the enhanced system dynamics and performance, showed no such discernible improvement. Whilst we hypothesise that the ability of such delayed networks to generate switched pattern generating nodes may be useful in Evolutionary Robotics (ER) this was not borne out here. The spatial representation of delays was shown to be more efficient for larger networks, however these techniques restricted the search to lower complexity solutions or led to a significant falloff as the network structure becomes more complex. This would suggest that for anything other than a simple genotype, the direct method for encoding delays is likely most appropriate. With proven benefits for robot locomotion and the open potential for adaptive behaviour delayed dynamic systems for evolved control remain an interesting and promising field in complex systems research

Durham e-Theses

Convergence of Bayesian Histogram Filters for Location Estimation

Author: De Avik
Koditschek Daniel E
Moran William
Ribeiro Alejandro
Publication venue: ScholarlyCommons
Publication date: 01/01/2013
Field of study

We prove convergence of an approximate Bayesian estimator for the (scalar) location estimation problem by recourse to a histogram approximant. We exploit its tractability to present a simple strategy for managing the tradeoff between accuracy and complexity through the cardinality of the underlying partition. Our theoretical results provide explicit (conservative) sufficient conditions under which convergence is guaranteed. Numerical simulations reveal certain extreme cases in which the conditions may be tight, and suggest that this procedure has performance and computational efficiency favorably comparable to particle filters, while affording the aforementioned analytical benefits. We posit that more sophisticated algorithms can make such piecewise-constant representations similarly feasible for very high-dimensional problems. For more information: Kod*La

Crossref

RMIT Research Repository

ScholarlyCommons@Penn

Fast biped walking with a neuronal controller and physical computation

Author: Geng Tao
Publication venue: University of Stirling
Publication date: 01/04/2007
Field of study

Biped walking remains a difficult problem and robot models can greatly {facilitate} our understanding of the underlying biomechanical principles as well as their neuronal control. The goal of this study is to specifically demonstrate that stable biped walking can be achieved by combining the physical properties of the walking robot with a small, reflex-based neuronal network, which is governed mainly by local sensor signals. This study shows that human-like gaits emerge without {specific} position or trajectory control and that the walker is able to compensate small disturbances through its own dynamical properties. The reflexive controller used here has the following characteristics, which are different from earlier approaches: (1) Control is mainly local. Hence, it uses only two signals (AEA=Anterior Extreme Angle and GC=Ground Contact) which operate at the inter-joint level. All other signals operate only at single joints. (2) Neither position control nor trajectory tracking control is used. Instead, the approximate nature of the local reflexes on each joint allows the robot mechanics itself (e.g., its passive dynamics) to contribute substantially to the overall gait trajectory computation. (3) The motor control scheme used in the local reflexes of our robot is more straightforward and has more biological plausibility than that of other robots, because the outputs of the motorneurons in our reflexive controller are directly driving the motors of the joints, rather than working as references for position or velocity control. As a consequence, the neural controller and the robot mechanics are closely coupled as a neuro-mechanical system and this study emphasises that dynamically stable biped walking gaits emerge from the coupling between neural computation and physical computation. This is demonstrated by different walking experiments using two real robot as well as by a Poincar\'{e} map analysis applied on a model of the robot in order to assess its stability. In addition, this neuronal control structure allows the use of a policy gradient reinforcement learning algorithm to tune the parameters of the neurons in real-time, during walking. This way the robot can reach a record-breaking walking speed of 3.5 leg-lengths per second after only a few minutes of online learning, which is even comparable to the fastest relative speed of human walking

Stirling Online Research Repository