Search CORE

133 research outputs found

Evolvability signatures of generative encodings: beyond standard performance benchmarks

Author: Mouret Jean-Baptiste
Tarapore Danesh
Publication venue
Publication date: 01/01/2015
Field of study

Evolutionary robotics is a promising approach to autonomously synthesize machines with abilities that resemble those of animals, but the field suffers from a lack of strong foundations. In particular, evolutionary systems are currently assessed solely by the fitness score their evolved artifacts can achieve for a specific task, whereas such fitness-based comparisons provide limited insights about how the same system would evaluate on different tasks, and its adaptive capabilities to respond to changes in fitness (e.g., from damages to the machine, or in new situations). To counter these limitations, we introduce the concept of "evolvability signatures", which picture the post-mutation statistical distribution of both behavior diversity (how different are the robot behaviors after a mutation?) and fitness values (how different is the fitness after a mutation?). We tested the relevance of this concept by evolving controllers for hexapod robot locomotion using five different genotype-to-phenotype mappings (direct encoding, generative encoding of open-loop and closed-loop central pattern generators, generative encoding of neural networks, and single-unit pattern generators (SUPG)). We observed a predictive relationship between the evolvability signature of each encoding and the number of generations required by hexapods to adapt from incurred damages. Our study also reveals that, across the five investigated encodings, the SUPG scheme achieved the best evolvability signature, and was always foremost in recovering an effective gait following robot damages. Overall, our evolvability signatures neatly complement existing task-performance benchmarks, and pave the way for stronger foundations for research in evolutionary robotics.Comment: 24 pages with 12 figures in the main text, and 4 supplementary figures. Accepted at Information Sciences journal (in press). Supplemental videos are available online at, see http://goo.gl/uyY1R

arXiv.org e-Print Archive

Southampton (e-Prints Soton)

Reset-free Trial-and-Error Learning for Robot Damage Recovery

Author: Baranes
Blanke
Bongard
Browne
Calandra
Carlson
Corbato
Cully
DeDonato
Deisenroth
Deisenroth
Deisenroth
Droniou
Durrant-Whyte
Guizzo
Hester
Isermann
Jean-Baptiste Mouret
Kavraki
Kober
Konstantinos Chatzilygeroudis
Koos
LaValle
LaValle
Lengagne
Mnih
Mostafa
Mouret
Nguyen
Nguyen-Tuong
Nori
Peters
Pugh
Quiñonero-Candela
Rasmussen
Ren
Shahriari
Silver
Stulp
Sutton
Vassilis Vassiliades
Verma
Publication venue: 'Elsevier BV'
Publication date: 01/01/2017
Field of study

The high probability of hardware failures prevents many advanced robots (e.g., legged robots) from being confidently deployed in real-world situations (e.g., post-disaster rescue). Instead of attempting to diagnose the failures, robots could adapt by trial-and-error in order to be able to complete their tasks. In this situation, damage recovery can be seen as a Reinforcement Learning (RL) problem. However, the best RL algorithms for robotics require the robot and the environment to be reset to an initial state after each episode, that is, the robot is not learning autonomously. In addition, most of the RL methods for robotics do not scale well with complex robots (e.g., walking robots) and either cannot be used at all or take too long to converge to a solution (e.g., hours of learning). In this paper, we introduce a novel learning algorithm called "Reset-free Trial-and-Error" (RTE) that (1) breaks the complexity by pre-generating hundreds of possible behaviors with a dynamics simulator of the intact robot, and (2) allows complex robots to quickly recover from damage while completing their tasks and taking the environment into account. We evaluate our algorithm on a simulated wheeled robot, a simulated six-legged robot, and a real six-legged walking robot that are damaged in several ways (e.g., a missing leg, a shortened leg, faulty motor, etc.) and whose objective is to reach a sequence of targets in an arena. Our experiments show that the robots can recover most of their locomotion abilities in an environment with obstacles, and without any human intervention.Comment: 18 pages, 16 figures, 3 tables, 6 pseudocodes/algorithms, video at https://youtu.be/IqtyHFrb3BU, code at https://github.com/resibots/chatzilygeroudis_2018_rt

arXiv.org e-Print Archive

INRIA a CCSD electronic archive server

Robots that can adapt like animals

Author: A Borji
A Fuchs
A Pouget
A Sproewitz
B Siciliano
BD Argall
CE Rasmussen
DJ Christensen
DM Wolpert
E Broadbent
G Antonelli
J Bongard
J Carlson
J Kluger
J Kober
J Mockus
K Nagatani
K Sanderson
KP Körding
M Blanke
M Ito
M Santello
RR Murphy
S Benson-Amram
S Derégnaucourt
S Grillner
S Thrun
SL Jarvis
U Wagner
V Verma
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 27/05/2015
Field of study

As robots leave the controlled environments of factories to autonomously function in more complex, natural environments, they will have to respond to the inevitable fact that they will become damaged. However, while animals can quickly adapt to a wide variety of injuries, current robots cannot "think outside the box" to find a compensatory behavior when damaged: they are limited to their pre-specified self-sensing abilities, can diagnose only anticipated failure modes, and require a pre-programmed contingency plan for every type of potential damage, an impracticality for complex robots. Here we introduce an intelligent trial and error algorithm that allows robots to adapt to damage in less than two minutes, without requiring self-diagnosis or pre-specified contingency plans. Before deployment, a robot exploits a novel algorithm to create a detailed map of the space of high-performing behaviors: This map represents the robot's intuitions about what behaviors it can perform and their value. If the robot is damaged, it uses these intuitions to guide a trial-and-error learning algorithm that conducts intelligent experiments to rapidly discover a compensatory behavior that works in spite of the damage. Experiments reveal successful adaptations for a legged robot injured in five different ways, including damaged, broken, and missing legs, and for a robotic arm with joints broken in 14 different ways. This new technique will enable more robust, effective, autonomous robots, and suggests principles that animals may use to adapt to injury

arXiv.org e-Print Archive

Southampton (e-Prints Soton)

INRIA a CCSD electronic archive server

HAL-Rennes 1

Proprioception Based Behavioral Advances in a Hexapod Robot

Author: Buehler Martin
Koditschek Daniel E
Komsuoglu Haldun
McMordie Dave
Moore Ned
Saranli Uluc
Publication venue: ScholarlyCommons
Publication date: 01/01/2001
Field of study

We report on our progress in extending the behavioral repertoire of RHex, a compliant leg hexapod robot. We introduce two new controllers, one for climbing constant slope inclinations and one for achieving higher speeds via pronking, a gait that incorporates a, substantial aerial phase. In both cases, we make use of an underlying open-loop control strategy, combined with low bandwidth feedback to modulate its parameters. The inclination behavior arises from our initial alternating tripod walking controller and adjusts the angle offsets of individual leg motion profiles based on inertial sensing of the average surface slope. Similarly, the pronking controller makes use of a virtual leg touchdown sensing mechanism to adjust the frequency of the open-loop pronking, effectively synchronizing the controller with the natural oscillations of the mechanical system. Experimental results demonstrate good performance on slopes inclined up to /spl sim/250 and pronking up to speeds approaching 2 body lengths per second (/spl sim/1.0 m/s)