Search CORE

3,081 research outputs found

Embodied Evolution in Collective Robotics: A Review

Author: Alba
Alba
Amato
Anderson
Aplin
Arthur
Axelrod
Bangel
Barrett
Bayindir
Bedau
Bellingham
Beni
Bentham
Bernard
Bernstein
Bianco
Blount
Bongard
Bongard
Boumaza
Brambilla
Bredeche
Bredeche
Bredeche
Bredeche
Bredeche
Brodbeck
Camazine
Charlesworth
Christensen
Cully
Deutsch
Dibangoye
Doncieux
Eiben
Eiben
Eiben
Eiben
Fernandez Pérez
Fernandez Pérez
Fernandez Pérez
Ferrante
Ficici
Floreano
García-Sánchez
Gauci
Geritz
Good
Haasdijk
Haasdijk
Haasdijk
Haasdijk
Haasdijk
Haasdijk
Haasdijk
Hardin
Hart
Hauert
Heinerman
Heinerman
Hettiarachchi
Hettiarachchi
Huijsman
Jakobi
Karafotias
Kemeling
König
König
Lehman
Long
Maynard Smith
Mitri
Montanier
Montanier
Montanier
Moor
Mouret
Mouret
Nelson
Nolfi
Nordin
Noskov
Nouyan
O’Dowd
Parker
Perez
Pfeifer
Prieto
Prieto
Prieto
Prieto
Pugh
Ray
Rubenstein
Schut
Schwarzer
Schwarzer
Shapley
Silva
Silva
Silva
Silva
Silva
Simões
Soros
Stanley
Steyven
Stone
Stone
Stone
Taylor
Thrun
Tonelli
Trianni
Trueba
Trueba
Trueba
Trueba
Urzelai
Usui
Vanderelst
Waibel
Wakeley
Walker
Watson
Weel
Weel
Werfel
West
Wischmann
Wiser
Wolpert
Publication venue: 'Frontiers Media SA'
Publication date: 01/01/2018
Field of study

This paper provides an overview of evolutionary robotics techniques applied to on-line distributed evolution for robot collectives -- namely, embodied evolution. It provides a definition of embodied evolution as well as a thorough description of the underlying concepts and mechanisms. The paper also presents a comprehensive summary of research published in the field since its inception (1999-2017), providing various perspectives to identify the major trends. In particular, we identify a shift from considering embodied evolution as a parallel search method within small robot collectives (fewer than 10 robots) to embodied evolution as an on-line distributed learning method for designing collective behaviours in swarm-like collectives. The paper concludes with a discussion of applications and open questions, providing a milestone for past and an inspiration for future research.Comment: 23 pages, 1 figure, 1 tabl

arXiv.org e-Print Archive

VU Research Portal

Crossref

Directory of Open Access Journals

Frontiers - Publisher Connector

Towards adaptive multi-robot systems: self-organization and self-adaptation

Author: Albayrak Sahin
Hrabia Christopher-Eyk
Lützenberger Marco
Publication venue
Publication date: 04/10/2018
Field of study

Dieser Beitrag ist mit Zustimmung des Rechteinhabers aufgrund einer (DFG geförderten) Allianz- bzw. Nationallizenz frei zugänglich.This publication is with permission of the rights owner freely accessible due to an Alliance licence and a national licence (funded by the DFG, German Research Foundation) respectively.The development of complex systems ensembles that operate in uncertain environments is a major challenge. The reason for this is that system designers are not able to fully specify the system during specification and development and before it is being deployed. Natural swarm systems enjoy similar characteristics, yet, being self-adaptive and being able to self-organize, these systems show beneficial emergent behaviour. Similar concepts can be extremely helpful for artificial systems, especially when it comes to multi-robot scenarios, which require such solution in order to be applicable to highly uncertain real world application. In this article, we present a comprehensive overview over state-of-the-art solutions in emergent systems, self-organization, self-adaptation, and robotics. We discuss these approaches in the light of a framework for multi-robot systems and identify similarities, differences missing links and open gaps that have to be addressed in order to make this framework possible

DepositOnce

Fast Damage Recovery in Robotics with the T-Resilience Algorithm

Author: Cully Antoine
Koos Sylvain
Mouret Jean-Baptiste
Publication venue: 'SAGE Publications'
Publication date: 02/02/2013
Field of study

Damage recovery is critical for autonomous robots that need to operate for a long time without assistance. Most current methods are complex and costly because they require anticipating each potential damage in order to have a contingency plan ready. As an alternative, we introduce the T-resilience algorithm, a new algorithm that allows robots to quickly and autonomously discover compensatory behaviors in unanticipated situations. This algorithm equips the robot with a self-model and discovers new behaviors by learning to avoid those that perform differently in the self-model and in reality. Our algorithm thus does not identify the damaged parts but it implicitly searches for efficient behaviors that do not use them. We evaluate the T-Resilience algorithm on a hexapod robot that needs to adapt to leg removal, broken legs and motor failures; we compare it to stochastic local search, policy gradient and the self-modeling algorithm proposed by Bongard et al. The behavior of the robot is assessed on-board thanks to a RGB-D sensor and a SLAM algorithm. Using only 25 tests on the robot and an overall running time of 20 minutes, T-Resilience consistently leads to substantially better results than the other approaches

arXiv.org e-Print Archive

Spiral - Imperial College Digital Repository

A survey on policy search algorithms for learning robot controllers in a handful of trials

Author: Calinon Sylvain
Chatzilygeroudis Konstantinos
Mouret Jean-Baptiste
Stulp Freek
Vassiliades Vassilis
Publication venue
Publication date: 04/12/2019
Field of study

Most policy search algorithms require thousands of training episodes to find an effective policy, which is often infeasible with a physical robot. This survey article focuses on the extreme other end of the spectrum: how can a robot adapt with only a handful of trials (a dozen) and a few minutes? By analogy with the word "big-data", we refer to this challenge as "micro-data reinforcement learning". We show that a first strategy is to leverage prior knowledge on the policy structure (e.g., dynamic movement primitives), on the policy parameters (e.g., demonstrations), or on the dynamics (e.g., simulators). A second strategy is to create data-driven surrogate models of the expected reward (e.g., Bayesian optimization) or the dynamical model (e.g., model-based policy search), so that the policy optimizer queries the model instead of the real system. Overall, all successful micro-data algorithms combine these two strategies by varying the kind of model and prior knowledge. The current scientific challenges essentially revolve around scaling up to complex robots (e.g., humanoids), designing generic priors, and optimizing the computing time.Comment: 21 pages, 3 figures, 4 algorithms, accepted at IEEE Transactions on Robotic

arXiv.org e-Print Archive

Institute of Transport Research:Publications

ZENODO

INRIA a CCSD electronic archive server

HAL Descartes

An empirical evaluation of evolutionary controller design methods for collective gathering task

Author: Jang Jae
Publication venue: Department of Computer Science
Publication date: 01/09/2016
Field of study

This research aims to evaluate the performance of evolutionary controller design methods for developing a collective behaviour for a team of robots. The methods tested in this research are NEAT which is capable of finding minimal solution quickly, and SANE which maintains high genetic diversity through neuron level evolution. The task chosen for these methods was a collective gathering task which required a team of robots to cooperate in finding and retrieving item of interest. Our results showed that NEAT consistently produced better controllers compared to SANE

Cape Town University OpenUCT

Adaptive and learning-based formation control of swarm robots

Author: Salimi Mahsoo
Publication venue
Publication date: 14/10/2021
Field of study

Autonomous aerial and wheeled mobile robots play a major role in tasks such as search and rescue, transportation, monitoring, and inspection. However, these operations are faced with a few open challenges including robust autonomy, and adaptive coordination based on the environment and operating conditions, particularly in swarm robots with limited communication and perception capabilities. Furthermore, the computational complexity increases exponentially with the number of robots in the swarm. This thesis examines two different aspects of the formation control problem. On the one hand, we investigate how formation could be performed by swarm robots with limited communication and perception (e.g., Crazyflie nano quadrotor). On the other hand, we explore human-swarm interaction (HSI) and different shared-control mechanisms between human and swarm robots (e.g., BristleBot) for artistic creation. In particular, we combine bio-inspired (i.e., flocking, foraging) techniques with learning-based control strategies (using artificial neural networks) for adaptive control of multi- robots. We first review how learning-based control and networked dynamical systems can be used to assign distributed and decentralized policies to individual robots such that the desired formation emerges from their collective behavior. We proceed by presenting a novel flocking control for UAV swarm using deep reinforcement learning. We formulate the flocking formation problem as a partially observable Markov decision process (POMDP), and consider a leader-follower configuration, where consensus among all UAVs is used to train a shared control policy, and each UAV performs actions based on the local information it collects. In addition, to avoid collision among UAVs and guarantee flocking and navigation, a reward function is added with the global flocking maintenance, mutual reward, and a collision penalty. We adapt deep deterministic policy gradient (DDPG) with centralized training and decentralized execution to obtain the flocking control policy using actor-critic networks and a global state space matrix. In the context of swarm robotics in arts, we investigate how the formation paradigm can serve as an interaction modality for artists to aesthetically utilize swarms. In particular, we explore particle swarm optimization (PSO) and random walk to control the communication between a team of robots with swarming behavior for musical creation

Simon Fraser University Institutional Repository