Search CORE

1,485 research outputs found

Evolving controllers for simulated car racing

Author: Lucas Simon M.
Togelius Julian
Publication venue: IEEE Press
Publication date: 01/01/2005
Field of study

This paper describes the evolution of controllers for racing a simulated radio-controlled car around a track, modelled on a real physical track. Five different controller architectures were compared, based on neural networks, force fields and action sequences. The controllers use either egocentric (first person), Newtonian (third person) or no information about the state of the car (open-loop controller). The only controller that is able to evolve good racing behaviour is based on a neural network acting on egocentric inputs

arXiv.org e-Print Archive

CogPrints Cognitive Sciences Eprint Archive

UltraSwarm: A Further Step Towards a Flock of Miniature Helicopters

Author: De nardi Renzo
Holland Owen
Publication venue: Springer
Publication date: 01/01/2006
Field of study

We describe further progress towards the development of a MAV (micro aerial vehicle) designed as an enabling tool to investigate aerial flocking. Our research focuses on the use of low cost off the shelf vehicles and sensors to enable fast prototyping and to reduce development costs. Details on the design of the embedded electronics and the modification of the chosen toy helicopter are presented, and the technique used for state estimation is described. The fusion of inertial data through an unscented Kalman filter is used to estimate the helicopter’s state, and this forms the main input to the control system. Since no detailed dynamic model of the helicopter in use is available, a method is proposed for automated system identification, and for subsequent controller design based on artificial evolution. Preliminary results obtained with a dynamic simulator of a helicopter are reported, along with some encouraging results for tackling the problem of flocking

CogPrints Cognitive Sciences Eprint Archive

Neuroevolution in Games: State of the Art and Open Challenges

Author: Risi Sebastian
Togelius Julian
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2015
Field of study

This paper surveys research on applying neuroevolution (NE) to games. In neuroevolution, artificial neural networks are trained through evolutionary algorithms, taking inspiration from the way biological brains evolved. We analyse the application of NE in games along five different axes, which are the role NE is chosen to play in a game, the different types of neural networks used, the way these networks are evolved, how the fitness is determined and what type of input the network receives. The article also highlights important open research challenges in the field.Comment: - Added more references - Corrected typos - Added an overview table (Table 1

arXiv.org e-Print Archive

CiteSeerX

Crossref

The IT University of Copenhagen's Repository

Never Too Old To Learn: On-line Evolution of Controllers in Swarm- and Modular Robotics

Author: Haasdijk E.W.
Publication venue: Amsterdam: Vrije Universiteit
Publication date: 01/01/2012
Field of study

Eiben, A.E. [Promotor

VU Research Portal

Reinforcement Learning for UAV Attitude Control

Author: Bestavros Azer
Koch William
Mancuso Renato
West Richard
Publication venue
Publication date: 01/04/2018
Field of study

Autopilot systems are typically composed of an "inner loop" providing stability and control, while an "outer loop" is responsible for mission-level objectives, e.g. way-point navigation. Autopilot systems for UAVs are predominately implemented using Proportional, Integral Derivative (PID) control systems, which have demonstrated exceptional performance in stable environments. However more sophisticated control is required to operate in unpredictable, and harsh environments. Intelligent flight control systems is an active area of research addressing limitations of PID control most recently through the use of reinforcement learning (RL) which has had success in other applications such as robotics. However previous work has focused primarily on using RL at the mission-level controller. In this work, we investigate the performance and accuracy of the inner control loop providing attitude control when using intelligent flight control systems trained with the state-of-the-art RL algorithms, Deep Deterministic Gradient Policy (DDGP), Trust Region Policy Optimization (TRPO) and Proximal Policy Optimization (PPO). To investigate these unknowns we first developed an open-source high-fidelity simulation environment to train a flight controller attitude control of a quadrotor through RL. We then use our environment to compare their performance to that of a PID controller to identify if using RL is appropriate in high-precision, time-critical flight control.Comment: 13 pages, 9 figure

arXiv.org e-Print Archive

Boston University Institutional Repository (OpenBU)

odNEAT: an algorithm for decentralised online evolution of robotic controllers

Author: Christensen A. L.
Correia L.
Silva F.
Urbano P.
Publication venue: 'MIT Press - Journals'
Publication date: 01/01/2015
Field of study

Online evolution gives robots the capacity to learn new tasks and to adapt to changing environmental conditions during task execution. Previous approaches to online evolution of neural controllers are typically limited to the optimisation of weights in networks with a prespecified, fixed topology. In this article, we propose a novel approach to online learning in groups of autonomous robots called odNEAT. odNEAT is a distributed and decentralised neuroevolution algorithm that evolves both weights and network topology. We demonstrate odNEAT in three multirobot tasks: aggregation, integrated navigation and obstacle avoidance, and phototaxis. Results show that odNEAT approximates the performance of rtNEAT, an efficient centralised method, and outperforms IM-( mu + 1), a decentralised neuroevolution algorithm. Compared with rtNEAT and IM( mu + 1), odNEAT's evolutionary dynamics lead to the synthesis of less complex neural controllers with superior generalisation capabilities. We show that robots executing odNEAT can display a high degree of fault tolerance as they are able to adapt and learn new behaviours in the presence of faults. We conclude with a series of ablation studies to analyse the impact of each algorithmic component on performance.info:eu-repo/semantics/submittedVersio

Repositório Institucional do ISCTE-IUL

A Survey on Passing-through Control of Multi-Robot Systems in Cluttered Environments

Author: Bai Chenggang
Gao Yan
Quan Quan
Publication venue
Publication date: 13/11/2023
Field of study

This survey presents a comprehensive review of various methods and algorithms related to passing-through control of multi-robot systems in cluttered environments. Numerous studies have investigated this area, and we identify several avenues for enhancing existing methods. This survey describes some models of robots and commonly considered control objectives, followed by an in-depth analysis of four types of algorithms that can be employed for passing-through control: leader-follower formation control, multi-robot trajectory planning, control-based methods, and virtual tube planning and control. Furthermore, we conduct a comparative analysis of these techniques and provide some subjective and general evaluations.Comment: 18 pages, 19 figure

arXiv.org e-Print Archive

Neuroevolutionary reinforcement learning for generalized control of simulated helicopters

Author: A Moore
B Tanner
D Floreano
D Floreano
D Goldberg
D Pratihar
DE Moriarty
G Harik
G Tesauro
J Gauci
J Hurst
K.O Stanley
KO Stanley
L Cardamone
LP Kaelbling
LP Kaelbling
M Schmidt
N Hansen
O Maron
O Mihatsch
O Sigaud
P Abbeel
P De Boer
P Geibel
P Nordin
P Ponterosso
P Schroder
P Stagge
P Stone
R Brafman
R Regis
RE Bellman
RE Bellman
Rogier Koppejan
RS Sutton
RS Sutton
S Chen
S Whiteson
S Whiteson
S Whiteson
S Whiteson
S Wilson
Shimon Whiteson
X Yao
Y Jin
Y Jin
Y Ong
Publication venue: Springer-Verlag
Publication date: 01/01/2011
Field of study

This article presents an extended case study in the application of neuroevolution to generalized simulated helicopter hovering, an important challenge problem for reinforcement learning. While neuroevolution is well suited to coping with the domain’s complex transition dynamics and high-dimensional state and action spaces, the need to explore efficiently and learn on-line poses unusual challenges. We propose and evaluate several methods for three increasingly challenging variations of the task, including the method that won first place in the 2008 Reinforcement Learning Competition. The results demonstrate that (1) neuroevolution can be effective for complex on-line reinforcement learning tasks such as generalized helicopter hovering, (2) neuroevolution excels at finding effective helicopter hovering policies but not at learning helicopter models, (3) due to the difficulty of learning reliable models, model-based approaches to helicopter hovering are feasible only when domain expertise is available to aid the design of a suitable model representation and (4) recent advances in efficient resampling can enable neuroevolution to tackle more aggressively generalized reinforcement learning tasks

CiteSeerX

Crossref

Springer - Publisher Connector

PubMed Central

UvA-DARE

International Migration, Integration and Social Cohesion online publications

Controlling maximum evaluation duration in on-line and on-board evolutionary robotics

Author: AE Eiben
AE Eiben
E Haasdijk
Herbert Robbins
JC Gittins
M Sayed-Mouchaweh
N Kasabov
P Auer
Stefano Nolfi
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref