Search CORE

8,065 research outputs found

Reinforcement Learning for UAV Attitude Control

Author: Bestavros Azer
Koch William
Mancuso Renato
West Richard
Publication venue
Publication date: 01/04/2018
Field of study

Autopilot systems are typically composed of an "inner loop" providing stability and control, while an "outer loop" is responsible for mission-level objectives, e.g. way-point navigation. Autopilot systems for UAVs are predominately implemented using Proportional, Integral Derivative (PID) control systems, which have demonstrated exceptional performance in stable environments. However more sophisticated control is required to operate in unpredictable, and harsh environments. Intelligent flight control systems is an active area of research addressing limitations of PID control most recently through the use of reinforcement learning (RL) which has had success in other applications such as robotics. However previous work has focused primarily on using RL at the mission-level controller. In this work, we investigate the performance and accuracy of the inner control loop providing attitude control when using intelligent flight control systems trained with the state-of-the-art RL algorithms, Deep Deterministic Gradient Policy (DDGP), Trust Region Policy Optimization (TRPO) and Proximal Policy Optimization (PPO). To investigate these unknowns we first developed an open-source high-fidelity simulation environment to train a flight controller attitude control of a quadrotor through RL. We then use our environment to compare their performance to that of a PID controller to identify if using RL is appropriate in high-precision, time-critical flight control.Comment: 13 pages, 9 figure

arXiv.org e-Print Archive

Boston University Institutional Repository (OpenBU)

Towards adaptive multi-robot systems: self-organization and self-adaptation

Author: Albayrak Sahin
Hrabia Christopher-Eyk
Lützenberger Marco
Publication venue
Publication date: 04/10/2018
Field of study

Dieser Beitrag ist mit Zustimmung des Rechteinhabers aufgrund einer (DFG geförderten) Allianz- bzw. Nationallizenz frei zugänglich.This publication is with permission of the rights owner freely accessible due to an Alliance licence and a national licence (funded by the DFG, German Research Foundation) respectively.The development of complex systems ensembles that operate in uncertain environments is a major challenge. The reason for this is that system designers are not able to fully specify the system during specification and development and before it is being deployed. Natural swarm systems enjoy similar characteristics, yet, being self-adaptive and being able to self-organize, these systems show beneficial emergent behaviour. Similar concepts can be extremely helpful for artificial systems, especially when it comes to multi-robot scenarios, which require such solution in order to be applicable to highly uncertain real world application. In this article, we present a comprehensive overview over state-of-the-art solutions in emergent systems, self-organization, self-adaptation, and robotics. We discuss these approaches in the light of a framework for multi-robot systems and identify similarities, differences missing links and open gaps that have to be addressed in order to make this framework possible

Separating Agent-Functioning and Inter-Agent Coordination by Activated Modules: The DECOMAS Architecture

Author: A. S. Rao
Alessandro F. Garcia
Alessandro Garcia
Amit Shabtay
Anand S. Rao
Cidiane Lobato
D. Garlan
D. L. Parnas
David Gelernter
E. Bonabeau
G. D. M. Serugendo
G. Di Marzo Serugendo
Gordon D. Plotkin
Gregor Kiczales
Jan Sudeikat
Jan Sudeikat
Jan Sudeikat
Jan Sudeikat
Jan Sudeikat
Jan Sudeikat
Jan Sudeikat
Jan Sudeikat
Koen Hindriks
L. Braubach
Lars Braubach
Linda M. Seiter
M. Birna van Riemsdijk
Mark Miller
Mazeiar Salehie
Mehdi Dastani
Mikhail Prokopenko
Paolo Busetta
Rafael Bordini
Rajarshi Das Jeffrey
Renata Vieira
T. DeWolf
Tom Van Cutsem
Wolfgang Renz
Yuriy Brun
Publication venue: 'Open Publishing Association'
Publication date: 01/06/2010
Field of study

The embedding of self-organizing inter-agent processes in distributed software applications enables the decentralized coordination system elements, solely based on concerted, localized interactions. The separation and encapsulation of the activities that are conceptually related to the coordination, is a crucial concern for systematic development practices in order to prepare the reuse and systematic integration of coordination processes in software systems. Here, we discuss a programming model that is based on the externalization of processes prescriptions and their embedding in Multi-Agent Systems (MAS). One fundamental design concern for a corresponding execution middleware is the minimal-invasive augmentation of the activities that affect coordination. This design challenge is approached by the activation of agent modules. Modules are converted to software elements that reason about and modify their host agent. We discuss and formalize this extension within the context of a generic coordination architecture and exemplify the proposed programming model with the decentralized management of (web) service infrastructures

arXiv.org e-Print Archive

Directory of Open Access Journals