
    Reinforcement Learning for UAV Attitude Control

    Autopilot systems are typically composed of an "inner loop" providing stability and control, and an "outer loop" responsible for mission-level objectives, e.g. way-point navigation. Autopilot systems for UAVs are predominantly implemented using Proportional-Integral-Derivative (PID) control systems, which have demonstrated exceptional performance in stable environments. However, more sophisticated control is required to operate in unpredictable and harsh environments. Intelligent flight control is an active area of research addressing the limitations of PID control, most recently through the use of reinforcement learning (RL), which has had success in other applications such as robotics. Previous work, however, has focused primarily on using RL at the mission-level controller. In this work, we investigate the performance and accuracy of the inner control loop providing attitude control when using intelligent flight control systems trained with the state-of-the-art RL algorithms Deep Deterministic Policy Gradient (DDPG), Trust Region Policy Optimization (TRPO), and Proximal Policy Optimization (PPO). To investigate these unknowns, we first developed an open-source high-fidelity simulation environment to train a flight controller for attitude control of a quadrotor through RL. We then use our environment to compare the performance of the RL-trained controllers to that of a PID controller, to identify whether RL is appropriate for high-precision, time-critical flight control.
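    The kind of comparison described above can be prototyped with off-the-shelf tooling. Below is a minimal sketch of training a PPO attitude-control policy with the stable-baselines3 library in a Gym-style quadrotor simulator; the environment id "QuadAttitude-v0", the observation/action conventions, and the reward interpretation are illustrative assumptions, not the paper's actual setup.

```python
# Minimal sketch: train a PPO attitude-control policy in a Gym-style
# quadrotor simulator, then measure tracking error for comparison with
# a PID baseline. "QuadAttitude-v0" is a placeholder environment id.
import gymnasium as gym
import numpy as np
from stable_baselines3 import PPO

env = gym.make("QuadAttitude-v0")   # obs: angular-rate errors; act: motor commands (assumed)
model = PPO("MlpPolicy", env, learning_rate=3e-4, verbose=1)
model.learn(total_timesteps=1_000_000)

# Roll out the trained policy and record per-step tracking error
obs, _ = env.reset()
errors = []
for _ in range(1000):
    action, _ = model.predict(obs, deterministic=True)
    obs, reward, terminated, truncated, _ = env.step(action)
    errors.append(-reward)          # assumes reward = negative tracking error
    if terminated or truncated:
        obs, _ = env.reset()
print("mean tracking error:", np.mean(errors))
```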

    Mathematical modeling and vertical flight control of a tilt-wing UAV

    This paper presents a mathematical model and vertical flight control algorithms for a new tilt-wing unmanned aerial vehicle (UAV). The vehicle is capable of vertical take-off and landing (VTOL); due to its tilt-wing structure, it can also fly horizontally. The mathematical model of the vehicle is obtained using the Newton-Euler formulation. A gravity-compensated PID controller is designed for altitude control, and three PID controllers are designed for attitude stabilization of the vehicle. The performance of these controllers is found to be quite satisfactory, as demonstrated by indoor and outdoor flight experiments.
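    As a rough illustration of the altitude loop described above, the sketch below adds a gravity feedforward term (m*g) to a standard PID correction on the altitude error; the mass, gains, and control period are invented example values, not the paper's.

```python
# Illustrative gravity-compensated altitude PID. Mass, gains, and the
# control period are invented example values, not the paper's.
class PID:
    def __init__(self, kp, ki, kd, dt):
        self.kp, self.ki, self.kd, self.dt = kp, ki, kd, dt
        self.integral = 0.0
        self.prev_err = 0.0

    def step(self, err):
        self.integral += err * self.dt
        deriv = (err - self.prev_err) / self.dt
        self.prev_err = err
        return self.kp * err + self.ki * self.integral + self.kd * deriv

MASS, G, DT = 1.2, 9.81, 0.01        # kg, m/s^2, s (example numbers)
altitude_pid = PID(kp=4.0, ki=0.5, kd=2.5, dt=DT)

def thrust_command(z_ref, z):
    """Total thrust = gravity feedforward (m*g) + PID correction on altitude error."""
    return MASS * G + altitude_pid.step(z_ref - z)
```

    The feedforward term lets the integral part absorb only residual model error rather than the full vehicle weight, which typically reduces integrator windup at take-off.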

    Learning Unmanned Aerial Vehicle Control for Autonomous Target Following

    While deep reinforcement learning (RL) methods have achieved unprecedented success on a range of challenging problems, their applicability has been limited mainly to simulation and game domains due to the high sample complexity of the trial-and-error learning process. Real-world robotic applications, by contrast, often need a data-efficient learning process that respects safety-critical constraints. In this paper, we consider the challenging problem of learning unmanned aerial vehicle (UAV) control for tracking a moving target. To acquire a strategy that combines perception and control, we represent the policy by a convolutional neural network. We develop a hierarchical approach that combines a model-free policy gradient method with a conventional feedback proportional-integral-derivative (PID) controller to enable stable learning without catastrophic failure. The neural network is trained by a combination of supervised learning from raw images and reinforcement learning from games of self-play. We show that the proposed approach can learn a target-following policy efficiently in a simulator, and that the learned behavior can be successfully transferred to the DJI quadrotor platform for real-world UAV control.
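    A hedged sketch of the hierarchical idea: a small convolutional network proposes a velocity setpoint from the camera image, while a conventional PID loop tracks that setpoint so the vehicle stays stable even while the policy is still poor. The architecture and gains below are illustrative assumptions, not the authors' network.

```python
# Sketch: a CNN proposes a 2-D velocity setpoint from the camera image;
# an inner PID loop tracks it. Architecture and gains are assumptions.
import torch
import torch.nn as nn

class TargetPolicy(nn.Module):
    """Maps a raw RGB image to a velocity setpoint (vx, vy)."""
    def __init__(self):
        super().__init__()
        self.net = nn.Sequential(
            nn.Conv2d(3, 16, kernel_size=5, stride=2), nn.ReLU(),
            nn.Conv2d(16, 32, kernel_size=5, stride=2), nn.ReLU(),
            nn.Flatten(),
            nn.LazyLinear(64), nn.ReLU(),
            nn.Linear(64, 2),
        )

    def forward(self, image):                # image: (B, 3, H, W)
        return self.net(image)

def pid_track(v_ref, v, state, dt=0.02, kp=1.5, ki=0.1, kd=0.05):
    """Inner PID loop turning the policy's setpoint into an attitude command.
    state = (integral, prev_err); returns (command, new_state)."""
    integral, prev_err = state
    err = v_ref - v
    integral = integral + err * dt
    deriv = (err - prev_err) / dt
    return kp * err + ki * integral + kd * deriv, (integral, err)
```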

    Adaptive and learning-based formation control of swarm robots

    Autonomous aerial and wheeled mobile robots play a major role in tasks such as search and rescue, transportation, monitoring, and inspection. However, these operations face several open challenges, including robust autonomy and adaptive coordination based on the environment and operating conditions, particularly in robot swarms with limited communication and perception capabilities. Furthermore, the computational complexity increases exponentially with the number of robots in the swarm. This thesis examines two different aspects of the formation control problem. On the one hand, we investigate how formation control can be performed by swarm robots with limited communication and perception (e.g., the Crazyflie nano quadrotor). On the other hand, we explore human-swarm interaction (HSI) and different shared-control mechanisms between humans and swarm robots (e.g., BristleBot) for artistic creation. In particular, we combine bio-inspired techniques (e.g., flocking, foraging) with learning-based control strategies (using artificial neural networks) for adaptive control of multi-robot systems. We first review how learning-based control and networked dynamical systems can be used to assign distributed and decentralized policies to individual robots such that the desired formation emerges from their collective behavior. We then present a novel flocking control for a UAV swarm using deep reinforcement learning. We formulate the flocking formation problem as a partially observable Markov decision process (POMDP) and consider a leader-follower configuration, where consensus among all UAVs is used to train a shared control policy and each UAV acts on the local information it collects. In addition, to avoid collisions among UAVs and to guarantee flocking and navigation, the reward function combines a global flocking-maintenance term, a mutual reward, and a collision penalty. We adapt deep deterministic policy gradient (DDPG) with centralized training and decentralized execution to obtain the flocking control policy, using actor-critic networks and a global state-space matrix. In the context of swarm robotics in the arts, we investigate how the formation paradigm can serve as an interaction modality for artists to aesthetically utilize swarms. In particular, we explore particle swarm optimization (PSO) and random walks to control the communication between a team of robots with swarming behavior for musical creation.
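    The reward shape described above (flocking maintenance toward the leader, a mutual spacing reward, and a collision penalty) might be sketched as follows; the distances, weights, and thresholds are assumed values for illustration, not those used in the thesis.

```python
# Hedged sketch of a flocking reward with the three terms named above.
# All weights and distance thresholds are illustrative assumptions.
import numpy as np

def flocking_reward(positions, leader_pos, d_safe=0.5, d_des=2.0,
                    w_flock=1.0, w_mutual=0.5, w_collide=10.0):
    """positions: (N, 3) follower UAV positions; leader_pos: (3,)."""
    # Flocking maintenance: penalize mean distance to the leader
    r_flock = -w_flock * np.mean(np.linalg.norm(positions - leader_pos, axis=1))

    # Mutual reward: keep pairwise spacing near the desired distance d_des
    diffs = positions[:, None, :] - positions[None, :, :]
    dists = np.linalg.norm(diffs, axis=-1)
    iu = np.triu_indices(len(positions), k=1)      # unique pairs only
    r_mutual = -w_mutual * np.mean(np.abs(dists[iu] - d_des))

    # Collision penalty: large negative reward for each pair below d_safe
    r_collide = -w_collide * np.sum(dists[iu] < d_safe)

    return r_flock + r_mutual + r_collide
```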

    Review of PID Controller Applications for UAVs

    Unmanned Aerial Vehicles (UAVs) have gained widespread recognition for their diverse applications, ranging from surveillance to delivery services. Among the various control algorithms employed to stabilize and navigate UAVs, the Proportional-Integral-Derivative (PID) controller stands out as a classical yet robust solution. This review provides a comprehensive examination of PID controller applications in the context of UAVs, addressing their fundamental principles, dynamics modeling, stability control, navigation tasks, parameter tuning methods, challenges, and future directions.
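    As one concrete example of the tuning methods such reviews cover, the classic closed-loop Ziegler-Nichols rule maps the experimentally found ultimate gain Ku and oscillation period Tu to PID gains:

```python
# Classic Ziegler-Nichols closed-loop tuning: Kp = 0.6*Ku, Ti = Tu/2, Td = Tu/8.
def ziegler_nichols_pid(ku: float, tu: float) -> dict:
    """ku: ultimate gain at sustained oscillation; tu: oscillation period [s]."""
    kp = 0.6 * ku
    ki = kp / (tu / 2.0)   # = 1.2 * Ku / Tu
    kd = kp * (tu / 8.0)   # = 0.075 * Ku * Tu
    return {"kp": kp, "ki": ki, "kd": kd}

# Example: sustained oscillation observed at Ku = 8.0 with period Tu = 0.4 s
print(ziegler_nichols_pid(8.0, 0.4))  # {'kp': 4.8, 'ki': 24.0, 'kd': 0.24}
```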

    High fidelity progressive reinforcement learning for agile maneuvering UAVs

    In this work, we present a high-fidelity, model-based progressive reinforcement learning method for control system design for an agile maneuvering UAV. Our work relies on a simulation-based training and testing environment for software-in-the-loop (SIL), hardware-in-the-loop (HIL), and integrated flight testing within a photo-realistic virtual reality (VR) environment. Through progressive learning with high-fidelity agent and environment models, the guidance and control policies build agile maneuvering on top of fundamental control laws. First, we provide insight into the development of high-fidelity mathematical models using frequency-domain system identification. These models are later used to design reinforcement learning based adaptive flight control laws, allowing the vehicle to be controlled over a wide range of operating conditions covering model changes due to payload, battery voltage, and damage to actuators and electronic speed controllers (ESCs). We then design the outer-loop flight guidance and control laws. Our current progress is summarized in this paper.
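    The frequency-domain identification step mentioned above can be illustrated as follows: excite the plant with a chirp and estimate the frequency response from cross- and auto-spectra (the H1 estimator). The second-order plant and all parameters below are stand-ins, not the authors' UAV model.

```python
# Sketch of frequency-domain identification: drive a plant with a chirp
# and estimate its frequency response with the H1 estimator Puy/Puu.
# The second-order plant and all parameters are stand-in assumptions.
import numpy as np
from scipy import signal

fs = 200.0                                       # sample rate [Hz], assumed
t = np.arange(0, 30, 1 / fs)
u = signal.chirp(t, f0=0.1, t1=t[-1], f1=20.0)   # frequency-sweep input

plant = signal.TransferFunction([40.0], [1.0, 6.0, 40.0])  # stand-in dynamics
_, y, _ = signal.lsim(plant, u, t)

f, p_uu = signal.csd(u, u, fs=fs, nperseg=1024)  # input auto-spectrum
_, p_uy = signal.csd(u, y, fs=fs, nperseg=1024)  # input-output cross-spectrum
h = p_uy / p_uu                                  # empirical frequency response
print("gain at ~1 Hz:", abs(h[np.argmin(abs(f - 1.0))]))
```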