1,738 research outputs found
Path planning of multiple quadrotors using relative safe flight corridors and relative Bernstein polynomials
Thesis (Master's) -- Graduate School of Seoul National University: Department of Mechanical and Aerospace Engineering, College of Engineering, August 2020. Advisor: Hyoun Jin Kim.
Multi-agent systems consisting of unmanned aerial vehicles (UAVs) are receiving attention from
many industrial domains due to their mobility and applicability. To safely operate these multi-agent systems, a path planning algorithm that can generate safe, dynamically feasible trajectories is required. However, existing multi-agent trajectory planning methods may fail to generate multi-agent trajectories in obstacle-dense environments due to deadlock or optimization failure caused by infeasible collision constraints. In this paper, we present a new efficient algorithm which guarantees a solution for a class of multi-agent trajectory planning problems in obstacle-dense environments. Our algorithm combines the advantages of both grid-based and optimization-based approaches, and generates safe, dynamically feasible trajectories without suffering from an erroneous optimization setup such as imposing infeasible collision constraints. We adopt a sequential optimization method with dummy agents to improve the scalability of the algorithm, and utilize the convex hull property of Bernstein polynomials to replace non-convex collision avoidance constraints with convex ones. We validate the proposed algorithm through comparison with our previous work and an SCP-based method. The proposed method reduces the objective cost by more than 50% compared to our previous work, and reduces the computation time by more than 75% compared to the SCP-based method. Furthermore, the proposed method can compute trajectories for 64 agents in 6.36 seconds on average with an Intel Core i7-7700 @ 3.60GHz CPU and 16 GB RAM.
1 Introduction
1.1 Literature review
1.2 Thesis contribution
1.3 Thesis outline
2 Bernstein polynomial
2.1 Definition
2.2 Properties
2.2.1 Convex hull property
2.2.2 Endpoint interpolation property
2.2.3 Arithmetic operations and derivatives
3 Multi-agent trajectory optimization
3.1 Problem formulation
3.1.1 Assumption
3.1.2 Trajectory representation
3.1.3 Objective function
3.1.4 Convex constraints
3.1.5 Non-convex collision avoidance constraints
3.2 Collision constraints construction
3.2.1 Initial trajectory planning
3.2.2 Safe flight corridor
3.2.3 Relative safe flight corridor
3.3 Trajectory optimization
4 Sequential optimization with dummy agents
5 Experimental results
5.1 Comparison with the previous work
5.1.1 Success rate
5.1.2 Solution quality
5.1.3 Scalability analysis
5.2 Comparison with SCP-based method
5.3 Flight test
6 Conclusion
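The convex hull property of Bernstein polynomials (Section 2.2.1 above) is what allows non-convex collision avoidance constraints to be replaced with convex constraints on control points: a Bernstein (Bézier) curve lies entirely inside the convex hull of its control points, so bounding the control points bounds the whole trajectory. A minimal numerical sketch of this property (not the thesis code; the control points are invented for illustration):

```python
import numpy as np
from math import comb

def bezier(ctrl, t):
    # Evaluate a Bernstein-form curve sum_i C_i * B_{i,n}(t) at parameters t in [0, 1].
    n = len(ctrl) - 1
    t = np.asarray(t, float)[:, None]
    i = np.arange(n + 1)
    basis = np.array([comb(n, k) for k in i]) * t**i * (1 - t) ** (n - i)
    return basis @ np.asarray(ctrl, float)

# A planar curve with 4 control points (degree 3).
ctrl = np.array([[0.0, 0.0], [1.0, 3.0], [3.0, 3.0], [4.0, 0.0]])
pts = bezier(ctrl, np.linspace(0.0, 1.0, 101))

# Convex hull property: every sampled curve point lies inside the hull of the
# control points, hence within their per-axis bounding box.
assert (pts >= ctrl.min(0) - 1e-9).all() and (pts <= ctrl.max(0) + 1e-9).all()
```

The per-axis bounding-box check is a (weaker) consequence of the convex hull property, which is what makes it easy to verify numerically; the endpoint interpolation property (Section 2.2.2) also follows, since the basis collapses to the first/last control point at t = 0 and t = 1.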
Reinforcement Learning and Planning for Preference Balancing Tasks
Robots are often highly non-linear dynamical systems with many degrees of freedom, making solving motion problems computationally challenging. One solution has been reinforcement learning (RL), which learns through experimentation to automatically perform the near-optimal motions that complete a task. However, high-dimensional problems and task formulation often prove challenging for RL. We address these problems with PrEference Appraisal Reinforcement Learning (PEARL), which solves Preference Balancing Tasks (PBTs). PBTs define a problem as a set of preferences that the system must balance to achieve a goal. The method is appropriate for acceleration-controlled systems with continuous state spaces and either discrete or continuous action spaces with unknown system dynamics. We show that PEARL learns a sub-optimal policy on a subset of states and actions, and transfers the policy to the expanded domain to produce a more refined plan on a class of robotic problems. We establish convergence to task goal conditions, and even when preconditions are not verifiable, show that this is a valuable method to use before other more expensive approaches. Evaluation is done on several robotic problems, such as Aerial Cargo Delivery, Multi-Agent Pursuit, Rendezvous, and Inverted Flying Pendulum, both in simulation and experimentally. Additionally, PEARL is leveraged outside of robotics as an array sorting agent. The results demonstrate high accuracy and fast learning times on a large set of practical applications.
Optimal Guidance and Control with Nonlinear Dynamics Using Sequential Convex Programming
This paper presents a novel method for expanding the use of sequential convex programming (SCP) to the domain of optimal guidance and control problems with nonlinear dynamics constraints. SCP is a useful tool in obtaining real-time solutions to direct optimal control, but it is unable to adequately model nonlinear dynamics due to the linearization and discretization required. As nonlinear program solvers are not yet functioning in real-time, a tool is needed to bridge the gap between satisfying the nonlinear dynamics and completing execution fast enough to be useful. Two methods are proposed, sequential convex programming with nonlinear dynamics correction (SCPn) and modified SCPn (M-SCPn), which mixes SCP and SCPn to reduce runtime and improve algorithmic robustness. Both methods are proven to generate optimal state and control trajectories that satisfy the nonlinear dynamics. Simulations are presented to validate the efficacy of the methods as compared to SCP.
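The core loop of SCP that this abstract refers to — linearize the nonlinear constraint about the previous iterate, solve the resulting convex subproblem, and repeat until a fixed point — can be illustrated on a toy problem. This is a generic sketch of successive linearization, not the paper's SCPn or M-SCPn algorithms; the objective, constraint, and starting point are invented:

```python
import numpy as np

# Toy problem: minimize ||x||^2 subject to the nonlinear constraint
#     x2 = sin(x1) + 2.
# SCP idea: linearize sin(x1) about the previous iterate a, solve the
# convexified equality-constrained QP in closed form, and iterate.
x = np.zeros(2)
for _ in range(100):
    a = x[0]
    # Linearized constraint:  x2 - cos(a)*x1 = sin(a) - a*cos(a) + 2   (c @ x = b)
    c = np.array([-np.cos(a), 1.0])
    b = np.sin(a) - a * np.cos(a) + 2.0
    # Minimum-norm solution of:  min ||x||^2  s.t.  c @ x = b
    x = (b / (c @ c)) * c

# At a fixed point the linearization is exact at the iterate itself,
# so the original nonlinear constraint is satisfied.
residual = abs(x[1] - (np.sin(x[0]) + 2.0))
```

The closing comment is the key point the paper builds on: a converged SCP iterate satisfies the KKT conditions of the original nonlinear problem, but intermediate iterates need not satisfy the true dynamics, which is the gap SCPn's nonlinear dynamics correction targets.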
Proximal operators for multi-agent path planning
We address the problem of planning collision-free paths for multiple agents using optimization methods known as proximal algorithms. Recently this approach was explored in Bento et al. 2013, which demonstrated its ease of parallelization and decentralization, the speed with which the algorithms generate good quality solutions, and its ability to incorporate different proximal operators, each ensuring that paths satisfy a desired property. Unfortunately, the operators derived only apply to paths in 2D and require that any intermediate waypoints we might want agents to follow be preassigned to specific agents, limiting their range of applicability. In this paper we resolve these limitations. We introduce new operators to deal with agents moving in arbitrary dimensions that are faster to compute than their 2D predecessors, and we introduce landmarks, space-time positions that are automatically assigned to the set of agents under different optimality criteria. Finally, we report the performance of the new operators in several numerical experiments.
Comment: See movie at http://youtu.be/gRnsjd_ocx
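A proximal operator of the kind such methods compose can be sketched for a single pairwise separation constraint: project two agent positions onto the set where they are at least d apart. This is a hypothetical helper for illustration (any dimension, equal weights assumed), not code from the paper:

```python
import numpy as np

def prox_min_separation(p1, p2, d):
    """Project (p1, p2) onto the set {(q1, q2): ||q1 - q2|| >= d}.

    When the pair is too close, each point moves half the deficit along
    the line joining them; with equal weights this is the Euclidean
    projection, and the midpoint of the pair is preserved.
    """
    p1, p2 = np.asarray(p1, float), np.asarray(p2, float)
    diff = p1 - p2
    dist = np.linalg.norm(diff)
    if dist >= d:
        return p1, p2              # already feasible: projection is the identity
    u = diff / dist                # unit vector from p2 toward p1 (assumes dist > 0)
    shift = 0.5 * (d - dist)
    return p1 + shift * u, p2 - shift * u

# Two agents 1 unit apart, pushed out to the required 2-unit separation.
q1, q2 = prox_min_separation([0.0, 0.0, 0.0], [1.0, 0.0, 0.0], 2.0)
```

In a full proximal splitting scheme, operators like this one (one per constraint: separation, waypoints, smoothness) are applied repeatedly to copies of the trajectory variables and averaged, which is what makes the approach easy to parallelize and decentralize.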
- …