Search CORE

89 research outputs found

Decentralization of Multiagent Policies by Learning What to Communicate

Author: Chen Steven W.
Kumar Vijay
Paulos James
Shishika Daigo
Publication venue
Publication date: 25/03/2019
Field of study

Effective communication is required for teams of robots to solve sophisticated collaborative tasks. In practice it is typical for both the encoding and semantics of communication to be manually defined by an expert; this is true regardless of whether the behaviors themselves are bespoke, optimization based, or learned. We present an agent architecture and training methodology using neural networks to learn task-oriented communication semantics based on the example of a communication-unaware expert policy. A perimeter defense game illustrates the system's ability to handle dynamically changing numbers of agents and its graceful degradation in performance as communication constraints are tightened or the expert's observability assumptions are broken.Comment: 7 page

arXiv.org e-Print Archive

Crossref

Two-Player Reconnaissance Game with Half-Planar Target and Retreat Regions

Author: Bakolas Efstathios
Lee Yoonjae
Publication venue
Publication date: 04/10/2022
Field of study

This paper is concerned with a variation of a pursuit-evasion game called the Reconnaissance game, in which an Intruder attempts to approach a territory of interest (target region) and return to a safe zone (retreat region) in the presence of a faster Defender. The target and retreat regions are taken to be disjoint closed half-planes. The objective of the Intruder is two-fold: 1) minimize the distance between her position and the target region and 2) escape to the retreat region before being captured by the Defender. The Defender, on the other hand, strives to maximize the same distance and, if possible, capture the Intruder outside the retreat region. The game is decomposed into a series of two subgames: a Target game and an Escape game. A closed-form solution to each subgame, including the Value function and state-feedback saddle-point strategies, is derived separately. Furthermore, winning regions and barrier surfaces are constructed analytically. Numerical simulation results are presented to showcase the efficacy of the proposed solutions

arXiv.org e-Print Archive

The Barrier Surface in the Cooperative Football Differential Game

Author: Casbeer David W.
Garcia Eloy
Pachter Meir
Publication venue
Publication date: 05/06/2020
Field of study

This paper considers the blocking or football pursuit-evasion differential game. Two pursuers cooperate and try to capture the ball carrying evader as far as possible from the goal line. The evader wishes to be as close as possible to the goal line at the time of capture and, if possible, reach the line. In this paper the solution of the game of kind is provided: The Barrier surface that partitions the state space into two winning sets, one for the pursuer team and one for the evader, is constructed. Under optimal play, the winning team is determined by evaluating the associated Barrier function.Comment: 5 pages, 1 figur

arXiv.org e-Print Archive

AFTI Scholar (Air Force Institute of Technology)

Coordinated Defense Allocation in Reach-Avoid Scenarios with Efficient Online Optimization

Author: Chen Hua
Liu Junwei
Lu Haibo
Ouyang Zikai
Yang Jiahui
Zhang Wei
Publication venue
Publication date: 02/06/2023
Field of study

In this paper, we present a dual-layer online optimization strategy for defender robots operating in multiplayer reach-avoid games within general convex environments. Our goal is to intercept as many attacker robots as possible without prior knowledge of their strategies. To balance optimality and efficiency, our approach alternates between coordinating defender coalitions against individual attackers and allocating coalitions to attackers based on predicted single-attack coordination outcomes. We develop an online convex programming technique for single-attack defense coordination, which not only allows adaptability to joint states but also identifies the maximal region of initial joint states that guarantees successful attack interception. Our defense allocation algorithm utilizes a hierarchical iterative method to approximate integer linear programs with a monotonicity constraint, reducing computational burden while ensuring enhanced defense performance over time. Extensive simulations conducted in 2D and 3D environments validate the efficacy of our approach in comparison to state-of-the-art approaches, and show its applicability in wheeled mobile robots and quadcopters

arXiv.org e-Print Archive

Synchronous intercept strategies for a robotic defense-intrusion game with two defenders

Author: Clark Ruaridh
Huang Yunke
Lei Xiaokang
Liu Mingyong
Yang Panpan
Zhang Shuai
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 31/01/2021
Field of study

We study the defense-intrusion game, in which a single attacker robot tries to reach a stationary target that is protected by two defender robots. We focus on the "synchronous intercept problem", where both robots have to reach the attacker robot synchronously to intercept it. Assume that the attacker robot has the control policy which is based on attraction to the target and repulsion from the defenders, two kinds of synchronous intercept strategies are proposed for the defense-intrusion game, introduced here as Attacker-oriented and Neutral-position-oriented. Theoretical analysis and simulation results show that: (1) the two strategies are able to generate different synchronous intercept patterns: contact intercept pattern and stable non-contact intercept pattern, respectively. (2) The contact intercept pattern allows the defender robots to intercept the attacker robot in finite time, while the stable non-contact intercept pattern generates a periodic attractor that prevents the attack robot from reaching the target for infinite time. There is potential to apply the insights obtained into defense-intrusion in real systems, including aircraft escort and the defense of military targets or territorial boundaries

University of Strathclyde Institutional Repository