Goals are Enough: Inducing AdHoc cooperation among unseen Multi-Agent
  systems in IMFs

Das, Abir; Dey, Kaushik; Perepu, Satheesh K.

Goals are Enough: Inducing AdHoc cooperation among unseen Multi-Agent systems in IMFs

Authors: Abir Das
Kaushik Dey
Satheesh K. Perepu
Publication date: 26 October 2023
Publisher

Abstract

Intent-based management will play a critical role in achieving customers' expectations in the next-generation mobile networks. Traditional methods cannot perform efficient resource management since they tend to handle each expectation independently. Existing approaches, e.g., based on multi-agent reinforcement learning (MARL) allocate resources in an efficient fashion when there are conflicting expectations on the network slice. However, in reality, systems are often far more complex to be addressed by a standalone MARL formulation. Often there exists a hierarchical structure of intent fulfilment where multiple pre-trained, self-interested agents may need to be further orchestrated by a supervisor or controller agent. Such agents may arrive in the system adhoc, which then needs to be orchestrated along with other available agents. Retraining the whole system every time is often infeasible given the associated time and cost. Given the challenges, such adhoc coordination of pre-trained systems could be achieved through an intelligent supervisor agent which incentivizes pre-trained RL/MARL agents through sets of dynamic contracts (goals or bonuses) and encourages them to act as a cohesive unit towards fulfilling a global expectation. Some approaches use a rule-based supervisor agent and deploy the hierarchical constituent agents sequentially, based on human-coded rules. In the current work, we propose a framework whereby pre-trained agents can be orchestrated in parallel leveraging an AI-based supervisor agent. For this, we propose to use Adhoc-Teaming approaches which assign optimal goals to the MARL agents and incentivize them to exhibit certain desired behaviours. Results on the network emulator show that the proposed approach results in faster and improved fulfilment of expectations when compared to rule-based approaches and even generalizes to changes in environments.Comment: Accepted for publication in IEEE CCNC 2024 conferenc

Similar works

Full text

Available Versions

arXiv.org e-Print Archive

oai:arXiv.org:2310.17416

Last time updated on 16/01/2024