241,702 research outputs found
Decentralized Adaptive Formation via Consensus-Oriented Multi-Agent Communication
Adaptive multi-agent formation control, which requires the formation to
flexibly adjust along with the quantity variations of agents in a decentralized
manner, belongs to one of the most challenging issues in multi-agent systems,
especially under communication-limited constraints. In this paper, we propose a
novel Consensus-based Decentralized Adaptive Formation (Cons-DecAF) framework.
Specifically, we develop a novel multi-agent reinforcement learning method,
Consensus-oriented Multi-Agent Communication (ConsMAC), to enable agents to
perceive global information and establish the consensus from local states by
effectively aggregating neighbor messages. Afterwards, we leverage policy
distillation to accomplish the adaptive formation adjustment. Meanwhile,
instead of pre-assigning specific positions of agents, we employ a
displacement-based formation by Hausdorff distance to significantly improve the
formation efficiency. The experimental results through extensive simulations
validate that the proposed method has achieved outstanding performance in terms
of both speed and stability.Comment: 6 pages, 5 figure
Recommended from our members
Application of Techniques for MAP Estimation to Distributed Constraint Optimization Problem
The problem of efficiently finding near-optimal decisions in multi-agent systems has become increasingly important because of the growing number of multi-agent applications with large numbers of agents operating in real-world environments. In these systems, agents are often subject to tight resource constraints and agents have only local views. When agents have non-global constraints, each of which is independent, the problem can be formalized as a distributed constraint optimization problem (DCOP). The DCOP is closely associated with the problem of inference on graphical models. Many approaches from inference literature have been adopted to solve DCOPs. We focus on the Max-Sum algorithm and the Action-GDL algorithm that are DCOP variants of the popular inference algorithm called the Max-Product algorithm and the Belief Propagation algorithm respectively. The Max-Sum algorithm and the Action-GDL algorithm are well-suited for multi-agent systems because it is distributed by nature and requires less communication than most DCOP algorithms. However, the resource requirements of these algorithms are still high for some multi-agent domains and various aspects of the algorithms have not been well studied for use in general multi-agent settings.
This thesis is concerned with a variety of issues of applying the Max-Sum algorithms and the Action-GDL algorithm to general multi-agent settings. We develop a hybrid algorithm of ADOPT and Action-GDL in order to overcome the communication complexity of DCOPs. Secondly, we extend the Max-Sum algorithm to operate more efficiently in more general multi-agent settings in which computational complexity is high. We provide an algorithm that has a lower expected computational complexity for DCOPs even with n-ary constraints. Finally, In most DCOP literature, a one-to-one mapping between a variable and an agent is assumed. However, in real applications, many-to-one mappings are prevalent and can also be beneficial in terms of communication and hardware cost in situations where agents are acting as independent computing units. We consider how to exploit such mapping in order to increase efficiency
Efficient performative actions for e-commerce agents
The foundational features of multi-agent systems are communication and interaction with other agents. To achieve these features, agents have to transfer messages in the predefined format and semantics. The communication among these agents takes place with the help of ACL (Agent Communication Language). ACL is a predefined language for communication among agents that has been standardised by the FIPA (Foundation for Intelligent Physical Agent). FIPA-ACL defines different performatives for communication among the agents. These performatives are generic, and it becomes computationally expensive to use them for a specific domain like e-commerce. These performatives do not define the exact meaning of communication for any specific domain like e-commerce. In the present research, we introduced new performatives specifically for e-commerce domain. Our designed performatives are based on FIPA-ACL so that they can still support communication within diverse agent platforms. The proposed performatives are helpful in modelling e-commerce negotiation protocol applications using the paradigm of multi-agent systems for efficient communication. For exact semantic interpretation of the proposed performatives, we also performed formal modelling of these performatives using BNF. The primary objective of our research was to provide the negotiation facility to agents, working in an e-commerce domain, in a succinct way to reduce the number of negotiation messages, time consumption and network overhead on the platform. We used an e-commerce based bidding case study among agents to demonstrate the efficiency of our approach. The results showed that there was a lot of reduction in total time required for the bidding process
Communication-Efficient Decentralized Multi-Agent Reinforcement Learning for Cooperative Adaptive Cruise Control
Connected and autonomous vehicles (CAVs) promise next-gen transportation
systems with enhanced safety, energy efficiency, and sustainability. One
typical control strategy for CAVs is the so-called cooperative adaptive cruise
control (CACC) where vehicles drive in platoons and cooperate to achieve safe
and efficient transportation. In this study, we formulate CACC as a multi-agent
reinforcement learning (MARL) problem. Diverging from existing MARL methods
that use centralized training and decentralized execution which require not
only a centralized communication mechanism but also dense inter-agent
communication, we propose a fully-decentralized MARL framework for enhanced
efficiency and scalability. In addition, a quantization-based communication
scheme is proposed to reduce the communication overhead without significantly
degrading the control performance. This is achieved by employing randomized
rounding numbers to quantize each piece of communicated information and only
communicating non-zero components after quantization. Extensive experimentation
in two distinct CACC settings reveals that the proposed MARL framework
consistently achieves superior performance over several contemporary benchmarks
in terms of both communication efficiency and control efficacy.Comment: 11 pages, 7 figure
Collaborative Multi-Agent Video Fast-Forwarding
Multi-agent applications have recently gained significant popularity. In many
computer vision tasks, a network of agents, such as a team of robots with
cameras, could work collaboratively to perceive the environment for efficient
and accurate situation awareness. However, these agents often have limited
computation, communication, and storage resources. Thus, reducing resource
consumption while still providing an accurate perception of the environment
becomes an important goal when deploying multi-agent systems. To achieve this
goal, we identify and leverage the overlap among different camera views in
multi-agent systems for reducing the processing, transmission and storage of
redundant/unimportant video frames. Specifically, we have developed two
collaborative multi-agent video fast-forwarding frameworks in distributed and
centralized settings, respectively. In these frameworks, each individual agent
can selectively process or skip video frames at adjustable paces based on
multiple strategies via reinforcement learning. Multiple agents then
collaboratively sense the environment via either 1) a consensus-based
distributed framework called DMVF that periodically updates the fast-forwarding
strategies of agents by establishing communication and consensus among
connected neighbors, or 2) a centralized framework called MFFNet that utilizes
a central controller to decide the fast-forwarding strategies for agents based
on collected data. We demonstrate the efficacy and efficiency of our proposed
frameworks on a real-world surveillance video dataset VideoWeb and a new
simulated driving dataset CarlaSim, through extensive simulations and
deployment on an embedded platform with TCP communication. We show that
compared with other approaches in the literature, our frameworks achieve better
coverage of important frames, while significantly reducing the number of frames
processed at each agent.Comment: IEEE Transactions on Multimedia, 2023. arXiv admin note: text overlap
with arXiv:2008.0443
- …