22 research outputs found

TAPE: Leveraging Agent Topology for Cooperative Multi-Agent Policy Gradient

Author: Du Yali
Huang Kaiqi
Lou Xingzhou
Norman Timothy J.
Zhang Junge
Publication date: 15/01/2024

Multi-Agent Policy Gradient (MAPG) has made significant progress in recent years. However, centralized critics in state-of-the-art MAPG methods still face the centralized-decentralized mismatch (CDM) issue, which means sub-optimal actions by some agents will affect other agents' policy learning. While using individual critics for policy updates can avoid this issue, they severely limit cooperation among agents. To address this issue, we propose an agent topology framework, which decides whether other agents should be considered in the policy gradient and achieves a compromise between facilitating cooperation and alleviating the CDM issue. The agent topology allows agents to use coalition utility as the learning objective instead of the global utility given by centralized critics or the local utility given by individual critics. To constitute the agent topology, various models are studied. We propose Topology-based multi-Agent Policy gradiEnt (TAPE) for both stochastic and deterministic MAPG methods. We prove the policy improvement theorem for stochastic TAPE and give a theoretical explanation for the improved cooperation among agents. Experimental results on several benchmarks show that the agent topology is able to both facilitate agent cooperation and alleviate the CDM issue, improving the performance of TAPE. Finally, multiple ablation studies and a heuristic graph search algorithm are devised to show the efficacy of the agent topology.

arXiv.org e-Print Archive

TAPE: leveraging agent topology for cooperative multi-agent policy gradient

Author: Du Yali
Huang Kaiqi
Lou Xingzhou
Norman Tim
Zhang Junge
Publication venue: AAAI Press
Publication date: 24/03/2024

Multi-Agent Policy Gradient (MAPG) has made significant progress in recent years. However, centralized critics in state-of-the-art MAPG methods still face the centralized-decentralized mismatch (CDM) issue, which means sub-optimal actions by some agents will affect other agents' policy learning. While using individual critics for policy updates can avoid this issue, they severely limit cooperation among agents. To address this issue, we propose an agent topology framework, which decides whether other agents should be considered in the policy gradient and achieves a compromise between facilitating cooperation and alleviating the CDM issue. The agent topology allows agents to use coalition utility as the learning objective instead of the global utility given by centralized critics or the local utility given by individual critics. To constitute the agent topology, various models are studied. We propose Topology-based multi-Agent Policy gradiEnt (TAPE) for both stochastic and deterministic MAPG methods. We prove the policy improvement theorem for stochastic TAPE and give a theoretical explanation for the improved cooperation among agents. Experimental results on several benchmarks show that the agent topology is able to both facilitate agent cooperation and alleviate the CDM issue, improving the performance of TAPE. Finally, multiple ablation studies and a heuristic graph search algorithm are devised to show the efficacy of the agent topology.

Southampton (e-Prints Soton)

Targeting miR‐193a‐AML1‐ETO‐β‐catenin axis by melatonin suppresses the self‐renewal of leukaemia stem cells in leukaemia with t (8;21) translocation

Author: Bin Liang
Bin Zhou
Chongyun Xing
Haige Ye
Haiying Li
Li H
Linling Chen
Martinez‐Soria N
Shenmeng Gao
Xingzhou Huang
Yanfei Wu
Publication venue: Wiley

InsectBase: a resource for insect genomes and transcriptomes

Author: Chuanlin Yin
Dianhao Guo
Fei Li
Gengyu Shen
Huamei Xiao
Jinding Liu
Kaixiang Yu
Legeai
Lipman
Shuiqing Huang
Shuping Wang
Xingzhou Ma
Ying Liu
Yiqun Zhang
Zan Zhang
Publication venue: Oxford University Press (OUP)

Study on application of polyenzyme method to offal of Harengula zunasi

Author: C. H. Zhang
Dalian Light Industry College South China Technology University
Deng Shanggui
G. Y. Wang
J. C. Wu
S. G. Deng
Xia Xingzhou
Y. S. Chi
Yang Ping
Z. B. Huang
Publication venue: Springer Science and Business Media LLC

An advanced wet method for simultaneous removal of SO2 and NO from coal-fired flue gas by utilizing a complex absorbent

Author: Abdulhamid
Amin
Beattie
Bo Yuan
Ding
Ding
Dong
Fang
Hu
Huang
Hutson
Ighigeanu
Ishizuka
Jin
Lee
Li
Li
Liu
Ma
Mitchell
Nasonova
Obradović
Onda
Runlong Hao
Sun
Tatzber
Wang
Wang
Wang
Xingzhou Mao
Yaoyu Zhang
Yi Zhao
Yuanpeng Li
Zhao
Zhao
Zhao
Zhao
Zhaoyue Wang
Publication venue: Elsevier BV

Mineralogy and inorganic geochemistry of the Es4 shales of the Damintun Sag, northeast of the Bohai Bay Basin: Implication for depositional environment

Author: Abanda
Anders
Balaram
Berner
Berner
Bhatia
Bo Liu
Condie
Cox
Cullers
Elderfield
Espitalié
Fedo
Ghosh
Haixue Wang
Harnois
Hayashi
Hongxia Li
Huang
Jones
Li
Lijuan Cheng
Lina Meng
Liu
McLennan
Middleburg
Nesbitt
Nesbitt
Pourmand
Qiu
Rimstidt
Roser
Sun
Tao
Taylor
Tole
Wang
Westrich
Wronkiewicz
Wronkiewicz
Xia
Xingzhou Liu
Yang
Zhang
Zhang
Zhou
Publication venue: Elsevier BV

Diagenesis and Very Low-grade Metamorphism of the Upper Permian Yangjiagou Formation in Eastern Changchun, China: Evidence from Clay Mineral Geothermobarometers

Author: Brime
Bureau of Geology and Mineral Resources of Jilin Province
Chengwen
Chengwen
Chunyu
Daqian
Daqian
Daqian
Daqian
Daqian
Deyou
Essene
Frey
Frey
Fuyuan
Greenwood
Guidotti
Guidotti
Hejin
Hejin
Hejin
Hejin
Hejin
Huang
Jianchang
Jiang
Jiang
Kemp
Kisch
Kurt
Kübler
Lu
Manyun
Marsch
Michael
Ming
Nieto
Padan
Qihan
Rausell-Colom
Robinson
Sengör
Shenbao
Shenbao
Stefano
Sébastien
Uysal
Weber
Wenliang
Xianmei
Xingzhou
Yanbin
Yuansheng
Yunsheng
Yuqi
Publication venue: Wiley

Simultaneous removal of SO2, NO and Hg0 from flue gas using vaporized oxidant catalyzed by Fe/ZSM-5

Author: Adewuyi
Adewuyi
Adewuyi
Adewuyi
Adewuyi
Ay
Bo Yuan
Cheng
Costa
Ding
Dong
Fang
Hao
Hao
Hao
Huang
Lam
Li
Liu
Liu
Liu
Liu
Liu
Liu
Liu
Liu
Liu
Liu
Liu
Liu
Liu
Liu
Ramirez
Runlong Hao
Rusevova
Sashkina
Segura
Wang
Wang
Wen
Wu
Wu
Xingzhou Mao
Xiong
Yi Zhao
Zehui Zheng
Zhao
Zhao
Zhao
Zhao
Zhao
Zhao
Zhao
Zhao
Zhao
Zhao
Zhou
Zhou
Zhou
Zou
Publication venue: Elsevier BV

Pathway-based evaluation of 380 candidate genes and lung cancer susceptibility suggests the importance of the cell cycle pathway

Author: Alberts
Barrett
Benjamini
Chapman
Chari
Chen
Dudbridge
Duronio
Excoffier
Franke
Garcia-Closas
Gauvreau
Gorgoulis
H.Dean Hosgood
Hartl
Hu
Huang
Idan Menashe
Jeff Yuenger
Kandel
Kang
Lan
Lan
Lan
Lan
Lan
Landi
Laurence
Lee
Lim
Meredith Yeager
Min Shen
Mumford
Mumford
Neil E. Caporaso
Nilanjan Chatterjee
Ougolkov
Parkin
Pass
Preetha Rajaraman
Qing Lan
Schaid
Shakoori
Shen
Stephen J. Chanock
Straif
Tateishi
Tian
Tongzhang Zheng
Wacholder
Xingzhou He
Yong Zhu
Zheng
Publication venue: Oxford University Press (OUP)
