TAPE: Leveraging Agent Topology for Cooperative Multi-Agent Policy Gradient

Authors: Xingzhou Lou, Junge Zhang, Timothy J. Norman, Kaiqi Huang, Yali Du

AAAI 2024 | Conference PDF | Archive PDF | Plain Text | LLM Run Details

Reproducibility Variable Result LLM Response
Research Type Experimental Experiment results on several benchmarks show the agent topology is able to facilitate agent cooperation and alleviate CDM issue respectively to improve performance of TAPE. Finally, multiple ablation studies and a heuristic graph search algorithm are devised to show the efficacy of the agent topology.
Researcher Affiliation Academia 1School of Artificial Intelligence, University of Chinese Academy of Sciences 2Institute of Automation, Chinese Academy of Sciences 3University of Southampton 4King s College London
Pseudocode Yes Pseudo-code and more details of stochastic TAPE are provided in the appendix E.1.
Open Source Code Yes Our code is available here1. 1github.com/LxzGordon/TAPE
Open Datasets Yes Level-based foraging (Papoudakis et al. 2021) and Starcraft II Multi-Agent Challenge (SMAC) (Samvelyan et al. 2019)
Dataset Splits No The paper mentions using common benchmarks (LBF, SMAC) but does not explicitly provide details on train/validation/test dataset splits, specific percentages, or sample counts, nor does it cite a resource that defines these splits for its experiments.
Hardware Specification No The paper does not provide any specific hardware details such as GPU/CPU models, processor types, or memory amounts used for running experiments.
Software Dependencies No The paper does not provide specific version numbers for any software dependencies, libraries, or solvers used in the experiments.
Experiment Setup No The paper states that "All algorithms are run for four times with different random seeds. Each run lasts for 5 × 10^6 environmental steps. During training, each algorithm has four parallel environment to collect training data," but it does not explicitly provide specific hyperparameter values (e.g., learning rate, batch size, optimizer settings) or detailed configuration steps for the experimental setup in the main text.