Recursive Reasoning Graph for Multi-Agent Reinforcement Learning
Authors: Xiaobai Ma, David Isele, Jayesh K. Gupta, Kikuo Fujimura, Mykel J. Kochenderfer
AAAI 2022, pp. 7664–7671
| Reproducibility Variable | Result | LLM Response |
|---|---|---|
| Research Type | Experimental | The proposed algorithm, referred to as the Recursive Reasoning Graph (R2G), shows state-of-the-art performance on multiple multi-agent particle and robotics games. |
| Researcher Affiliation | Collaboration | Xiaobai Ma¹, David Isele², Jayesh K. Gupta¹, Kikuo Fujimura², Mykel J. Kochenderfer¹ (¹Stanford University, ²Honda Research Institute US) |
| Pseudocode | Yes | Algorithm 1: Recursive Reasoning Graph (R2G) |
| Open Source Code | No | The paper does not contain an explicit statement about releasing the source code for the described methodology or a direct link to a code repository. |
| Open Datasets | Yes | Particle World (Lowe et al. 2017) and Robo Sumo (Al Shedivat et al. 2018) are cited, indicating the use of established environments/benchmarks, which typically imply publicly available data or simulators. |
| Dataset Splits | No | The paper describes experiments conducted within multi-agent simulation environments (Particle World, Robo Sumo) rather than on fixed datasets with traditional train/validation/test splits. Therefore, it does not provide explicit dataset split percentages or counts. |
| Hardware Specification | No | The paper does not provide specific details about the hardware (e.g., GPU/CPU models, memory specifications) used for running the experiments. |
| Software Dependencies | No | The paper does not specify software dependencies with version numbers (e.g., Python, PyTorch, TensorFlow versions, or specific library versions) that would be needed for reproducibility. |
| Experiment Setup | No | While the paper mentions aspects like '5 random seeds' for evaluation and discusses the experimental environments, it does not provide specific hyperparameter values (e.g., learning rate, batch size, number of epochs) or detailed training configurations within the main text. |
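The paper's evaluation protocol is reported only at the level of "5 random seeds," with no published code or hyperparameters. A minimal sketch of how such multi-seed results are typically aggregated is shown below; the `evaluate` function is a hypothetical stand-in (an assumption, not the paper's implementation), returning a placeholder episodic score for each seed.

```python
import random
import statistics

def evaluate(seed: int) -> float:
    """Hypothetical stand-in for one evaluation run of a trained policy.

    The paper does not release its evaluation code, so this toy function
    merely returns a seed-dependent placeholder score.
    """
    rng = random.Random(seed)
    return 100.0 + rng.gauss(0.0, 5.0)  # placeholder episodic return

# Five seeds, mirroring the "5 random seeds" protocol mentioned in the paper.
SEEDS = [0, 1, 2, 3, 4]
scores = [evaluate(s) for s in SEEDS]

# Report mean ± standard deviation across seeds, the usual way such
# multi-agent RL results are summarized.
mean = statistics.mean(scores)
stdev = statistics.stdev(scores)
print(f"return: {mean:.1f} +/- {stdev:.1f} over {len(SEEDS)} seeds")
```

Without the original seed values, hyperparameters, or environment versions, a sketch like this can reproduce the aggregation procedure but not the reported numbers themselves.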