Notice: The reproducibility variables underlying each score are classified using an automated LLM-based pipeline, validated against a manually labeled dataset. LLM-based classification introduces uncertainty and potential bias; scores should be interpreted as estimates. Full accuracy metrics and methodology are described in [1].

Model-Based Opponent Modeling

Authors: XiaoPeng Yu, Jiechuan Jiang, Wanpeng Zhang, Haobin Jiang, Zongqing Lu

NeurIPS 2022 | Venue PDF | LLM Run Details

Reproducibility Variable Result LLM Response
Research Type Experimental Empirically, we show that MBOM achieves more effective adaptation than existing methods in a variety of tasks, respectively with different types of opponents, i.e., fixed policy, naïve learner, and reasoning learner.
Researcher Affiliation Academia Xiaopeng Yu Jiechuan Jiang Wanpeng Zhang Haobin Jiang Zongqing Lu School of Computer Science, Peking University
Pseudocode Yes Algorithm 1 MBOM
Open Source Code No The code for the main experiments will be released after acceptance.
Open Datasets Yes Triangle Game is an asymmetric zero-sum game implemented on Multi-Agent Particle Environments (MPE) [21].
Dataset Splits No The paper describes interacting with environments like MPE and Google Research Football but does not provide explicit training, validation, and test dataset splits with percentages or sample counts, as these are simulated environments rather than static datasets.
Hardware Specification Yes All the experiments were run on NVIDIA GeForce RTX 3090 GPUs.
Software Dependencies Yes We use PyTorch (v1.8.0) to implement our method and baselines.
Experiment Setup Yes More details about experimental settings and hyperparameters are available in Appendix E.