Mastering Zero-Shot Interactions in Cooperative and Competitive Simultaneous Games
Authors: Yannik Mahlau, Frederik Schubert, Bodo Rosenhahn
ICML 2024 | Conference PDF | Archive PDF | Plain Text | LLM Run Details
| Reproducibility Variable | Result | LLM Response |
|---|---|---|
| Research Type | Experimental | We perform an extensive evaluation of Albatross on a set of cooperative and competitive simultaneous perfect-information games. In contrast to Alpha Zero, Albatross is able to exploit weak agents in the competitive game of Battlesnake. |
| Researcher Affiliation | Academia | Yannik Mahlau 1 Frederik Schubert 1 Bodo Rosenhahn 1 1Department for Information Processing, Leibniz University Hannover, Germany. |
| Pseudocode | Yes | Algorithm 1 Training of Alpha Zero for simultaneous games |
| Open Source Code | Yes | To support reproducibility, all of our code as well as the trained models are open source1. 1https://github.com/ymahlau/albatross |
| Open Datasets | Yes | We evaluate Albatross in the Overcooked benchmark (Carroll et al., 2019) and the competitive game of Battlesnake (Chung et al., 2020). |
| Dataset Splits | No | The paper mentions 'training episodes' and evaluation on 'five different seeds' but does not provide explicit details about train/validation/test dataset splits with percentages or counts for reproducibility. |
| Hardware Specification | Yes | Unless specified otherwise, we used Nvidia RTX3090 GPU and 14 Intel Xeon Gold 6258R CPU for each GPU. Those numbers were chosen to optimally saturate the compute cluster used. |
| Software Dependencies | No | The paper mentions reimplementing Overcooked in C++ and discusses the use of Python (e.g., in the context of the environment), but it does not provide specific version numbers for any software dependencies or libraries. |
| Experiment Setup | Yes | To support reproducibility of our results, we report all hyperparameters used in our experiments. In this section, we list common hyperparameters used across all experiments. Hyperparameters, which differ between experiments are listed in Table 2. |