reproducibilityindex.ai

Finding Friend and Foe in Multi-Agent Games

Authors: Jack Serrino, Max Kleiman-Weiner, David C. Parkes, Josh Tenenbaum

NeurIPS 2019 | Conference PDF | Archive PDF | Plain Text | LLM Run Details

Reproducibility Variable	Result	LLM Response
Research Type	Experimental	Empirical game-theoretic methods show that Deep Role outperforms other hand-crafted and learned agents in ﬁve-player Avalon. Deep Role played with and against human players on the web in hybrid human-agent teams. We ﬁnd that Deep Role outperforms human players as both a cooperator and a competitor.
Researcher Affiliation	Collaboration	Jack Serrino MIT jserrino@mit.edu Max Kleiman-Weiner Harvard, MIT, Diffeo maxkleimanweiner@fas.harvard.edu David C. Parkes Harvard University parkes@eecs.harvard.edu Joshua B. Tenenbaum MIT, CBMM jbt@mit.edu
Pseudocode	Yes	See Appendix A and Alg. 3 for details of the network training algorithm, procedure, parameters and compute details.
Open Source Code	Yes	Source code and experimental data is available here: https://github.com/Detry322/Deep Role.
Open Datasets	No	The paper does not provide concrete access information (link, DOI, repository, or formal citation) for a publicly available or open dataset. It describes generating its own training data through self-play and human interactions but does not make this data publicly accessible.
Dataset Splits	No	The paper does not provide specific dataset split information (percentages, sample counts, or citations to predefined splits) needed to reproduce the data partitioning.
Hardware Specification	No	The paper does not provide specific hardware details (exact GPU/CPU models, processor types, or memory amounts) used for running its experiments.
Software Dependencies	No	The paper does not provide specific ancillary software details (e.g., library or solver names with version numbers) needed to replicate the experiment.
Experiment Setup	Yes	Our network feeds a one-hot encoded vector of the proposer player i and the belief vector b into two fully-connected hidden layers of 80 Re LU units. These feed into a fully-connected win probability layer with sigmoid activation.