Notice: The reproducibility variables underlying each score are classified using an automated LLM-based pipeline, validated against a manually labeled dataset. LLM-based classification introduces uncertainty and potential bias; scores should be interpreted as estimates. Full accuracy metrics and methodology are described in [1].
Polynomial-Time Optimal Equilibria with a Mediator in Extensive-Form Games
Authors: Brian Zhang, Tuomas Sandholm
NeurIPS 2022 | Venue PDF | LLM Run Details
| Reproducibility Variable | Result | LLM Response |
|---|---|---|
| Research Type | Experimental | We back up our theoretical claims with experiments on a suite of standard benchmark games. ... We ran our algorithm for communication and full-certification equilibria on various two-player games, and compared the results to those given by notions of optimal correlation in games. |
| Researcher Affiliation | Collaboration | Brian Hu Zhang Computer Science Department Carnegie Mellon University EMAIL Tuomas Sandholm Computer Science Department, CMU Strategic Machine, Inc. Strategy Robot, Inc. Optimized Markets, Inc. EMAIL |
| Pseudocode | No | The paper describes the construction of the mediator-augmented game and refers to solving a linear program but does not provide pseudocode or an algorithm block. |
| Open Source Code | No | Did you include the code, data, and instructions needed to reproduce the main experimental results (either in the supplemental material or as a URL)? [No] |
| Open Datasets | Yes | We ran our algorithm for communication and full-certification equilibria on various two-player games... The games used in the experiments are given in Appendix D. All experiments were allocated four CPU cores and 64 GB of RAM. |
| Dataset Splits | No | The paper does not specify any training, validation, or test dataset splits. |
| Hardware Specification | Yes | All experiments were allocated four CPU cores and 64 GB of RAM. |
| Software Dependencies | Yes | Linear programs were solved with Gurobi 9.5. |
| Experiment Setup | Yes | When payments are used, the allowable payment range is [0, M] where M is the reward range of the game. |