reproducibilityindex.ai

Local and Adaptive Mirror Descents in Extensive-Form Games

Authors: Côme Fiegel, Pierre Ménard, Tadashi Kozuno, Remi Munos, Vianney Perchet, Michal Valko

NeurIPS 2024 | Conference PDF | Archive PDF | Plain Text | LLM Run Details

Reproducibility Variable	Result	LLM Response
Research Type	Experimental	We implemented Local OMD, with the parameters of Theorem 4.1 and Theorem 4.2, then tested it against the theoretically optimal Balanced CFR (Bai et al., 2022) using the balanced policy as the sample policy, and Balanced FTRL (Fiegel et al., 2023). The algorithms were compared on three standard benchmark games: Kuhn poker (Kuhn, 1950), Leduc poker (Southey et al., 2005) and liars dice, using the version 1.4 of the Open Spiel library (Lanctot et al., 2019) under the Apache 2.0 license.
Researcher Affiliation	Collaboration	Côme Fiegel CREST Fair Play, ENSAE Paris Palaiseau, France come.fiege@normalesup.org Pierre Ménard ENS Lyon Lyon, France Tadashi Kozuno OMRON SINIC X Tokyo, Japan Rémi Munos Google Deep Mind Paris, France Vianney Perchet CREST Fair Play, ENSAE Paris, Criteo AI Lab Paris, France Michal Valko INRIA
Pseudocode	Yes	Algorithm 1 Learning procedures with fixed sampling policies for two players; Algorithm 2 Local OMD
Open Source Code	Yes	The code is available at https: //github.com/anon5493/Local OMD-experiments.
Open Datasets	Yes	The algorithms were compared on three standard benchmark games: Kuhn poker (Kuhn, 1950), Leduc poker (Southey et al., 2005) and liars dice, using the version 1.4 of the Open Spiel library (Lanctot et al., 2019) under the Apache 2.0 license.
Dataset Splits	No	The paper does not specify explicit training, validation, or test dataset splits. It mentions learning over a 'total number of episodes' and 'self-play framework' for policy updates, which differs from traditional dataset splitting for supervised learning.
Hardware Specification	No	The paper does not provide specific hardware details (e.g., CPU, GPU models, memory, or cloud resources) used for running the experiments.
Software Dependencies	Yes	using the version 1.4 of the Open Spiel library (Lanctot et al., 2019) under the Apache 2.0 license.
Experiment Setup	Yes	The learning rates (and the IX parameters for the relevant algorithms) were optimized independently for each algorithm using a grid search.