Reduced Space and Faster Convergence in Imperfect-Information Games via Pruning

Authors: Noam Brown, Tuomas Sandholm

ICML 2017 | Conference PDF | Archive PDF | Plain Text | LLM Run Details

Reproducibility Variable Result LLM Response
Research Type Experimental Experiments show that BRP results in a factor of 7 reduction in space, and the reduction factor increases with game size.
Researcher Affiliation Academia 1Computer Science Department, Carnegie Mellon University, Pittsburgh, PA, USA. Correspondence to: Noam Brown <noamb@cs.cmu.edu>, Tuomas Sandholm <sandholm@cs.cmu.edu>.
Pseudocode No No pseudocode or algorithm blocks are explicitly provided in the paper.
Open Source Code No The paper does not provide concrete access to source code for the methodology described.
Open Datasets Yes The experiments are conducted on Leduc Hold em (Southey et al., 2005) and Leduc-5 (Brown & Sandholm, 2015a).
Dataset Splits No The paper evaluates performance within game environments (Leduc Hold'em, Leduc-5) and discusses convergence over iterations, but does not define traditional train/validation/test dataset splits as it's not a typical supervised learning setup with fixed datasets.
Hardware Specification No The paper uses 'Nodes touched' as a hardware and implementation-independent proxy for time and does not provide specific details on the hardware used for experiments.
Software Dependencies No The paper discusses various algorithms (CFR, CFR+, RM, Fictitious Play) but does not specify any software dependencies or library version numbers used for implementation.
Experiment Setup Yes Figure 1 and Figure 2 show the reduction in space needed to store the average strategy and regrets for BRP for various values of the constant threshold C, where an action s probability is set to zero if it is reached with probability less than C T in the average strategy, as we explained in Section 3.1. In both games, a threshold between 0.01 and 0.1 performed well in both space and number of iterations, with the lower thresholds converging somewhat faster and the higher thresholds reducing space faster.