Generalized Rapid Action Value Estimation

Authors: Tristan Cazenave

IJCAI 2015

Reproducibility Variable | Result | LLM Response
Research Type | Experimental | We test the resulting algorithm named GRAVE for Atarigo, Knightthrough, Domineering and Go.
Researcher Affiliation | Academia | Tristan Cazenave, LAMSADE, Université Paris-Dauphine, Paris, France, cazenave@lamsade.dauphine.fr
Pseudocode | Yes | Algorithm 1 The GRAVE algorithm
Open Source Code | No | The paper does not provide concrete access to source code (e.g., specific repository link, explicit code release statement, or code in supplementary materials) for the methodology described.
Open Datasets | No | The paper uses several games (Atarigo, Knightthrough, Domineering, Go) as experimental environments, but does not provide specific access information (links, DOIs, citations to predefined splits) for publicly available datasets in the typical machine learning sense.
Dataset Splits | No | The paper mentions tuning parameters for RAVE and GRAVE, which implies a form of validation, but it does not provide specific dataset split information (exact percentages, sample counts, citations to predefined splits, or detailed splitting methodology) needed to reproduce data partitioning for training/validation/test sets.
Hardware Specification | No | The paper states: 'This work was granted access to the HPC resources of Meso PSL...', but it does not provide specific details such as exact GPU/CPU models, processor types, or memory amounts used for the experiments.
Software Dependencies | No | The paper does not provide specific ancillary software details (e.g., library or solver names with version numbers) needed to replicate the experiment.
Experiment Setup | Yes | the RAVE bias is first tuned playing different RAVE bias against UCT with a 0.4 exploration parameter (the exploration parameter used for GGP), then the GRAVE bias as well as the ref constant are tuned against the tuned RAVE. ... We test as bias all the powers of 10 between 10^-1 and 10^-15. Additionally the ref constants tested for GRAVE are 25, 50, 100, 200 and 400. ... The algorithms are tested for 1,000 and 10,000 playouts.
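
The two-stage tuning described under Experiment Setup can be sketched as a small configuration grid. This is only an illustration of the quoted procedure, not code from the paper: the function names, dictionary keys, and the `tuned_rave_bias` argument are hypothetical, and the bias exponents are assumed to be negative (powers of 10 from 10^-1 down to 10^-15).

```python
from itertools import product

# Assumption: the bias exponents are negative, i.e. 10^-1 down to 10^-15.
BIAS_EXPONENTS = range(-1, -16, -1)
BIASES = [10.0 ** e for e in BIAS_EXPONENTS]

REF_CONSTANTS = [25, 50, 100, 200, 400]   # ref constants tested for GRAVE
PLAYOUT_BUDGETS = [1_000, 10_000]         # playouts per move
UCT_EXPLORATION = 0.4                     # exploration parameter of the UCT opponent


def rave_stage():
    """Stage 1: every candidate RAVE bias plays against UCT at each budget."""
    return [
        {
            "playouts": playouts,
            "player": "RAVE",
            "bias": bias,
            "opponent": "UCT",
            "opponent_exploration": UCT_EXPLORATION,
        }
        for playouts, bias in product(PLAYOUT_BUDGETS, BIASES)
    ]


def grave_stage(playouts, tuned_rave_bias):
    """Stage 2: every (bias, ref) pair for GRAVE plays against the tuned RAVE.

    `tuned_rave_bias` is a hypothetical input: the best bias found in stage 1
    for the same playout budget.
    """
    return [
        {
            "playouts": playouts,
            "player": "GRAVE",
            "bias": bias,
            "ref": ref,
            "opponent": "RAVE",
            "opponent_bias": tuned_rave_bias,
        }
        for bias, ref in product(BIASES, REF_CONSTANTS)
    ]
```

Under these assumptions, stage 1 yields 15 biases at 2 budgets, and stage 2 yields 15 x 5 GRAVE configurations per budget; each configuration would then be evaluated by playing a match between the two named players.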