Generalized Rapid Action Value Estimation
Authors: Tristan Cazenave
IJCAI 2015
| Reproducibility Variable | Result | LLM Response |
|---|---|---|
| Research Type | Experimental | We test the resulting algorithm named GRAVE for Atarigo, Knightthrough, Domineering and Go. |
| Researcher Affiliation | Academia | Tristan Cazenave, LAMSADE, Université Paris-Dauphine, Paris, France, cazenave@lamsade.dauphine.fr |
| Pseudocode | Yes | Algorithm 1 The GRAVE algorithm |
| Open Source Code | No | The paper does not provide concrete access to source code (e.g., specific repository link, explicit code release statement, or code in supplementary materials) for the methodology described. |
| Open Datasets | No | The paper discusses using various games (Atarigo, Knightthrough, Domineering, Go) as environments for experimentation, but does not provide specific access information (links, DOIs, citations to predefined splits) for publicly available datasets in the typical machine learning sense. |
| Dataset Splits | No | The paper mentions tuning parameters for RAVE and GRAVE, which implies a form of validation, but it does not provide specific dataset split information (exact percentages, sample counts, citations to predefined splits, or detailed splitting methodology) needed to reproduce data partitioning for training/validation/test sets. |
| Hardware Specification | No | The paper states: 'This work was granted access to the HPC resources of Meso PSL...', but it does not provide specific details such as exact GPU/CPU models, processor types, or memory amounts used for the experiments. |
| Software Dependencies | No | The paper does not provide specific ancillary software details (e.g., library or solver names with version numbers) needed to replicate the experiment. |
| Experiment Setup | Yes | the RAVE bias is first tuned playing different RAVE bias against UCT with a 0.4 exploration parameter (the exploration parameter used for GGP), then the GRAVE bias as well as the ref constant are tuned against the tuned RAVE. ... We test as bias all the powers of 10 between 10⁻¹ and 10⁻¹⁵. Additionally the ref constants tested for GRAVE are 25, 50, 100, 200 and 400. ... The algorithms are tested for 1,000 and 10,000 playouts. |
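
The two-stage tuning described in the Experiment Setup row can be enumerated as a small parameter grid. The following is only a sketch, not the author's code: the function names and dictionary keys are hypothetical, and the bias exponents are assumed to be negative (10⁻¹ through 10⁻¹⁵), since the extracted text appears to have lost the sign. It lists the configurations to be played and does not implement the games or the search itself.

```python
from itertools import product

# Values quoted in the Experiment Setup row.
# Assumption: exponents are negative (1e-1 ... 1e-15); the sign is not
# visible in the extracted text.
BIASES = [10.0 ** -k for k in range(1, 16)]
REF_CONSTANTS = [25, 50, 100, 200, 400]
PLAYOUT_BUDGETS = [1_000, 10_000]
UCT_EXPLORATION = 0.4  # exploration parameter of the UCT opponent


def rave_stage(playouts):
    """Stage 1: every candidate RAVE bias is played against UCT."""
    return [
        {
            "playouts": playouts,
            "rave_bias": bias,
            "opponent": "UCT",
            "uct_exploration": UCT_EXPLORATION,
        }
        for bias in BIASES
    ]


def grave_stage(playouts, tuned_rave_bias):
    """Stage 2: every (GRAVE bias, ref) pair is played against the tuned RAVE."""
    return [
        {
            "playouts": playouts,
            "grave_bias": bias,
            "ref": ref,
            "opponent": "RAVE",
            "opponent_bias": tuned_rave_bias,
        }
        for bias, ref in product(BIASES, REF_CONSTANTS)
    ]


if __name__ == "__main__":
    for n in PLAYOUT_BUDGETS:
        stage1 = rave_stage(n)
        # Placeholder: in a real run the tuned bias is the stage-1 winner.
        stage2 = grave_stage(n, tuned_rave_bias=BIASES[0])
        print(n, len(stage1), len(stage2))
```

Under these assumptions each playout budget yields 15 RAVE configurations in the first stage and 15 × 5 = 75 GRAVE configurations in the second, per game.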