Show Me the Way! Bilevel Search for Synthesizing Programmatic Strategies

Authors: David S. Aleixo, Levi H.S. Lelis

AAAI 2023 | Conference PDF | Archive PDF | Plain Text | LLM Run Details

Reproducibility Variable Result LLM Response
Research Type Experimental We evaluated our bilevel algorithm in Micro RTS, a real-time strategy game. Our results show that the bilevel search synthesizes stronger strategies than methods that search only in the program space. Also, the strategies our method synthesizes obtained the highest winning rate in a simulated tournament with several baseline agents, including the best agents from the two latest Micro RTS competitions.
Researcher Affiliation Academia 1Departamento de Inform atica, Universidade Federal de Vic osa, Brazil 2Department of Computing Science, Alberta Machine Intelligence Institute (Amii), University of Alberta, Canada
Pseudocode Yes Algorithm 1 shows Bi-S s pseudocode.
Open Source Code Yes 1https://github.com/dsaleixo/Bilevel Searchfor Synthesizing.
Open Datasets Yes We use Micro RTS, a real-time strategy game widely used to evaluate intelligent systems (Onta n on 2020). It is a two-player zero-sum game where each player controls a set of units in real time... We use the maps bases Workers24x24A, bases Workers32x32A, and (4)Blood Bath.scm B, whose sizes are 24 24, 32 32, and 64 64, respectively, from Micro RTS s official code base.
Dataset Splits No The paper does not provide specific details on traditional training, validation, and testing dataset splits, as it operates in a game-playing synthesis context rather than a fixed dataset learning setup.
Hardware Specification Yes All experiments were run on a single 2.4 GHz CPU with 8 GB of RAM and a time limit of 2 days.
Software Dependencies No The paper does not provide specific version numbers for software dependencies or libraries used in the experiments.
Experiment Setup Yes We use α = 0.9, β = 200, T1 = 100, ϵ = 1 with our SA implementation as these are the values Medeiros, Aleixo, and Lelis (2022) used in their experiments. We use the following exploration rates for the first phase of NS: ϵ1 0 = 0.7, ϵ1 l = 0.7, and ϵ1 g = 0.4; and the following for the second phase: ϵ1 0 = 0.1, ϵ1 l = 0.3, and ϵ1 g = 0.1. We use γ = 0.95.