Reproducibility Index

Notice: The reproducibility variables underlying each score are classified using an automated LLM-based pipeline, validated against a manually labeled dataset. LLM-based classification introduces uncertainty and potential bias; scores should be interpreted as estimates. Full accuracy metrics and methodology are described in Coakley et alK. L. Coakley, T. Snelleman, H. Hoos, and O. E. Gundersen, "The embrace of open science: An analysis of a decade of AI research and 56 800 conference papers," Under Review, 2026..

Feasible Action Search for Bandit Linear Programs via Thompson Sampling

Authors: Aditya Gangrade, Aldo Pacchiano, Clayton Scott, Venkatesh Saligrama

ICML 2025 | Venue PDF | LLM Run Details

Reproducibility Variable	Result	LLM Response
Research Type	Experimental	Our main simulations focus on constructing a practical methodology out of the theoretical study of FAST. These are presented in B.1 below. Additionally, we do auxiliary simulations that compare the behaviour of FAST and EOGT, which are presented in B.2. Figure 1. Estimates of the Optimism Rates under the Coupled Design Drawn According to Unif(γd Sd) as γ is varied. Top. Medians over 100 runs. Table 1. Comparison of the Quality of aout in the fixed d, varying M setting for Feasible Instances, with ε = 0.5.
Researcher Affiliation	Academia	1Boston University 2Broad Institute of MIT and Harvard 3University of Michigan. Correspondence to: A. Gangrade <EMAIL>.
Pseudocode	Yes	Algorithm 1 Feasible Action Search via Thompson Sampling (FAST(µ, ε, δ))
Open Source Code	No	The paper does not contain any explicit statement about releasing source code or a link to a code repository.
Open Datasets	Yes	Throughout, we set Φ to be a certain 9 × 9 directed adjacency matrix, A, obtained from https://sparse.tamu.edu/van Heukelum/cage4
Dataset Splits	No	The paper describes generating different instances by varying parameters (e.g., optimal safety margin M, constraint levels) for simulation purposes, but it does not specify conventional training, validation, and testing dataset splits as typically found in supervised machine learning experiments.
Hardware Specification	Yes	Indeed, with this change, these simulations took about 3 hours to execute on a midrange laptop computer running a 2022 Ryzen-5U CPU.
Software Dependencies	No	All instances are simulated 50 times using a MATLAB environment. The paper mentions a software environment (MATLAB) but does not provide a specific version number.
Experiment Setup	Yes	We execute FAST on the described instance with ε = 0.9, and ϕ chosen so that M = 1. As mentioned above, the noise laws investigated are the coupled design, with ζ ∼ Unif(γd Sd), for γ ∈ {3i : i ∈ [−4, 1]}. Note that since d = 9, the parameter γ = 3−2 boils down to choosing the uniform law on Sd. Throughout, we let the feedback noise have variance 1. We proceed by setting M ∈ {0.15, 0.2, . . . , 0.5}, and ε ∈ {0.5, 0.7, 0.9}, and simulating the behaviour of FAST on the described instances 100 times for each pair of parameters. We note that feedback noise is set to be independent Gaussian with standard deviation 0.1, and we use this value to adjust our confidence sets.