Reproducibility Index

Notice: The reproducibility variables underlying each score are classified using an automated LLM-based pipeline, validated against a manually labeled dataset. LLM-based classification introduces uncertainty and potential bias; scores should be interpreted as estimates. Full accuracy metrics and methodology are described in Coakley et alK. L. Coakley, T. Snelleman, H. Hoos, and O. E. Gundersen, "The embrace of open science: An analysis of a decade of AI research and 56 800 conference papers," Under Review, 2026..

Adversarial Linear Contextual Bandits with Graph-Structured Side Observations

Authors: Lingda Wang, Bingcong Li, Huozhi Zhou, Georgios B. Giannakis, Lav R. Varshney, Zhizhen Zhao10156-10164

AAAI 2021 | Venue PDF | LLM Run Details

Reproducibility Variable	Result	LLM Response
Research Type	Experimental	Numerical tests corroborate the efﬁciency of proposed algorithms. ... We conduct the numerical tests on synthetic data to demonstrate the efﬁciency of the novel EXP3-LGC-U and EXP3-LGC-IX algorithms. ... Figure 2 presents the expected cumulative regret2, where the results are averaged over 100 independent trials.
Researcher Affiliation	Academia	Lingda Wang1, Bingcong Li2, Huozhi Zhou1, Georgios B. Giannakis2, Lav R. Varshney1, Zhizhen Zhao1 1University of Illinois at Urbana-Champaign 2University of Minnesota Twin Cities EMAIL, EMAIL
Pseudocode	Yes	Algorithm 1 EXP3-LGC-U Input: Learning rate η > 0, uniform exploration rate γ (0, 1), covariance Σ, and action set V . For t = 1, . . . , T, do: ... Algorithm 2 EXP3-LGC-IX Parameters: Learning rate ηt > 0, implicit exploration rate βt (0, 1), and covariance Σ,and action set V . For t = 1, . . . , T, do:
Open Source Code	No	The paper does not provide concrete access to source code for the methodology described. There are no explicit statements about releasing code or links to repositories.
Open Datasets	No	The paper uses synthetic data generated by the authors, providing details on its generation (e.g., "Each coordinate of context Xt (or Xt) is generated i.i.d. from the Bernoulli distribution with support {0, 1/d} and p = 0.5"). It does not provide access information for a publicly available or open dataset.
Dataset Splits	No	The paper describes generating synthetic data and simulating a bandit problem over a time horizon, but it does not specify explicit training/validation/test dataset splits. The evaluation is performed over the entire time horizon, characteristic of online learning problems, without traditional dataset partitioning.
Hardware Specification	No	The paper does not provide specific hardware details (e.g., GPU/CPU models, memory amounts) used for running its experiments. It only mentions "numerical tests" on synthetic data.
Software Dependencies	No	The paper describes its algorithms and numerical tests but does not specify any ancillary software or library names with version numbers needed to replicate the experiment.
Experiment Setup	Yes	We consider a setting of K = 10 actions, with d = 10 dimensional contexts observed iteratively on a T = 10^5 time horizon. ... The parameters of EXP3-LGC-U and EXP3-LGC-IX are chosen according to Corollary 1 and Theorem 2, respectively. For EXP3-LGC-U and EXP3-LGC-IX , the parameter selection methods are identical as before, except for setting α(G) = K. The parameters of Robust Lin EXP3 are tuned exactly the same as those in Neu and Olkhovskaya (2020).