reproducibilityindex.ai

Exploiting Correlated Auxiliary Feedback in Parameterized Bandits

Authors: Arun Verma, Zhongxiang Dai, YAO SHU, Bryan Kian Hsiang Low

NeurIPS 2023 | Conference PDF | Archive PDF | Plain Text | LLM Run Details

Reproducibility Variable	Result	LLM Response
Research Type	Experimental	To validate our theoretical results, we empirically demonstrate the performance gain due to auxiliary feedback in different settings of parameterized bandits. We repeat all our experiments 50 times and show the regret as defined in Eq. (1) with a 95% confidence interval.
Researcher Affiliation	Academia	Arun Verma Zhongxiang Dai Yao Shu Bryan Kian Hsiang Low Department of Computer Science, National University of Singapore, Republic of Singapore {arun, daizhongxiang, shuyao, lowkh}@comp.nus.edu.sg
Pseudocode	Yes	OFUL-AF Algorithm for Linear Bandits with Auxiliary Feedback
Open Source Code	No	The paper does not contain any explicit statements about open-source code availability, repository links, or code in supplementary materials.
Open Datasets	No	The paper describes synthetic datasets generated by the authors (e.g., "We first generate a 2-dimensional synthetic dataset with 5000 data samples"), but does not provide concrete access information (link, DOI, repository, or formal citation) for them to be publicly available.
Dataset Splits	No	The paper mentions generating data samples (e.g., "5000 data samples") but does not provide specific details on training, validation, or test splits (percentages, absolute counts, or references to predefined splits).
Hardware Specification	No	The paper does not provide specific hardware details (e.g., exact GPU/CPU models, processor types, or memory amounts) used for running its experiments.
Software Dependencies	No	The paper does not provide specific ancillary software details, such as library or solver names with version numbers, needed to replicate the experiment.
Experiment Setup	Yes	In all linear bandits experiments, we use λ = 0.01, L = 2.236, S = 1, and δ = 0.05. ... The default value of σ2 v = 0.01 and σ2 w = 0.01.