Exploiting Correlated Auxiliary Feedback in Parameterized Bandits

Authors: Arun Verma, Zhongxiang Dai, YAO SHU, Bryan Kian Hsiang Low

NeurIPS 2023 | Conference PDF | Archive PDF | Plain Text | LLM Run Details

Reproducibility Variable Result LLM Response
Research Type Experimental To validate our theoretical results, we empirically demonstrate the performance gain due to auxiliary feedback in different settings of parameterized bandits. We repeat all our experiments 50 times and show the regret as defined in Eq. (1) with a 95% confidence interval.
Researcher Affiliation Academia Arun Verma Zhongxiang Dai Yao Shu Bryan Kian Hsiang Low Department of Computer Science, National University of Singapore, Republic of Singapore {arun, daizhongxiang, shuyao, lowkh}@comp.nus.edu.sg
Pseudocode Yes OFUL-AF Algorithm for Linear Bandits with Auxiliary Feedback
Open Source Code No The paper does not contain any explicit statements about open-source code availability, repository links, or code in supplementary materials.
Open Datasets No The paper describes synthetic datasets generated by the authors (e.g., "We first generate a 2-dimensional synthetic dataset with 5000 data samples"), but does not provide concrete access information (link, DOI, repository, or formal citation) for them to be publicly available.
Dataset Splits No The paper mentions generating data samples (e.g., "5000 data samples") but does not provide specific details on training, validation, or test splits (percentages, absolute counts, or references to predefined splits).
Hardware Specification No The paper does not provide specific hardware details (e.g., exact GPU/CPU models, processor types, or memory amounts) used for running its experiments.
Software Dependencies No The paper does not provide specific ancillary software details, such as library or solver names with version numbers, needed to replicate the experiment.
Experiment Setup Yes In all linear bandits experiments, we use λ = 0.01, L = 2.236, S = 1, and δ = 0.05. ... The default value of σ2 v = 0.01 and σ2 w = 0.01.