A Gang of Adversarial Bandits
Authors: Mark Herbster, Stephen Pasteris, Fabio Vitale, Massimiliano Pontil
NeurIPS 2021 | Conference PDF | Archive PDF | Plain Text | LLM Run Details
| Reproducibility Variable | Result | LLM Response |
|---|---|---|
| Research Type | Theoretical | We present two learning algorithms, GABA-I and GABA-II which exploit the network structure to bias towards functions of low Ψ values. We show that GABA-I has an expected regret bound of O( p ln(NK/Ψ)ΨKT) and per-trial time complexity of O(K ln(N)), whilst GABA-II has a weaker O( p ln(N/Ψ) ln(NK/Ψ)ΨKT) regret, but a better O(ln(K) ln(N)) per-trial time complexity. |
| Researcher Affiliation | Academia | Mark Herbster*, Stephen Pasteris* Department of Computer Science University College London London WC1E 6BT {m.herbster,s.pasteris}@cs.ucl.ac.uk Fabio Vitale University of Lille 59653 Villeneuve d Ascq CEDEX France fabio.vitale2@univ-lille.fr Massimiliano Pontil CSML, Istituto Italiano di Tecnologia and Department of Computer Science University College London massimiliano.pontil@iit.it |
| Pseudocode | Yes | Figure 1: Binary Support Tree Construction Algorithm, Figure 2: SPECIALISTEXP Algorithm, Figure 3: GABA-I Algorithm, Figure 4: GABA-II Algorithm |
| Open Source Code | No | The paper does not contain any explicit statements about releasing code or links to source code repositories for the described methodology. |
| Open Datasets | No | The paper is theoretical and does not mention any specific dataset used for training or provide access information for any dataset. |
| Dataset Splits | No | The paper focuses on theoretical analysis and algorithm design, and thus does not provide details on training, validation, or test dataset splits. |
| Hardware Specification | No | The paper is theoretical in nature, presenting algorithms and regret bounds, and therefore does not specify any hardware used for experiments. |
| Software Dependencies | No | The paper is a theoretical work and does not specify software dependencies with version numbers for experimental reproducibility. |
| Experiment Setup | No | The paper presents theoretical algorithms and their analysis, without detailing an empirical experimental setup or specific hyperparameters. |