reproducibilityindex.ai

Contextual Multinomial Logit Bandits with General Value Functions

Authors: Mengxiao Zhang, Haipeng Luo

NeurIPS 2024 | Conference PDF | Archive PDF | Plain Text | LLM Run Details

Reproducibility Variable	Result	LLM Response
Research Type	Theoretical	In this work, we consider contextual MNL bandits with a general value function class that contains the ground truth, borrowing ideas from a recent trend of studies on contextual bandits. Speciﬁcally, we consider both the stochastic and the adversarial settings, and propose a suite of algorithms, each with different computation-regret trade-off. ... Throughout the paper, we use two running examples to illustrate the concrete regret bounds our different algorithms achieve: the ﬁnite class and the linear class. In particular, for the linear class, this leads to ﬁve new results, summarized in Table 1 together with previous results.
Researcher Affiliation	Academia	Mengxiao Zhang University of Iowa mengxiao-zhang@uiowa.edu Haipeng Luo University of Southern California haipengl@usc.edu
Pseudocode	Yes	Algorithm 1 Contextual MNL Algorithms with an Ofﬂine Regression Oracle, Algorithm 2 Contextual MNL Algorithms via an Online Regression Oracle, Algorithm 3 Feel-Good Thompson Sampling for Contextual MNL bandits
Open Source Code	No	The paper does not contain any statement about releasing source code for the described methodology, nor does it provide any links to a code repository.
Open Datasets	No	The paper is purely theoretical and does not conduct experiments using publicly available datasets. It refers to 'ﬁnite class' and 'linear class' as abstract function classes for theoretical analysis, not concrete datasets.
Dataset Splits	No	This paper is theoretical and does not involve the use of datasets with training, validation, or test splits for empirical evaluation.
Hardware Specification	No	This paper is purely theoretical and does not describe any specific hardware (e.g., GPU models, CPU types, memory) used for running experiments.
Software Dependencies	No	This paper is purely theoretical and does not mention any specific software dependencies with version numbers (e.g., programming languages, libraries, frameworks) required to reproduce experiments.
Experiment Setup	No	This paper is purely theoretical and does not describe any experimental setup details, such as hyperparameter values, optimization settings, or training configurations.