Benign Overfitting in Multiclass Classification: All Roads Lead to Interpolation
Authors: Ke Wang, Vidya Muthukumar, Christos Thrampoulidis
NeurIPS 2021
| Reproducibility Variable | Result | LLM Response |
|---|---|---|
| Research Type | Experimental | Our numerical results show excellent agreement with our theoretical findings. The main contributions of our paper are theoretical, and our simulations on synthetic data are intended to support these results rather than constitute results in their own right. |
| Researcher Affiliation | Academia | Ke Wang, Department of Statistics and Applied Probability, University of California, Santa Barbara, Santa Barbara, CA 93106, kewang01@ucsb.edu; Vidya Muthukumar, School of Electrical and Computer Engineering & Industrial and Systems Engineering, Georgia Institute of Technology, Atlanta, GA 30332, vmuthukumar8@gatech.edu; Christos Thrampoulidis, Department of Electrical and Computer Engineering, University of British Columbia, Vancouver, BC, Canada V6T 1Z4, cthrampo@ece.ubc.ca |
| Pseudocode | No | The paper describes algorithms using mathematical formulations (e.g., equations 3, 4, 5) but does not include any structured pseudocode or algorithm blocks. |
| Open Source Code | Yes | We include the code that creates the figures in our paper and will submit it as supplementary material. |
| Open Datasets | No | We assume that the data pairs $\{x_i, y_i\}_{i=1}^n$ are generated i.i.d. We will consider two models for the distribution of $(x, y)$. For both models, we define the mean vectors $\{\mu_j\}_{j=1}^k \subset \mathbb{R}^p$, and the mean matrix is given by $M := [\mu_1\ \mu_2\ \cdots\ \mu_k] \in \mathbb{R}^{p \times k}$. Gaussian Mixture Model (GMM)... Multinomial Logit Model (MLM)... The paper states in the ethics review that simulations are on 'synthetic' data. No concrete access information for a public dataset is provided. |
| Dataset Splits | No | The paper mentions 'training data' and a 'fresh sample (x, y)' (test data) but does not provide specific details on the dataset splits (e.g., percentages or counts for training, validation, and test sets). |
| Hardware Specification | No | The paper does not provide any specific hardware details (e.g., GPU/CPU models, memory amounts, or detailed computer specifications) used for running its experiments. |
| Software Dependencies | No | The paper does not provide specific ancillary software details (e.g., library or solver names with version numbers) needed to replicate the experiment. |
| Experiment Setup | Yes | We set the number of classes $k = 4$, fix $n = 40$, and vary $p = 50, \ldots, 1200$ to guarantee sufficient overparameterization. We consider the case of orthogonal and equal-norm mean vectors with $\|\mu\|_2 = \mu\sqrt{p}$, for $\mu = 0.2, 0.3$ and $0.4$. (A code sketch of this setup appears below the table.) |
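
For orientation, the following is a minimal sketch (not the authors' supplementary code) of the synthetic setup described above: data drawn from a Gaussian Mixture Model with orthogonal, equal-norm means $\|\mu\|_2 = \mu\sqrt{p}$, and a minimum-norm interpolator of one-hot labels, one of the interpolating classifiers the paper analyzes. Function names, the scaled-basis-vector choice of orthogonal means, and the equal class priors are assumptions made for illustration.

```python
import numpy as np

def make_gmm_data(n=40, p=800, k=4, mu_scale=0.2, rng=None):
    """Sample n points from a k-class GMM with orthogonal, equal-norm means
    ||mu_j||_2 = mu_scale * sqrt(p), following the experiment setup quoted above."""
    rng = np.random.default_rng(rng)
    # One convenient (assumed) choice of orthogonal means: scaled standard basis vectors.
    M = np.zeros((p, k))
    M[np.arange(k), np.arange(k)] = mu_scale * np.sqrt(p)
    y = rng.integers(k, size=n)                  # class labels (assumed equal priors)
    X = M[:, y].T + rng.standard_normal((n, p))  # x_i = mu_{y_i} + standard Gaussian noise
    return X, y

def min_norm_interpolator(X, Y):
    """Minimum-norm W solving X W = Y in the overparameterized regime (p > n)."""
    return X.T @ np.linalg.solve(X @ X.T, Y)     # W = X^T (X X^T)^{-1} Y, shape p x k

n, p, k, mu_scale = 40, 800, 4, 0.3
X, y = make_gmm_data(n, p, k, mu_scale, rng=0)
W = min_norm_interpolator(X, np.eye(k)[y])       # interpolate one-hot labels
X_test, y_test = make_gmm_data(2000, p, k, mu_scale, rng=1)
test_error = np.mean(np.argmax(X_test @ W, axis=1) != y_test)
print(f"test error of the min-norm interpolator: {test_error:.3f}")
```

Sweeping $p$ over $50, \ldots, 1200$ and $\mu$ over $\{0.2, 0.3, 0.4\}$ with $n = 40$, $k = 4$ would mirror the reported setup; the authors' own figure-generating code is available only through their supplementary material.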