reproducibilityindex.ai

Data driven semi-supervised learning

Authors: Maria-Florina F. Balcan, Dravyansh Sharma

NeurIPS 2021 | Conference PDF | Archive PDF | Plain Text | LLM Run Details

Reproducibility Variable	Result	LLM Response
Research Type	Experimental	In this section we evaluate the performance of our learning procedures when ﬁnding application-speciﬁc semi-supervised learning algorithms (i.e. graph parameters). Our experiments3 demonstrate that the best parameter for different applications varies greatly, and that the techniques presented in this paper can lead to large gains. We look at image classiﬁcation based on standard pixel embedding.
Researcher Affiliation	Academia	Maria-Florina Balcan School of Computer Science Carnegie Mellon University Pittsburgh, PA 15213 ninamf@cs.cmu.edu Dravyansh Sharma Department of Computer Science Carnegie Mellon University Pittsburgh, PA 15213 dravyans@cs.cmu.edu
Pseudocode	Yes	Algorithm 1 Data-driven Graph-based SSL and Algorithm 2 Efﬁcient Data-driven Graph-based SSL are provided in the paper.
Open Source Code	Yes	Code: https://drive.google.com/drive/folders/1Iq Iw2Mp23W35UUwlz1hy24Eba5s Pp VH_
Open Datasets	Yes	We use three popular benchmark datasets MNIST [Le Cun et al., 1998], Omniglot [Lake et al., 2015] and CIFAR-10 [Szegedy et al., 2015].
Dataset Splits	No	The paper describes a sampling method for creating semi-supervised learning instances (e.g., "sampling 100 random examples and further sampling L random examples from the subset for labeling") but does not provide traditional training/test/validation dataset splits (e.g., percentages or fixed counts for global dataset partitions).
Hardware Specification	No	The paper does not explicitly mention the specific hardware (e.g., GPU/CPU models, memory) used for running the experiments in the main text. The information in the checklist is meta-information and not part of the paper's content.
Software Dependencies	No	The paper does not provide specific software names with version numbers for dependencies used in the experiments.
Experiment Setup	Yes	Setup: ... n = 100 ... L = 10 for MNIST, while L = 20 for Omniglot and CIFAR-10. ... determine data-speciﬁc good values for σ, when predictions are made by optimizing the harmonic objective (Table 1). ... σ [0, 10]. ... average over 50 iterations for learning from 50 problem instances each (T = 50, Figure 3).