reproducibilityindex.ai

Same-Cluster Querying for Overlapping Clusters

Authors: Wasim Huleihel, Arya Mazumdar, Muriel Medard, Soumyabrata Pal

NeurIPS 2019 | Conference PDF | Archive PDF | Plain Text | LLM Run Details

Reproducibility Variable	Result	LLM Response
Research Type	Experimental	Finally, we test our algorithms over both synthetic and real-world data, showing their practicality and effectiveness.
Researcher Affiliation	Academia	Wasim Huleihel Department of Electrical Engineering Tel-Aviv University Tel-Aviv, Israel 6997801 wasimh@mail.tau.ac.il Arya Mazumdar College of Information & Computer Sciences University of Massachusetts Amherst Amherst, MA 01003 arya@cs.umass.edu Muriel Médard Electrical Engineering & Computer Science Massachusetts Institute of Technology Cambridge, MA 02139 medard@mit.edu Soumyabrata Pal College of Information & Computer Sciences University of Massachusetts Amherst Amherst, MA 01003 soumyabratap@umass.edu
Pseudocode	Yes	Algorithm 1 Findmembership; Algorithm 2 Find Similarity; Algorithm 3 Worst-case quantized responses
Open Source Code	No	The paper does not provide any statement or link indicating that the source code for the described methodology is publicly available.
Open Datasets	Yes	We focus on real-world data from the popular Movielens dataset (https://grouplens.org/datasets/movielens/) for our experiments.
Dataset Splits	No	The paper describes experiments involving random sampling of elements (set S) and conducting queries based on these samples to recover clusters. It does not define traditional train/validation/test splits of the dataset for model training or hyperparameter tuning. The experimental setup involves iterating trials for different sample sizes, but this is distinct from a validation set split.
Hardware Specification	No	The paper does not specify any details about the hardware (e.g., CPU, GPU, memory, cloud instances) used for running the experiments.
Software Dependencies	No	The paper discusses concepts related to software (e.g., 'low-rank matrix completion'), and mentions an example of a specific solver ('CPLEX 12.4') in the context of related work. However, it does not list any specific software dependencies or their version numbers that were used in the paper's own experiments.
Experiment Setup	Yes	We ran Algorithm 3 (...) with different values of \|S\| (number of movies randomly chosen in the ﬁrst step) and since the movies are sampled randomly, we ran 50 trials for each value of \|S\|.; Instead of sampling a ﬁxed number of movies a-priori, we randomly select k = 5 movies and make all pairwise queries restricted to those 5 movies. Then, we check if the 5 × 5 gram matrix (...) is of rank 5 and if yes, then we will use that matrix for further calculations. If not, we sample again until we succeed.