reproducibilityindex.ai

The Power of Uniform Sampling for k-Median

Authors: Lingxiao Huang, Shaofeng H.-C. Jiang, Jianing Lou

ICML 2023 | Conference PDF | Archive PDF | Plain Text | LLM Run Details

Reproducibility Variable	Result	LLM Response
Research Type	Experimental	Experiments Our experiments focus on validating the performance of running an algorithm for k-MEDIAN (instead of (k, β)-MEDIAN) on top of the uniform S, since it is arguably the most natural approach and is likely to be used in practice. Our experiments were conducted on various real datasets of different types of metric spaces including Euclidean Rd and shortest-path metrics. We find that the solution returned by the k-MEDIAN algorithm is fairly balanced, which effectively enforces the balancedness constraint of (k, β)-MEDIAN.
Researcher Affiliation	Academia	1State Key Laboratory of Novel Software Technology, Nanjing University, Jiangsu, China 2Center on Frontiers of Computing Studies, Peking University, Beijing, China.
Pseudocode	No	The paper describes algorithms and methods but does not include any clearly labeled pseudocode blocks or algorithm listings.
Open Source Code	No	The paper does not provide any links to open-source code for the methodology described.
Open Datasets	Yes	Our experiments are conducted on 3 real datasets: Twitter (Chan et al., 2018), Census1990 (Meek et al., 1990) and NY (Open Street Map contributors, 2017).
Dataset Splits	No	The paper mentions taking 'm uniform samples S from the dataset with varying m' but does not specify training, validation, or test dataset splits or how data was partitioned for cross-validation.
Hardware Specification	Yes	All experiments are conducted on a PC with Intel Core i7 CPU and 16 GB memory
Software Dependencies	No	The paper states 'algorithms are implemented in C++ 11'. While 'C++ 11' specifies a language standard version, it does not include version numbers for any specific libraries, frameworks, or compilers, which are typically needed for full reproducibility.
Experiment Setup	No	The paper mentions running a 'local search algorithm (Arya et al., 2001)' but does not specify any hyperparameters or system-level training settings for this algorithm.