reproducibilityindex.ai

Cakewalk Sampling

Authors: Uri Patish, Shimon Ullman2400-2407

AAAI 2020 | Conference PDF | Archive PDF | Plain Text | LLM Run Details

Reproducibility Variable	Result	LLM Response
Research Type	Experimental	As a ﬁrst use case, we empirically study the efﬁciency in which sampling methods can recover locally maximal cliques in undirected graphs. Not only do we show how our adaptive sampler outperforms related methods, we also show how it can even approach the performance of established clique algorithms. As a second use case, we consider how greedy algorithms can be combined with our adaptive sampler, and we demonstrate how this leads to superior performance in k-medoid clustering.
Researcher Affiliation	Academia	Uri Patish, Shimon Ullman Weizmann Institute of Science, Rehovot, Israel
Pseudocode	Yes	Algorithm 1 Cakewalk
Open Source Code	No	The paper does not provide an explicit statement or link for open-source code availability for the described methodology.
Open Datasets	Yes	As a benchmark for the clique problem, we used 80 undirected graphs that were published as part of the second DIMACS challenge (Johnson and Trick 1996). Using a publicly available collection (White 2017), we gathered 38 datasets that had between 500 and 1000 data points.
Dataset Splits	No	The paper mentions using datasets for experiments but does not specify train/validation/test splits, percentages, or absolute counts for reproducibility. It implies a sampling process rather relevant to the optimization problem rather than explicit data splits for training/validation.
Hardware Specification	No	The paper does not specify any hardware details such as CPU, GPU models, or memory used for running the experiments.
Software Dependencies	No	The paper mentions the use of optimizers like Adam and Ada Grad, but does not provide specific version numbers for any software dependencies, libraries, or programming languages used in the implementation.
Experiment Setup	Yes	We have executed a method for 100 \|V \| samples (hence runtime is ﬁxed per graph), and recorded the following items at the execution s end. We tested each method on all 80 graphs, letting it maximize the softclique-size function using various values of κ. Since a-priori we do not know which κ will lead some method towards an inclusion maximal clique, we have executed each method with each of the values 0.0, 0.1, . . . , 1.0 as κ. For estimating ˆμ, ˆσ and ˆF, we have used the last 100 objective values.