reproducibilityindex.ai

Notice: The reproducibility variables underlying each score are classified using an automated LLM-based pipeline, validated against a manually labeled dataset. LLM-based classification introduces uncertainty and potential bias; scores should be interpreted as estimates. Full accuracy metrics and methodology are described in [1].

Connecting Spectral Clustering to Maximum Margins and Level Sets

Authors: David P. Hofmeyr

JMLR 2020 | Venue PDF | LLM Run Details

Reproducibility Variable	Result	LLM Response
Research Type	Theoretical	The main contributions of this work are in theoretically connecting spectral clustering with the problems of maximum margin clustering and of level set estimation. As we discussed in the previous section, there is an intuitive connection between these problems when maximum margin clustering is applied to a truncated sample, in which observations with low empirical density are removed. We state the results at this stage in a simpliﬁed form which captures the main points of the results. Technical details regarding assumptions, etc., are deferred to the relevant sections. We begin by introducing notation and terminology which will be useful here, and in the remaining paper. In this section we present complete derivations of the theoretical contributions of this paper. We ﬁrst derive bounds on the eigenvectors and eigenvalues of graph Laplacian matrices, in terms of the within cluster connectedness and between cluster separation.
Researcher Affiliation	Academia	David P. Hofmeyr EMAIL Department of Statistics and Actuarial Science Stellenbosch University Stellenbosch, South Africa
Pseudocode	No	The paper describes algorithms and methods using mathematical formulations, theorems, and proofs, but does not include any explicitly labeled 'Pseudocode' or 'Algorithm' blocks with structured, code-like steps.
Open Source Code	No	The paper is theoretical and focuses on mathematical derivations and proofs. It does not contain any statements or links indicating that source code for the described methodology is publicly available or released.
Open Datasets	No	The paper focuses on theoretical aspects of spectral clustering, maximum margin clustering, and level set estimation, often discussing abstract 'samples' or 'sequences of random variables'. It does not refer to specific, named, or publicly accessible datasets for empirical evaluation.
Dataset Splits	No	The paper is theoretical and does not describe experiments performed on datasets, thus no information regarding training, testing, or validation dataset splits is provided.
Hardware Specification	No	The paper is theoretical and focuses on mathematical derivations and proofs. It does not describe any experimental setup or mention specific hardware used for running experiments.
Software Dependencies	No	The paper is theoretical and focuses on mathematical derivations and proofs. It does not mention any software dependencies with specific version numbers for replicating experimental results.
Experiment Setup	No	The paper is theoretical and primarily deals with mathematical proofs and connections between different clustering methods. It does not contain details about experimental setup, hyperparameters, or training configurations.