Notice: The reproducibility variables underlying each score are classified using an automated LLM-based pipeline, validated against a manually labeled dataset. LLM-based classification introduces uncertainty and potential bias; scores should be interpreted as estimates. Full accuracy metrics and methodology are described in [1].
Connecting Spectral Clustering to Maximum Margins and Level Sets
Authors: David P. Hofmeyr
JMLR 2020 | Venue PDF | LLM Run Details
| Reproducibility Variable | Result | LLM Response |
|---|---|---|
| Research Type | Theoretical | The main contributions of this work are in theoretically connecting spectral clustering with the problems of maximum margin clustering and of level set estimation. As we discussed in the previous section, there is an intuitive connection between these problems when maximum margin clustering is applied to a truncated sample, in which observations with low empirical density are removed. We state the results at this stage in a simplified form which captures the main points of the results. Technical details regarding assumptions, etc., are deferred to the relevant sections. We begin by introducing notation and terminology which will be useful here, and in the remaining paper. In this section we present complete derivations of the theoretical contributions of this paper. We first derive bounds on the eigenvectors and eigenvalues of graph Laplacian matrices, in terms of the within cluster connectedness and between cluster separation. |
| Researcher Affiliation | Academia | David P. Hofmeyr EMAIL Department of Statistics and Actuarial Science Stellenbosch University Stellenbosch, South Africa |
| Pseudocode | No | The paper describes algorithms and methods using mathematical formulations, theorems, and proofs, but does not include any explicitly labeled 'Pseudocode' or 'Algorithm' blocks with structured, code-like steps. |
| Open Source Code | No | The paper is theoretical and focuses on mathematical derivations and proofs. It does not contain any statements or links indicating that source code for the described methodology is publicly available or released. |
| Open Datasets | No | The paper focuses on theoretical aspects of spectral clustering, maximum margin clustering, and level set estimation, often discussing abstract 'samples' or 'sequences of random variables'. It does not refer to specific, named, or publicly accessible datasets for empirical evaluation. |
| Dataset Splits | No | The paper is theoretical and does not describe experiments performed on datasets, thus no information regarding training, testing, or validation dataset splits is provided. |
| Hardware Specification | No | The paper is theoretical and focuses on mathematical derivations and proofs. It does not describe any experimental setup or mention specific hardware used for running experiments. |
| Software Dependencies | No | The paper is theoretical and focuses on mathematical derivations and proofs. It does not mention any software dependencies with specific version numbers for replicating experimental results. |
| Experiment Setup | No | The paper is theoretical and primarily deals with mathematical proofs and connections between different clustering methods. It does not contain details about experimental setup, hyperparameters, or training configurations. |