Notice: The reproducibility variables underlying each score are classified using an automated LLM-based pipeline, validated against a manually labeled dataset. LLM-based classification introduces uncertainty and potential bias; scores should be interpreted as estimates. Full accuracy metrics and methodology are described in [1].
Incremental Clustering: The Case for Extra Clusters
Authors: Margareta Ackerman, Sanjoy Dasgupta
NeurIPS 2014 | Venue PDF | LLM Run Details
| Reproducibility Variable | Result | LLM Response |
|---|---|---|
| Research Type | Theoretical | In this paper, we initiate the formal analysis of incremental clustering methods focusing on the types of cluster structure that they are able to detect. We ο¬nd that the incremental setting is strictly weaker than the batch model, proving that a fundamental class of cluster structures that can readily be detected in the batch setting is impossible to identify using any incremental method. Furthermore, we show how the limitations of incremental clustering can be overcome by allowing additional clusters. |
| Researcher Affiliation | Academia | Margareta Ackerman Florida State University 600 W College Ave, Tallahassee, FL 32306 EMAIL Sanjoy Dasgupta UC San Diego 9500 Gilman Dr, La Jolla, CA 92093 EMAIL |
| Pseudocode | Yes | Algorithm 2.2. Sequential k-means. ... Algorithm 2.3. Sequential agglomerative clustering. ... Algorithm 2.4. Sequential nearest-neighbour clustering. ... Algorithm 5.2. Incremental clustering with extra centers. ... Algorithm 5.9. Algorithm subsample. |
| Open Source Code | No | The paper does not provide any links or explicit statements about making its source code publicly available. |
| Open Datasets | No | The paper is theoretical and does not conduct experiments on specific datasets requiring public access information for training data. |
| Dataset Splits | No | The paper is theoretical and does not conduct experiments with dataset splits for validation. |
| Hardware Specification | No | The paper is theoretical and does not describe any specific hardware used for experiments. |
| Software Dependencies | No | The paper is theoretical and focuses on algorithms and proofs; it does not specify any software dependencies with version numbers for implementation or replication. |
| Experiment Setup | No | The paper is theoretical and does not describe an experimental setup with hyperparameters or system-level training settings. |