reproducibilityindex.ai

Tight Kernel Query Complexity of Kernel Ridge Regression and Kernel $k$-means Clustering

Authors: Taisuke Yasuda, David Woodruff, Manuel Fernandez

ICML 2019 | Conference PDF | Archive PDF | Plain Text | LLM Run Details

Reproducibility Variable	Result	LLM Response
Research Type	Theoretical	In this work, we present tight lower bounds on the number of kernel evaluations required to approximately solve kernel ridge regression (KRR) and kernel k-means clustering (KKMC) on n input points. For KRR, our bound for relative error approximation to the minimizer of the objective function is Ω(ndλ eﬀ/ε) where dλ eﬀ is the effective statistical dimension, which is tight up to a log(dλ eﬀ/ε) factor. For KKMC, our bound for ﬁnding a k-clustering achieving a relative error approximation of the objective function is Ω(nk/ε), which is tight up to a log(k/ε) factor.
Researcher Affiliation	Academia	1Computer Science Department, Carnegie Mellon University, Pittsburgh, Pennsylvania, USA 2Department of Mathematical Sciences, Carnegie Mellon University, Pittsburgh, Pennsylvania, USA.
Pseudocode	No	The paper describes algorithms and their properties (e.g., in Section 5.1 'Proof overview'), but it does not include any formally labeled 'Pseudocode' or 'Algorithm' blocks with structured steps.
Open Source Code	No	The paper does not contain any explicit statements or links indicating that open-source code for the described methodology is available.
Open Datasets	No	This paper is theoretical, focusing on lower bounds and algorithm design. It defines 'hard input distribution' instances for its theoretical proofs (e.g., 'µKRR(n, J)', 'µKKMC(n, k, ε)') rather than using conventional public datasets for empirical training or evaluation. Therefore, it does not provide access information for a public dataset.
Dataset Splits	No	The paper is theoretical and does not conduct empirical experiments with data splits. Therefore, it does not provide details on validation dataset splits.
Hardware Specification	No	The paper is theoretical and does not conduct empirical experiments that would require specific hardware. No hardware specifications are mentioned.
Software Dependencies	No	The paper is theoretical and focuses on mathematical proofs and algorithm design. It does not mention any specific software dependencies with version numbers.
Experiment Setup	No	The paper is theoretical and does not conduct empirical experiments in the traditional sense. Therefore, it does not provide details about an experimental setup with hyperparameters or training settings.