Outlier Robust Mean Estimation with Subgaussian Rates via Stability
Authors: Ilias Diakonikolas, Daniel M. Kane, Ankit Pensia
NeurIPS 2020 | Conference PDF | Archive PDF | Plain Text | LLM Run Details
| Reproducibility Variable | Result | LLM Response |
|---|---|---|
| Research Type | Theoretical | We study the problem of outlier robust high-dimensional mean estimation under a finite covariance assumption, and more broadly under finite low-degree moment assumptions. We consider a standard stability condition from the recent robust statistics literature and prove that, except with exponentially small failure probability, there exists a large fraction of the inliers satisfying this condition. As a corollary, it follows that a number of recently developed algorithms for robust mean estimation, including iterative filtering and non-convex gradient descent, give optimal error estimators with (near-)subgaussian rates. |
| Researcher Affiliation | Academia | Ilias Diakonikolas University of Wisconsin-Madison ilias@cs.wisc.edu Daniel M. Kane University of California, San Diego dakane@cs.ucsd.edu Ankit Pensia University of Wisconsin-Madison ankitp@cs.wisc.edu |
| Pseudocode | No | The paper describes steps for algorithms in prose, such as the pre-processing step using the median-of-means principle. However, it does not include explicitly labeled 'Pseudocode' or 'Algorithm' blocks with structured code-like formatting. |
| Open Source Code | No | The paper does not contain any statement about making its source code publicly available or provide a link to a code repository. |
| Open Datasets | No | The paper is theoretical and discusses properties of distributions and samples, but does not refer to specific, named datasets (e.g., MNIST, CIFAR-10) or provide information about public access to any training data used for experiments. |
| Dataset Splits | No | The paper is theoretical and does not describe empirical experiments with data splits, thus no validation split information is provided. |
| Hardware Specification | No | The paper is theoretical and does not describe any computational experiments that would require specifying hardware used. |
| Software Dependencies | No | The paper is theoretical and does not describe any computational experiments that would require specifying software dependencies with version numbers. |
| Experiment Setup | No | The paper is theoretical and does not describe empirical experiments with specific hyperparameters or training configurations. |