Outlier Detection and Robust PCA Using a Convex Measure of Innovation
Authors: Mostafa Rahmani, Ping Li
NeurIPS 2019
| Reproducibility Variable | Result | LLM Response |
|---|---|---|
| Research Type | Experimental | This paper presents a provable and strong algorithm, termed Innovation Search (iSearch), to robust Principal Component Analysis (PCA) and outlier detection. [...] The theoretical and numerical results showed that finding the optimal directions makes iSearch significantly robust to the outliers which carry weak innovation. Moreover, the experiments with real and synthetic data demonstrate the robustness of the proposed method against the strong presence of noise. |
| Researcher Affiliation | Industry | Mostafa Rahmani and Ping Li, Cognitive Computing Lab, Baidu Research, 10900 NE 8th St., Bellevue, WA 98004, USA; {mostafarahmani,liping11}@baidu.com |
| Pseudocode | Yes | Algorithm 1: Subspace Recovery Using iSearch (a hedged sketch of the underlying direction-search step is given below the table) |
| Open Source Code | No | The paper does not provide any links to open-source code for the methodology, nor does it explicitly state that the code will be released or is available. |
| Open Datasets | Yes | We use the Hopkins155 dataset [33], which contains data matrices with 2 or 3 clusters. [...] We use the Waving Tree video file [21]. |
| Dataset Splits | No | The paper mentions synthetic data generation parameters and how a 'trial is considered successful', but it does not specify explicit dataset splits (e.g., train/validation/test percentages or counts) or cross-validation methods for reproducibility. |
| Hardware Specification | No | The paper does not provide any specific details about the hardware (e.g., CPU, GPU models, memory) used to conduct the experiments. |
| Software Dependencies | No | The paper states 'we use an ADMM solver to solve (1)' but does not provide specific version numbers for this solver or any other software dependencies, which are necessary for reproducibility. |
| Experiment Setup | Yes | In this experiment, M1 = 100, r = 5, and n_i = 100. The data contains 300 unstructured and 10 structured outliers. The distribution of the structured outliers follows Assumption 2 with η = 0.1. [...] In addition, we identify column d as an outlier if $\|\mathbf{d} - \hat{U}\hat{U}^T\mathbf{d}\|_2 / \|\mathbf{d}\|_2 \geq 0.2$, where $\hat{U}$ is the recovered subspace (a sketch of this check follows the table). |
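
The direction-search program that Algorithm 1 relies on, and that the paper reports solving with ADMM, appears above only as problem (1). The sketch below assumes it has the form used throughout the authors' innovation-search line of work, $\min_{\mathbf{c}} \|\mathbf{D}^T\mathbf{c}\|_1$ subject to $\mathbf{c}^T\mathbf{d}_i = 1$, and solves it as an equivalent linear program with SciPy's HiGHS backend rather than ADMM. The function name `innovation_values` and the choice of solver are illustrative assumptions, not artifacts from the paper.

```python
import numpy as np
from scipy.optimize import linprog

def innovation_values(D):
    """For each column d_i of D (m x n), solve the assumed direction search
    min_c ||D^T c||_1  s.t.  c^T d_i = 1 as a linear program, and return
    ||D^T c*||_1 per column. In this framework, columns with small values
    behave like outliers (their optimal direction is nearly orthogonal to
    the rest of the data)."""
    m, n = D.shape
    # LP variables x = [c (m entries); t (n entries)], minimizing sum(t)
    cost = np.concatenate([np.zeros(m), np.ones(n)])
    # |D^T c| <= t  rewritten as  D^T c - t <= 0  and  -D^T c - t <= 0
    A_ub = np.block([[D.T, -np.eye(n)], [-D.T, -np.eye(n)]])
    b_ub = np.zeros(2 * n)
    bounds = [(None, None)] * m + [(0, None)] * n
    vals = np.empty(n)
    for i in range(n):
        # equality constraint c^T d_i = 1 (t is unconstrained here)
        A_eq = np.concatenate([D[:, i], np.zeros(n)])[None, :]
        res = linprog(cost, A_ub=A_ub, b_ub=b_ub, A_eq=A_eq, b_eq=[1.0],
                      bounds=bounds, method="highs")
        vals[i] = res.fun  # equals ||D^T c*||_1 at the optimum
    return vals
```

The LP reformulation is used here only because it keeps the sketch self-contained with SciPy; the paper's own ADMM solver would be the natural choice at the scale of the reported experiments.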
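
The experiment-setup row also states the rule used to score detection: a column is declared an outlier when the residual left after projecting it onto the recovered subspace exceeds 20% of the column's norm. A minimal sketch of that check, assuming `U_hat` holds an orthonormal basis of the recovered subspace (the function name and threshold argument are illustrative):

```python
import numpy as np

def flag_outliers(D, U_hat, tau=0.2):
    """Mark column d of D as an outlier when
    ||d - U_hat U_hat^T d||_2 / ||d||_2 >= tau."""
    residual = D - U_hat @ (U_hat.T @ D)              # per-column projection residual
    ratios = np.linalg.norm(residual, axis=0) / np.linalg.norm(D, axis=0)
    return ratios >= tau                              # True where the column is flagged
```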