Reproducibility Index

Notice: The reproducibility variables underlying each score are classified using an automated LLM-based pipeline, validated against a manually labeled dataset. LLM-based classification introduces uncertainty and potential bias; scores should be interpreted as estimates. Full accuracy metrics and methodology are described in Coakley et alK. L. Coakley, T. Snelleman, H. Hoos, and O. E. Gundersen, "The embrace of open science: An analysis of a decade of AI research and 56 800 conference papers," Under Review, 2026..

Removing systematic errors for exoplanet search via latent causes

Authors: Bernhard Schölkopf, David Hogg, Dun Wang, Dan Foreman-Mackey, Dominik Janzing, Carl-Johann Simon-Gabriel, Jonas Peters

ICML 2015 | Venue PDF | LLM Run Details

Reproducibility Variable	Result	LLM Response
Research Type	Experimental	We analyze two simulated data sets that illustrate the identiﬁability statements from Sections 2.1 and 2.2. ... In Fig. 6, we present corrected light curves for three typical stars of different magnitudes, using both CPM and PDC. ... Fig. 7 presents our CDPP comparison of CPM and PDC, showing that our method outperforms PDC.
Researcher Affiliation	Academia	Bernhard Sch olkopf EMAIL Max Planck Institute for Intelligent Systems, 72076 T ubingen, GERMANY David W. Hogg EMAIL Dun Wang EMAIL Daniel Foreman-Mackey EMAIL Center for Cosmology and Particle Physics, New York University, New York, NY 10003, USA Dominik Janzing EMAIL Carl-Johann Simon-Gabriel EMAIL Jonas Peters EMAIL Max Planck Institute for Intelligent Systems, 72076 T ubingen, GERMANY
Pseudocode	No	The paper does not contain structured pseudocode or algorithm blocks.
Open Source Code	Yes	Our code is available at https://github.com/jvc2688/Kepler Pixel Model.
Open Datasets	Yes	We obtained the data from the Mikulski Archive for Space Telescopes (MAST) (see http://archive.stsci. edu/index.html).
Dataset Splits	Yes	We train the model separately for each month, which contains about 1300 data points. Standard L2 regularization is employed to avoid overﬁtting, and parameters (regularization strength and number of input pixels) were optimized using cross-validation.
Hardware Specification	No	The paper does not provide specific hardware details (e.g., exact GPU/CPU models, processor types) used for running its experiments.
Software Dependencies	No	The paper mentions using 'the function gam (penalized regression splines) from the R-package mgcv' but does not provide specific version numbers for R or the mgcv package.
Experiment Setup	Yes	Speciﬁcally, we use 4000 predictor pixels from about 150 stars... Standard L2 regularization is employed to avoid overﬁtting, and parameters (regularization strength and number of input pixels) were optimized using cross-validation. We train the model separately for each month, which contains about 1300 data points... we report results where the AR component uses as inputs the three closest future and the three closest past time points, subject to the constraint that a window of 9 hours around the considered time point is excluded.