reproducibilityindex.ai

Notice: The reproducibility variables underlying each score are classified using an automated LLM-based pipeline, validated against a manually labeled dataset. LLM-based classification introduces uncertainty and potential bias; scores should be interpreted as estimates. Full accuracy metrics and methodology are described in [1].

Semi-Supervised Interpolation in an Anticausal Learning Scenario

Authors: Dominik Janzing, Bernhard Schölkopf

JMLR 2015 | Venue PDF | LLM Run Details

Reproducibility Variable	Result	LLM Response
Research Type	Theoretical	We prove that unlabelled data help for the problem of interpolating a monotonically increasing function if and only if the orthogonality conditions are violated which we only expect for the anticausal direction. Here, performance of SSL and its supervised baseline analogue is measured in terms of two diﬀerent loss functions: ﬁrst, the mean squared error and second the surprise in a Bayesian prediction scenario. ... The main contribution of this paper is to describe the relation between the performance of SSL to a mathematically well-deﬁned notion of dependence between PX and PY \|X. ... We prove that unlabelled data help for the problem of interpolating a monotonically increasing function...
Researcher Affiliation	Academia	Dominik Janzing EMAIL Bernhard Sch olkopf EMAIL Max Planck Institute for Intelligent Systems Spemannstr. 38 72076 T ubingen, Germany
Pseudocode	No	The paper primarily presents mathematical definitions, lemmas, theorems, and proofs. There are no explicitly labeled pseudocode blocks or algorithm sections.
Open Source Code	No	The paper does not contain any statements about code release, links to source code repositories, or mentions of code in supplementary materials.
Open Datasets	No	The paper introduces a theoretical 'simple interpolation problem: Let X and Y be random variables attaining values in [0, 1], deterministically related by Y = f(X)'. It does not use or refer to any publicly available external datasets.
Dataset Splits	No	The paper is theoretical and does not conduct experiments on datasets, thus no dataset split information (training/test/validation) is provided.
Hardware Specification	No	The paper describes theoretical concepts and mathematical proofs, not empirical experiments that would require specific hardware. Therefore, no hardware specifications are mentioned.
Software Dependencies	No	The paper is theoretical and focuses on mathematical models and proofs. It does not describe any implementation or experimental setup that would require specific software libraries or tools with version numbers.
Experiment Setup	No	The paper is theoretical and does not describe an experimental setup, hyperparameters, training configurations, or other system-level settings for reproducing experiments.