Reproducibility Index

Notice: The reproducibility variables underlying each score are classified using an automated LLM-based pipeline, validated against a manually labeled dataset. LLM-based classification introduces uncertainty and potential bias; scores should be interpreted as estimates. Full accuracy metrics and methodology are described in Coakley et alK. L. Coakley, T. Snelleman, H. Hoos, and O. E. Gundersen, "The embrace of open science: An analysis of a decade of AI research and 56 800 conference papers," Under Review, 2026..

Disentangling Influence: Using disentangled representations to audit model predictions

Authors: Charles Marx, Richard Phillips, Sorelle Friedler, Carlos Scheidegger, Suresh Venkatasubramanian

NeurIPS 2019 | Venue PDF | LLM Run Details

Reproducibility Variable	Result	LLM Response
Research Type	Experimental	We show through both theory and experiments that disentangled inﬂuence audits can both detect proxy features and show, for each individual or in aggregate, which of these proxy features affects the classiﬁer being audited the most.
Researcher Affiliation	Academia	Charles T. Marx Haverford College EMAIL Richard Lanas Phillips Cornell University EMAIL Sorelle A. Friedler Haverford College EMAIL Carlos Scheidegger University of Arizona EMAIL Suresh Venkatasubramanian University of Utah EMAIL
Pseudocode	Yes	DISENTANGLED-INFLUENCE-AUDIT(X, M) 1 for p in FEATURES(X) 2 (f , g, h) = DISENTANGLED-REPRESENTATION(X, p) // (h is not used) 3 M = g M 4 X = {f(x) for x in X} 5 SHAPp = DIRECT-INFLUENCE(X , p, M ) 6 return {SHAPp for p in FEATURES(X)}
Open Source Code	Yes	All data and code4 for the described method and below experiments is available in the Supplementary Materials. 4Code is available at: https://github.com/charliemarx/disentangling-influence
Open Datasets	Yes	It includes 5,000 instances of two variables x and y drawn independently from a uniform distribution over [0, 1]... The second synthetic dataset is the d Sprites dataset commonly used in the disentangled representations literature to disentangle independent factors that are sources of variation [23]... Finally, we will consider a real-world dataset containing Adult Income data that is commonly used as a test case in the fairness-aware machine learning community... [27].
Dataset Splits	No	The paper does not provide specific details on validation dataset splits, such as percentages or sample counts.
Hardware Specification	Yes	The Titan Xp used for this research was donated by the NVIDIA Corporation.
Software Dependencies	No	The paper mentions software like 'shap' and 'Black Box Auditing' (available via pip install) but does not provide specific version numbers for these or any other software dependencies.
Experiment Setup	No	Preprocessing, model, and disentangled representation training information are included in the Supplementary Material.