reproducibilityindex.ai

Strategic Instrumental Variable Regression: Recovering Causal Relationships From Strategic Responses

Authors: Keegan Harris, Dung Daniel T Ngo, Logan Stapleton, Hoda Heidari, Steven Wu

ICML 2022 | Conference PDF | Archive PDF | Plain Text | LLM Run Details

Reproducibility Variable	Result	LLM Response
Research Type	Experimental	We empirically evaluate our model on a semi-synthetic dataset inspired by our running university admissions example. We compare our 2SLS-based method against ordinary least squares (OLS), which directly regresses observed outcomes y on observable features x.
Researcher Affiliation	Academia	1School of Computer Science, Carnegie Mellon University, Pittsburgh, USA 2Computer Science Department, University of Minnesota, Minneapolis, USA.
Pseudocode	No	The paper describes the steps for Two-Stage Least Squares regression (2SLS) in Section 2, including derivations in Appendix A.2, but it does not provide any explicitly labeled pseudocode blocks or algorithm figures.
Open Source Code	No	The paper provides a link to a publicly available dataset ('https://www.openintro.org/data/ index.php?data=satgpa') but does not include any explicit statement or link for the source code of the methodology described in the paper.
Open Datasets	Yes	We constructed a semi-synthetic dataset based on the SATGPA dataset, a collection of real university admissions data. ... Originally collected by the Educational Testing Service, the SATGPA dataset is publicly available and can be found here: https://www.openintro.org/data/ index.php?data=satgpa.
Dataset Splits	No	The paper describes the construction and characteristics of its semi-synthetic dataset, including how data points are generated, but it does not provide explicit details about how the dataset is split into training, validation, or test sets (e.g., percentages, sample counts, or cross-validation setup).
Hardware Specification	Yes	We ran our experiments on a 2020 Mac Book Air laptop with 16GB of RAM.
Software Dependencies	No	The paper describes its methodology and experimental setup but does not list any specific software dependencies (e.g., programming languages, libraries, or frameworks) along with their version numbers.
Experiment Setup	Yes	For simplicity, we let the true effect θ = [0, 0.5] . ... For any applicant t, we randomly deploy assessment rule θt = [θSAT t , θHS GPA t ] where θSAT t N(1, 10) and θHS GPA t N(1, 2). ... For each applicant t, we perturb E[Et] with random noise drawn from N(0.5, 0.25) to the top left entry and noise drawn from N(0.1, 0.01) the bottom right entry to produce Et. ... We ran each method for 1000 time-steps with a decaying learning rate of 0.001 / T.