reproducibilityindex.ai

Convergent Representations of Computer Programs in Human and Artificial Neural Networks

Authors: Shashank Srikant, Ben Lipkin, Anna Ivanova, Evelina Fedorenko, Una-May O'Reilly

NeurIPS 2022 | Conference PDF | Archive PDF | Plain Text | LLM Run Details

Reproducibility Variable	Result	LLM Response
Research Type	Experimental	We investigate this question by analyzing brain recordings derived from functional magnetic resonance imaging (fMRI) studies of programmers comprehending Python code. We first evaluate a selection of static and dynamic code properties, such as abstract syntax tree (AST)-related and runtime-related metrics and study how they relate to neural brain signals. Then, to learn whether brain representations encode fine-grained information about computer programs, we train a probe to align brain recordings with representations learned by a suite of ML models trained on code.
Researcher Affiliation	Collaboration	Shashank Srikant 1,4 Benjamin Lipkin 2 Anna A. Ivanova1,2,3 Evelina Fedorenko2,3 Una-May O Reilly1,4 * Equal contribution 1CSAIL, MIT 2 BCS, MIT 3 Mc Govern Institute for Brain Research MIT-IBM Watson AI Lab {shash, lipkinb, annaiv, evelina9}@mit.edu, unamay@csail.mit.edu
Pseudocode	No	The paper does not contain any sections or figures explicitly labeled 'Pseudocode' or 'Algorithm', nor does it present structured steps formatted like code.
Open Source Code	Yes	We make all the corresponding code, data, and analysis publicly available at https://github.com/ALFA-group/code-representations-ml-brain
Open Datasets	Yes	We utilize the publicly available dataset from Ivanova et al. [2020] for all our analyses as it offers high quality, granular brain response data on code comprehension stimuli controlled across multiple code properties. We use the publicly available brain recordings released as part of the study by Ivanova et al. [2020] (MIT license).
Dataset Splits	Yes	For each participant, we train a linear regression/classification model with L2-regularization for on unique cross-validated leave-one-run-out folds of the 72 programs they attempted (or 48 programs when sentences are removed).
Hardware Specification	Yes	All experiments in this work were run on a single 8-core laptop in under an hour following setup.
Software Dependencies	No	The paper mentions various models and general programming languages (Python), but does not provide specific version numbers for software dependencies or libraries used to run the experiments.
Experiment Setup	Yes	See Appendix B for a detailed description of model hyper-parameters, cross-validation settings, and significance testing.