Reproducibility Index

Notice: The reproducibility variables underlying each score are classified using an automated LLM-based pipeline, validated against a manually labeled dataset. LLM-based classification introduces uncertainty and potential bias; scores should be interpreted as estimates. Full accuracy metrics and methodology are described in Coakley et alK. L. Coakley, T. Snelleman, H. Hoos, and O. E. Gundersen, "The embrace of open science: An analysis of a decade of AI research and 56 800 conference papers," Under Review, 2026..

Evolutionary Prediction Games

Authors: Eden Saig, Nir Rosenfeld

NeurIPS 2025 | Venue PDF | LLM Run Details

Reproducibility Variable	Result	LLM Response
Research Type	Experimental	We conclude by complementing our analysis with experiments using both synthetic and real data and coupled with simulated dynamics. Our empirical results shed light on when and how certain user groups are likely to dominate, disappear, or coexist, and demonstrate how different design choices can shape social outcomes even if inadvertently.
Researcher Affiliation	Academia	Eden Saig Technion Israel Institute of Technology Haifa, Israel EMAIL Nir Rosenfeld Technion Israel Institute of Technology Haifa, Israel EMAIL
Pseudocode	No	The paper does not contain explicit pseudocode or algorithm blocks. Methods are described using mathematical formulations and textual explanations.
Open Source Code	Yes	Code is available at: https://github.com/edensaig/evolutionary-prediction-games.
Open Datasets	Yes	Method. We use the CIFAR-10 image recognition dataset [36]... Method. We use the MNIST dataset...
Dataset Splits	Yes	The training and test sets are split independently, and for each p we measure the prediction model s accuracy on both splits of the test sets (representing acc A(hp) and acc B(hp)).
Hardware Specification	Yes	Synthetic data simulations were run on a single Macbook Pro laptop, with 16GB of RAM, M2 processor, and no GPU. Experiments involving neural networks (Section 6) were run on a dedicated server with an AMD EPYC 7502 CPU, 503GB of RAM, and an Nvidia RTX A4000 GPU.
Software Dependencies	No	We implement our simulations and analysis in Python. Our synthetic-data experiments rely on scikit-learn [47] for learning algorithm implementations, our CIFAR-10 and MNIST experiments rely on Py Torch [46] and ffcv [38], and our ACSIncome experiment relies on scikit-learn and XGBoost [10]. We use matplotlib [29] for plotting, and mpltern [30] for ternary plots. (No specific version numbers are provided for these software components, only citations to the libraries/papers).
Experiment Setup	Yes	Training is performed for 200 epochs using SGD with learning rate 0.01 and momentum 0.5.