Notice: The reproducibility variables underlying each score are classified using an automated LLM-based pipeline, validated against a manually labeled dataset. LLM-based classification introduces uncertainty and potential bias; scores should be interpreted as estimates. Full accuracy metrics and methodology are described in [1].
Journal of Artificial Intelligence Research (JAIR)
The Percentage of Empirical Papers Documenting Each Reproducibility Variable
| Venue | Year | Papers | Reproducibility score (based on Gundersen et al., 2025) | Global mean (average score over the seven reproducibility variables for empirical research papers) | Empirical papers (% of all papers; the rest are theoretical) | Empirical papers with at least one industry author (%) | Website |
|---|---|---|---|---|---|---|---|
| JAIR | 2025 | 132 | 0.51 | 3.89 | 72.73% | 14.58% | |
| JAIR | 2024 | 105 | 0.63 | 4.19 | 68.57% | 16.67% | |
| JAIR | 2023 | 100 | 0.51 | 4.11 | 61.0% | 19.67% | |
| JAIR | 2022 | 118 | 0.5 | 4.13 | 72.03% | 23.53% | |
| JAIR | 2021 | 104 | 0.53 | 3.79 | 63.46% | 25.76% | |
| JAIR | 2020 | 84 | 0.48 | 3.83 | 71.43% | 16.67% | |
| JAIR | 2019 | 75 | 0.41 | 3.71 | 65.33% | 22.45% | |
| JAIR | 2018 | 69 | 0.37 | 3.61 | 63.77% | 27.27% | |
| JAIR | 2017 | 68 | 0.41 | 3.96 | 66.18% | 11.11% | |
| JAIR | 2016 | 65 | 0.44 | 3.94 | 80.0% | 19.23% | |
| JAIR | 2015 | 48 | 0.39 | 4.18 | 68.75% | 21.21% | |
| JAIR | 2014 | 68 | 0.35 | 3.77 | 69.12% | 4.26% | |
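
The percentages in the table are rounded, so it can help to recover the underlying paper counts when comparing years. The sketch below assumes that the integer column after the year is the total number of papers and that the first percentage column is the share of those papers that are empirical. Both are readings of the table, not something the page states. The function names `empirical_count` and `percent_of` are hypothetical helpers introduced for illustration.

```python
# Rows copied from the table above: (year, total papers, percent empirical).
# Assumption: the third table column is the total paper count for that year.
ROWS = [
    (2025, 132, 72.73),
    (2024, 105, 68.57),
    (2023, 100, 61.0),
]


def empirical_count(total_papers: int, percent_empirical: float) -> int:
    """Estimate the number of empirical papers from a rounded percentage."""
    return round(total_papers * percent_empirical / 100)


def percent_of(count: int, total: int) -> float:
    """Return count as a percentage of total, rounded to two decimals."""
    return round(100 * count / total, 2)


for year, total, pct in ROWS:
    n = empirical_count(total, pct)
    # Recomputing the percentage from the estimated count checks that the
    # estimate is consistent with the value reported in the table.
    print(year, n, percent_of(n, total))
```

If the recomputed percentage matches the reported one, the estimated count is consistent with the table. This is a consistency check only and does not confirm the counts themselves.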