Notice: The reproducibility variables underlying each score are classified using an automated LLM-based pipeline, validated against a manually labeled dataset. LLM-based classification introduces uncertainty and potential bias; scores should be interpreted as estimates. Full accuracy metrics and methodology are described in [1].
Transactions on Machine Learning Research (TMLR)
The Percentage of Empirical Papers Documenting Each Reproducibility Variable
| Venue |
Reproducibility Score based on Gundersen et al. (2025)
|
Global mean is the average score over the seven reproducibility variables for empirical research papers.
|
Percentage of papers that are empirical research vs theoretical research
|
Percentage of empirical research papers with at least one author from Industry
|
Website | ||
|---|---|---|---|---|---|---|---|
| TMLR | 2025 | 1418 | 0.62 | 4.51 | 94.71% | 33.8% | |
| TMLR | 2024 | 947 | 0.61 | 4.36 | 94.51% | 34.64% | |
| TMLR | 2023 | 608 | 0.59 | 4.29 | 94.41% | 42.51% | |
| TMLR | 2022 | 216 | 0.61 | 4.23 | 96.3% | 48.08% |