Reproducibility Index

Notice: The reproducibility variables underlying each score are classified using an automated LLM-based pipeline, validated against a manually labeled dataset. LLM-based classification introduces uncertainty and potential bias; scores should be interpreted as estimates. Full accuracy metrics and methodology are described in Coakley et alK. L. Coakley, T. Snelleman, H. Hoos, and O. E. Gundersen, "The embrace of open science: An analysis of a decade of AI research and 56 800 conference papers," Under Review, 2026..

How Far Are We From AGI: Are LLMs All We Need?

Authors: Tao Feng, Chuanyang Jin, Jingyu Liu, Kunlun Zhu, Haoqin Tu, Zirui Cheng, Guanyu Lin, Jiaxuan You

TMLR 2024 | Venue PDF | LLM Run Details

Reproducibility Variable	Result	LLM Response
Research Type	Theoretical	Unlike previous survey papers, this work goes beyond summarizing LLMs by addressing key questions about our progress toward AGI and outlining the strategies essential for its realization through comprehensive analysis, in-depth discussions, and novel insights.
Researcher Affiliation	Academia	1University of Illinois Urbana-Champaign 2Johns Hopkins University 3University of Chicago 4University of California, Santa Cruz 5Carnegie Mellon University
Pseudocode	No	The paper is a comprehensive survey and does not present any novel algorithms or procedures in pseudocode blocks.
Open Source Code	Yes	1Project website: https://github.com/ulab-uiuc/AGI-survey. Unlike traditional publications that remain static, we embrace an innovative approach by treating this paper as a living document. We warmly welcome feedback from the community and plan to update the paper annually. Contributors on the project website will be gratefully acknowledged in future revisions.
Open Datasets	No	The paper is a survey and does not present new experimental results based on a specific dataset. It discusses many existing datasets in the context of other research, but no dataset is used for experiments conducted by the authors of this paper.
Dataset Splits	No	The paper is a survey and does not conduct experiments with specific datasets, therefore it does not provide dataset splits.
Hardware Specification	No	The paper is a survey and does not conduct experiments, therefore it does not specify hardware used for running experiments.
Software Dependencies	No	The paper is a survey and does not conduct experiments, therefore it does not specify software dependencies with version numbers.
Experiment Setup	No	The paper is a survey and does not conduct original experiments, therefore it does not provide an experimental setup with hyperparameter values or training configurations.