Reproducibility Index

Notice: The reproducibility variables underlying each score are classified using an automated LLM-based pipeline, validated against a manually labeled dataset. LLM-based classification introduces uncertainty and potential bias; scores should be interpreted as estimates. Full accuracy metrics and methodology are described in Coakley et alK. L. Coakley, T. Snelleman, H. Hoos, and O. E. Gundersen, "The embrace of open science: An analysis of a decade of AI research and 56 800 conference papers," Under Review, 2026..

Rectified CFG++ for Flow Based Models

Authors: Shreshth Saini, Shashank Gupta, Alan Bovik

NeurIPS 2025 | Venue PDF | LLM Run Details

Reproducibility Variable	Result	LLM Response
Research Type	Experimental	Extensive experiments on large-scale text-to-image models (Flux, Stable Diffusion 3/3.5, Lumina) show that Rectiﬁed-CFG++ consistently outperforms standard CFG on benchmark datasets such as MS-COCO, LAION-Aesthetic, and T2I-Comp Bench. ... In this section, we present a comprehensive empirical evaluation of Rectiﬁed-CFG++ for text-to-image (T2I) generation using large-scale models. Our experiments aim to rigorously demonstrate the effectiveness of our approach at improving text image alignment, color ﬁdelity, and the preservation of ﬁne details, generating high-quality samples while expending comparable inference costs as competing baseline methods. ... Table 1: Comprehensive Quantitative Evaluation of CFG against Rectiﬁed-CFG++ when both are integrated into leading T2I Models on MS-COCO 10K validation samples.
Researcher Affiliation	Academia	Shreshth Saini Shashank Gupta Alan C. Bovik The University of Texas at Austin EMAIL
Pseudocode	Yes	Algorithm 1 Rectiﬁed-CFG++
Open Source Code	Yes	Project page: https://rectified-cfgpp.github.io/. ... Reproducibility: Apart from the pseudocode and implementation details provided in the paper, the source code is available on the project page: https://rectiﬁed-cfgpp.github.io/. The complete source code will be released upon acceptance.
Open Datasets	Yes	Extensive experiments on large-scale text-to-image models (Flux, Stable Diffusion 3/3.5, Lumina) show that Rectiﬁed-CFG++ consistently outperforms standard CFG on benchmark datasets such as MS-COCO, LAION-Aesthetic, and T2I-Comp Bench. ... Speciﬁcally, we used subsets of the MS-COCO dataset [20, 5] ... We also used a subset of 1,000 image-text pairs from LAION-Aesthetic [35] ... and the 1,000 prompts from Pick-A-Pic [18].
Dataset Splits	Yes	Speciﬁcally, we used subsets of the MS-COCO dataset [20, 5], comprising 10,000 and 1,000 image-text pairs (referred to as MS-COCO 10K and MS-COCO 1K, respectively). We also used a subset of 1,000 image-text pairs from LAION-Aesthetic [35] (LAION-Aesthetic 1K) and the 1,000 prompts from Pick-A-Pic [18].
Hardware Specification	Yes	All experiments were performed using a single NVIDIA A100 40GB GPU.
Software Dependencies	Yes	Code was written in Python 3.10, using Py Torch 2.0.1 and the latest Hugging Face Diffusers library.
Experiment Setup	Yes	When using our proposed method, Rectiﬁed-CFG++, we determined a set of effective hyperparameters which were kept consistent across all datasets and when integrated into baseline models. For all the compared methods, we utilized the default settings and conﬁgurations as reported in their original publications to ensure fair comparisons. Further detailed information regarding the experimental setup and hyperparameter settings can be found in Appendix D.1.