Notice: The reproducibility variables underlying each score are classified using an automated LLM-based pipeline, validated against a manually labeled dataset. LLM-based classification introduces uncertainty and potential bias; scores should be interpreted as estimates. Full accuracy metrics and methodology are described in Coakley et alK. L. Coakley, T. Snelleman, H. Hoos, and O. E. Gundersen, "The embrace of open science: An analysis of a decade of AI research and 56 800 conference papers," Under Review, 2026..
Interpreting and Improving Diffusion Models from an Optimization Perspective
Authors: Frank Permenter, Chenyang Yuan
ICML 2024 | Venue PDF | LLM Run Details
| Reproducibility Variable | Result | LLM Response |
|---|---|---|
| Research Type | Experimental | 6. Experiments |
| Researcher Affiliation | Industry | 1Toyota Research Institute, Cambridge, Massachusetts, USA. Correspondence to: Chenyang Yuan <EMAIL>, Frank Permenter <EMAIL>. |
| Pseudocode | Yes | Algorithm 1 DDIM sampler (Song et al., 2020a) Require: (σN, . . . , σ0), x N N(0, I), ϵθ Ensure: Compute x0 with N evaluations of ϵθ for t = N, . . . , 1 do xt 1 xt + (σt 1 σt)ϵθ(xt, σt) return x0 |
| Open Source Code | Yes | Code for the experiments is available at https: //github.com/Toyota Research Institute/ gradient-estimation-sampler |
| Open Datasets | Yes | We use denoisers from (Ho et al., 2020; Song et al., 2020a) that were pretrained on the CIFAR10 (32x32) and Celeb A (64x64) datasets (Krizhevsky et al., 2009; Liu et al., 2015). |
| Dataset Splits | No | The paper mentions using 'training images' and evaluating on the 'MS COCO validation set', but does not provide specific details on train/validation/test splits for the datasets used to train or evaluate their models. |
| Hardware Specification | Yes | All the experiments were run on a single Nvidia RTX 4090 GPU. |
| Software Dependencies | Yes | We also use Stable Diffusion 2.1 provided in https://huggingface.co/stabilityai/stable-diffusion-2-1. |
| Experiment Setup | Yes | For the CIFAR-10 and Celeb A models, we choose σ1 = q σDDIM(N) 1 and σ0 = 0.01. For CIFAR-10 N = 5, 10, 20, 50 we choose σN = 40 and for Celeb A N = 5, 10, 20, 50 we choose σN = 40, 80, 100, 120 respectively. For Stable Diffusion, we use the same sigma schedule as that in DDIM. ... We found that setting γ = 2 works well for N < 20; for larger N slightly increasing γ also improves sample quality (see Appendix E for more details). |