reproducibilityindex.ai

Efficient Integrators for Diffusion Generative Models

Authors: Kushagra Pandey, Maja Rudolph, Stephan Mandt

ICLR 2024 | Conference PDF | Archive PDF | Plain Text | LLM Run Details

Reproducibility Variable	Result	LLM Response
Research Type	Experimental	Applied to Phase Space Langevin Diffusion [Pandey & Mandt, 2023] on CIFAR-10, our deterministic and stochastic samplers achieve FID scores of 2.11 and 2.36 in only 100 network function evaluations (NFE) as compared to 2.57 and 2.63 for the best-performing baselines, respectively.
Researcher Affiliation	Collaboration	Kushagra Pandey Department of Computer Science University of California, Irvine pandeyk1@uci.edu; Maja Rudolph Bosch Center for Artificial Intelligence Maja.Rudolph@us.bosch.com; Stephan Mandt Department of Computer Science University of California, Irvine mandt@uci.edu
Pseudocode	Yes	Algorithm 1 Conjugate Integrators (defined in Eqn. 8); Algorithm 2 Conjugate Symplectic Euler; Algorithm 3 Conjugate Velocity Verlet; Algorithm 4 Conjugate OBA
Open Source Code	Yes	Our code and model checkpoints will be made publicly available at https://github.com/mandt-lab/PSLD.
Open Datasets	Yes	We use the CIFAR-10 (Krizhevsky, 2009) (50k images), Celeb A-64 (downsampled to 64 x 64 resolution, 200k images) (Liu et al., 2015) and the AFHQv2-64 (Choi et al., 2020) (downsampled to 64 x 64 resolution, 15k images) datasets for comparisons.
Dataset Splits	No	The paper mentions using CIFAR-10, Celeb A-64, and AFHQv2-64 datasets but does not explicitly state specific train/validation/test splits (e.g., percentages or exact counts for each split).
Hardware Specification	Yes	Table 3: # GPUs 8
Software Dependencies	Yes	The above matrix exponential can be computed using standard scientific libraries like Py Torch (Paszke et al., 2019) or Sci Py (Virtanen et al., 2020). ... For numerical integration, we use the odeint method from the torchdiffeq package (Chen, 2018).
Experiment Setup	Yes	For all ablation results in Section 3 in the main text, we use pre-trained PSLD (Pandey & Mandt, 2023) models for CIFAR-10 with SDE hyperparameters Γ = 0.01, ν = 4.01 and β = 8.0. ... Table 3 summarizes the different training hyperparameters across datasets. We use the Hybrid Score Matching (HSM) objective during training. ... We use quadratic striding for timestep discretization proposed in Dockhorn et al. (2022b) during sampling, which ensures more number of score function... Similar to PSLD, we set ϵ = 1e 3 during sampling for all experiments.