Reproducibility Index

Notice: The reproducibility variables underlying each score are classified using an automated LLM-based pipeline, validated against a manually labeled dataset. LLM-based classification introduces uncertainty and potential bias; scores should be interpreted as estimates. Full accuracy metrics and methodology are described in Coakley et alK. L. Coakley, T. Snelleman, H. Hoos, and O. E. Gundersen, "The embrace of open science: An analysis of a decade of AI research and 56 800 conference papers," Under Review, 2026..

Unlocking Point Processes through Point Set Diffusion

Authors: David Lüdke, Enric Rabasseda Raventós, Marcel Kollovieh, Stephan Günnemann

ICLR 2025 | Venue PDF | LLM Run Details

Reproducibility Variable	Result	LLM Response
Research Type	Experimental	Experiments on synthetic and real-world datasets demonstrate that POINT SET DIFFUSION achieves state-of-the-art performance in unconditional and conditional generation of spatial and spatiotemporal point processes while providing up to orders of magnitude faster sampling.
Researcher Affiliation	Academia	David L udke , Enric Rabasseda Ravent os , Marcel Kollovieh, Stephan G unnemann Department of Informatics & Munich Data Science Institute Technical University of Munich, Germany EMAIL
Pseudocode	Yes	Algorithm 1 Conditional sampling Require: Xc 0 = C(X0) 1: XT λϵ 2: for t = T, . . . , 1 do 3: e X0 pθ(X0\|Xt) 4: e Xt 1 q(Xt 1\| e X0, Xt) (reverse 3.2) 5: Xc t 1 q(Xc t 1\|Xc 0) (forward 3.1) 6: Xt 1 = C ( e Xt 1) C(Xc t 1) 7: end for 8: return C (X0)
Open Source Code	Yes	Code is available at https://www.cs.cit.tum.de/daml/point-set-diffusion
Open Datasets	Yes	We evaluate our model on four benchmark datasets with their proposed pre-processing and splits: three real-world datasets Japan Earthquakes (U.S. Geological Survey, 2024), New Jersey COVID-19 Cases (The New York Times, 2024), and Citibike Pickups (Citi Bike, 2024) and one synthetic dataset, Pinwheel, based on a multivariate Hawkes process (Soni, 2019).
Dataset Splits	Yes	We follow Chen et al. (2021) and evaluate our model on four benchmark datasets with their proposed pre-processing and splits
Hardware Specification	Yes	All models have been trained on an NVIDIA A100-PCIE-40GB.
Software Dependencies	No	We use Adam as the optimizer and a fixed weight decay of 0.0001 to avoid overfitting.
Experiment Setup	Yes	Hyperparameters: We use the same hyperparameters for all datasets and types of point processes. In a hyperparameter study A.8, we have found T = 100 for our cosine noise schedule (Nichol et al., 2021) to give a good trade off between sampling time and quality. Further, we leverage a hidden dimension and embedding size of 32. For training, we use a batch size of 128 and a learning rate of 0.001. We use Adam as the optimizer and a fixed weight decay of 0.0001 to avoid overfitting. To avoid exploding gradients, we clip the gradients to have a norm lower than 2.