Notice: The reproducibility variables underlying each score are classified using an automated LLM-based pipeline, validated against a manually labeled dataset. LLM-based classification introduces uncertainty and potential bias; scores should be interpreted as estimates. Full accuracy metrics and methodology are described in Coakley et alK. L. Coakley, T. Snelleman, H. Hoos, and O. E. Gundersen, "The embrace of open science: An analysis of a decade of AI research and 56 800 conference papers," Under Review, 2026..
Unlocking Point Processes through Point Set Diffusion
Authors: David Lüdke, Enric Rabasseda Raventós, Marcel Kollovieh, Stephan Günnemann
ICLR 2025 | Venue PDF | LLM Run Details
| Reproducibility Variable | Result | LLM Response |
|---|---|---|
| Research Type | Experimental | Experiments on synthetic and real-world datasets demonstrate that POINT SET DIFFUSION achieves state-of-the-art performance in unconditional and conditional generation of spatial and spatiotemporal point processes while providing up to orders of magnitude faster sampling. |
| Researcher Affiliation | Academia | David L udke , Enric Rabasseda Ravent os , Marcel Kollovieh, Stephan G unnemann Department of Informatics & Munich Data Science Institute Technical University of Munich, Germany EMAIL |
| Pseudocode | Yes | Algorithm 1 Conditional sampling Require: Xc 0 = C(X0) 1: XT λϵ 2: for t = T, . . . , 1 do 3: e X0 pθ(X0|Xt) 4: e Xt 1 q(Xt 1| e X0, Xt) (reverse 3.2) 5: Xc t 1 q(Xc t 1|Xc 0) (forward 3.1) 6: Xt 1 = C ( e Xt 1) C(Xc t 1) 7: end for 8: return C (X0) |
| Open Source Code | Yes | Code is available at https://www.cs.cit.tum.de/daml/point-set-diffusion |
| Open Datasets | Yes | We evaluate our model on four benchmark datasets with their proposed pre-processing and splits: three real-world datasets Japan Earthquakes (U.S. Geological Survey, 2024), New Jersey COVID-19 Cases (The New York Times, 2024), and Citibike Pickups (Citi Bike, 2024) and one synthetic dataset, Pinwheel, based on a multivariate Hawkes process (Soni, 2019). |
| Dataset Splits | Yes | We follow Chen et al. (2021) and evaluate our model on four benchmark datasets with their proposed pre-processing and splits |
| Hardware Specification | Yes | All models have been trained on an NVIDIA A100-PCIE-40GB. |
| Software Dependencies | No | We use Adam as the optimizer and a fixed weight decay of 0.0001 to avoid overfitting. |
| Experiment Setup | Yes | Hyperparameters: We use the same hyperparameters for all datasets and types of point processes. In a hyperparameter study A.8, we have found T = 100 for our cosine noise schedule (Nichol et al., 2021) to give a good trade off between sampling time and quality. Further, we leverage a hidden dimension and embedding size of 32. For training, we use a batch size of 128 and a learning rate of 0.001. We use Adam as the optimizer and a fixed weight decay of 0.0001 to avoid overfitting. To avoid exploding gradients, we clip the gradients to have a norm lower than 2. |