GraphDF: A Discrete Flow Model for Molecular Graph Generation
Authors: Youzhi Luo, Keqiang Yan, Shuiwang Ji
ICML 2021
| Reproducibility Variable | Result | LLM Response |
|---|---|---|
| Research Type | Experimental | Comprehensive experimental results show that GraphDF outperforms prior methods on random generation, property optimization, and constrained optimization tasks. In this section, we evaluate our GraphDF on three tasks of molecule generation, as mentioned in Section 2.1. |
| Researcher Affiliation | Academia | Department of Computer Science & Engineering, Texas A&M University, TX, USA. |
| Pseudocode | No | The paper states, 'We summarize the detailed generation and training algorithms of GraphDF in Appendix A and B, respectively.', but these appendices, which contain the pseudocode/algorithm blocks, are not included in the main paper text. |
| Open Source Code | Yes | The implementation of GraphDF has been integrated into the DIG (Liu et al., 2021) framework. |
| Open Datasets | Yes | For random generation of molecular graphs, we train and evaluate GraphDF on three molecule datasets, ZINC250K (Irwin et al., 2012), QM9 (Ramakrishnan et al., 2014), and MOSES (Polykovskiy et al., 2020). |
| Dataset Splits | No | The paper mentions training on datasets like ZINC250K and evaluating on generated graphs, but does not explicitly provide details about training, validation, or test dataset splits (e.g., percentages or specific partitioning methods). |
| Hardware Specification | No | The paper does not provide specific details about the hardware (e.g., GPU/CPU models, memory) used for running experiments. |
| Software Dependencies | No | The paper mentions using the 'Adam optimizer' and 'RDKit (Landrum, 2016)', but does not give version numbers for these or for any other libraries required to reproduce the experiments (see the version-logging sketch after the table). |
| Experiment Setup | Yes | The flow model in GraphDF consists of 12 modulo shift modules, where the shared R-GCN has 3 message passing layers. The hidden dimension and output dimension of node embeddings are 128. ... Adam (Kingma & Ba, 2015) optimizer is used to train GraphDF models. On each molecule dataset, GraphDF is trained for 10 epochs, where the batch size is 32 and the learning rate is 0.001. ... where we fix the temperature τ = 0.1 in experiments. (A hedged training-configuration sketch based on these values follows the table.) |
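
For readers attempting a reproduction, the snippet below is a minimal PyTorch sketch of the reported training configuration (12 flow layers, a shared R-GCN with 3 message-passing layers, hidden dimension 128, Adam with learning rate 0.001, batch size 32, 10 epochs, sampling temperature τ = 0.1). The `GraphDFModel` class, its `negative_log_likelihood` method, and the toy dataset are hypothetical stand-ins so the sketch runs end to end; the actual architecture and training code live in the DIG framework.

```python
import torch
import torch.nn as nn
from torch.utils.data import DataLoader, TensorDataset


class GraphDFModel(nn.Module):
    """Hypothetical stand-in for the GraphDF model (real code is in DIG)."""

    def __init__(self, num_flow_layers=12, num_rgcn_layers=3, hidden_dim=128):
        super().__init__()
        # Stand-in parameters so the sketch is runnable; not the actual architecture.
        self.dummy = nn.Linear(hidden_dim, hidden_dim)

    def negative_log_likelihood(self, batch):
        # Placeholder loss; the real model computes the discrete-flow NLL.
        x, = batch
        return self.dummy(x).pow(2).mean()


# Toy tensors standing in for preprocessed ZINC250K graphs.
dataset = TensorDataset(torch.randn(256, 128))
loader = DataLoader(dataset, batch_size=32, shuffle=True)   # batch size 32

model = GraphDFModel(num_flow_layers=12, num_rgcn_layers=3, hidden_dim=128)
optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)   # learning rate 0.001

model.train()
for epoch in range(10):                                     # 10 epochs per dataset
    for batch in loader:
        optimizer.zero_grad()
        loss = model.negative_log_likelihood(batch)
        loss.backward()
        optimizer.step()

# Temperature tau = 0.1, as reported. This shows a common way to apply a
# temperature to categorical logits during sampling; it is an assumption,
# not necessarily the authors' exact sampling procedure.
tau = 0.1
logits = torch.randn(9)                      # e.g., scores over candidate atom types
probs = torch.softmax(logits / tau, dim=-1)
sample = torch.multinomial(probs, num_samples=1)
```

The low temperature (τ = 0.1) sharpens the categorical distribution, so sampling concentrates on the highest-scoring categories; the quoted hyperparameters are otherwise used verbatim in the sketch.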
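
Because the paper does not pin software versions, reproducers may want to record the library versions used for each run alongside their results. The sketch below is one common way to do this; the choice of libraries to log and the `environment.json` filename are ours, not something the paper specifies.

```python
import json

import torch
from rdkit import rdBase

# Record the library versions for this run so results can be tied to an
# exact software environment (the paper itself does not pin versions).
env = {
    "torch": torch.__version__,
    "rdkit": rdBase.rdkitVersion,
    "cuda": torch.version.cuda,  # None on CPU-only builds
}

with open("environment.json", "w") as f:
    json.dump(env, f, indent=2)

print(env)
```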