reproducibilityindex.ai

Improving Molecular Design by Stochastic Iterative Target Augmentation

Authors: Kevin Yang, Wengong Jin, Kyle Swanson, Dr.Regina Barzilay, Tommi Jaakkola

ICML 2020 | Conference PDF | Archive PDF | Plain Text | LLM Run Details

Reproducibility Variable	Result	LLM Response
Research Type	Experimental	We demonstrate significant gains over strong baselines for both unconditional and conditional molecular design. In particular, our approach outperforms the previous state-of-the-art in conditional molecular design by over 10% in absolute gain. Finally, we show that our approach is useful in other domains as well, such as program synthesis. As shown in Table 1, our iterative augmentation paradigm significantly improves the performance of VSeq2Seq and Hier GNN.
Researcher Affiliation	Academia	1UC Berkeley 2MIT 3University of Cambridge. Correspondence to: Kevin Yang <yangk@berkeley.edu>.
Pseudocode	Yes	Algorithm 1 Stochastic iterative target augmentation
Open Source Code	No	The paper does not include an unambiguous statement that the authors are releasing their code for the work described, nor does it provide a direct link to a source-code repository for their methodology.
Open Datasets	Yes	Additionally, we evaluate our augmentation of VSeq2Seq in a transductive setting, as well as in a semi-supervised setting where we provide 100K additional source-side precursors from the ZINC database (Sterling & Irwin, 2015). Our task is based on the educational Karel programming language (Pattis, 1981) used for evaluation in Bunel et al. (2018) and Chen et al. (2019).
Dataset Splits	No	The paper mentions "validation set performance" in Figure 5, but it does not provide specific dataset split information (exact percentages, sample counts, citations to predefined splits, or detailed splitting methodology) needed to reproduce the data partitioning in the provided text.
Hardware Specification	No	The paper does not provide specific hardware details (exact GPU/CPU models, processor types with speeds, memory amounts, or detailed computer specifications) used for running its experiments.
Software Dependencies	No	The paper mentions 'Chemprop package', but does not provide specific ancillary software details, such as library or solver names with version numbers, needed to replicate the experiment.
Experiment Setup	Yes	Full hyperparameters are provided in Appendix E.1.