Retrieval-Guided Reinforcement Learning for Boolean Circuit Minimization

Authors: Animesh Basak Chowdhury, Marco Romanelli, Benjamin Tan, Ramesh Karri, Siddharth Garg

ICLR 2024

Reproducibility Variable | Result | LLM Response
Research Type | Experimental | This study conducts a thorough examination of learning and search techniques for logic synthesis... Our findings showcase substantial enhancements... Furthermore, ABC-RL achieves an impressive up to 9x reduction in runtime... (Section 3: Empirical Evaluation)
Researcher Affiliation | Academia | Animesh Basak Chowdhury (1), Marco Romanelli (1), Benjamin Tan (2), Ramesh Karri (1), Siddharth Garg (1); 1: New York University, 2: University of Calgary
Pseudocode | Yes | Algorithm 1, ABC-RL: Policy agent pre-training
Open Source Code | No | The paper provides a reproducibility statement but does not include an explicit statement about releasing the authors' code or a link to a code repository.
Open Datasets | Yes | We consider three datasets used by the logic synthesis community: MCNC (Yang, 1991), and the EPFL arithmetic and EPFL random control benchmarks (Amarú et al., 2015).
Dataset Splits | Yes | We randomly split the 56 total netlists obtained from all three benchmarks into 23 netlists for training, 13 for validation (11 MCNC, 1 EPFL-arith, 1 EPFL-rand), and the remaining 20 for test (see Table 1). A hedged split sketch follows the table.
Hardware Specification | Yes | We performed the training on a server machine with one NVIDIA RTX A4000 with 16GB VRAM.
Software Dependencies | No | The paper describes various methods and models (e.g., GCN, BERT, Adam), citing their original papers, but it does not specify the version numbers for software libraries or dependencies used in the implementation.
Experiment Setup | Yes | Agents are trained for 50 epochs using Adam with an initial learning rate of 0.01. In each training epoch, we perform MCTS on all netlists with an MCTS search budget K = 512 per synthesis level...We set T = 100 and δ_th = 0.007 based on our validation data. A hedged setup sketch follows the table.
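
The dataset-split row quotes a plain random partition of the 56 netlists into 23/13/20. The sketch below illustrates one way such a partition could be reproduced; the netlist names and the random seed are placeholders (the real names are listed in Table 1 of the paper, no seed is reported, and the paper's validation set additionally has a fixed composition of 11 MCNC, 1 EPFL-arith, and 1 EPFL-rand, which a naive shuffle does not enforce).

    import random

    # Placeholder names for the 56 benchmark netlists (MCNC + EPFL arithmetic +
    # EPFL random control); the real names appear in Table 1 of the paper.
    all_netlists = [f"netlist_{i:02d}" for i in range(56)]

    random.seed(0)  # placeholder seed; the paper does not report one
    random.shuffle(all_netlists)

    # Split sizes as stated in the paper: 23 train, 13 validation, 20 test.
    train_set = all_netlists[:23]
    val_set = all_netlists[23:36]
    test_set = all_netlists[36:]

    assert len(train_set) == 23 and len(val_set) == 13 and len(test_set) == 20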
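
The experiment-setup row collects the reported hyperparameters (50 epochs, Adam at learning rate 0.01, MCTS budget K = 512 per synthesis level, T = 100, δ_th = 0.007). The sketch below is only a minimal, hypothetical outline of how a pre-training loop with those settings could be wired together, assuming an AlphaZero-style policy trained against MCTS targets; the toy network, the run_mcts stub, the action-space size, and the random input embeddings are placeholders, not the authors' implementation (the actual procedure is Algorithm 1 in the paper).

    import torch
    import torch.nn as nn
    import torch.nn.functional as F

    # Hyperparameters reported in the experiment-setup quote above.
    NUM_EPOCHS = 50        # training epochs
    LEARNING_RATE = 0.01   # initial Adam learning rate
    MCTS_BUDGET = 512      # K: MCTS search budget per synthesis level
    TEMPERATURE = 100      # T (reported alongside the setup; not used in this sketch)
    DELTA_TH = 0.007       # delta_th (reported alongside the setup; not used in this sketch)

    # Placeholders (NOT from the paper): action-space size, a toy policy network,
    # a dummy MCTS routine, and random stand-ins for the 23 training netlists.
    NUM_ACTIONS = 7
    policy = nn.Sequential(nn.Linear(64, 128), nn.ReLU(), nn.Linear(128, NUM_ACTIONS))
    optimizer = torch.optim.Adam(policy.parameters(), lr=LEARNING_RATE)

    def run_mcts(netlist_embedding, budget):
        # Placeholder: a real implementation would return the MCTS visit-count
        # distribution over synthesis actions after `budget` simulations.
        return torch.softmax(torch.randn(NUM_ACTIONS), dim=0)

    train_embeddings = [torch.randn(64) for _ in range(23)]

    for epoch in range(NUM_EPOCHS):
        for emb in train_embeddings:
            target = run_mcts(emb, budget=MCTS_BUDGET)  # MCTS-derived action distribution
            logits = policy(emb)
            # Cross-entropy between the policy's prediction and the MCTS target.
            loss = F.cross_entropy(logits.unsqueeze(0), target.unsqueeze(0))
            optimizer.zero_grad()
            loss.backward()
            optimizer.step()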