reproducibilityindex.ai

Helpful Information Sharing for Partially Informed Planning Agents

Authors: Sarah Keren, David Wies, Sara Bernardini

IJCAI 2023 | Conference PDF | Archive PDF | Plain Text | LLM Run Details

Reproducibility Variable	Result	LLM Response
Research Type	Experimental	6 Empirical Evaluation The objective of the evaluation is to examine our approaches in terms of the computational resources used and their ability to find helpful interventions in a variety of domains. ... We compare three HIS solution approaches: BFS, Lazy-BFS (Section 4.1), and the Tka transformation (Section 4.2). ... Table 1 compares the computational resources used by BFS, Lazy-BFS and Tka.
Researcher Affiliation	Academia	1The Taub Faculty of Computer Science, Technion Israel Institute of Technology 2 Benin School of Computer Science and Engineering, The Hebrew University of Jerusalem 3Department of Computer Science, Royal Holloway University of London
Pseudocode	No	A full description of Lazy BFS is given in the appendix1.
Open Source Code	Yes	1Appendix: https://github.com/sarah-keren/HIS-IJCAI-23
Open Datasets	Yes	We use seven partially observable planning domains: WUMPUS, TRAIL, COLOR-BALLS, COLORBALLS-E, LOGISTICS, and UNIX [Bonet and Geffner, 2011; Albore et al., 2009]. The Appendix includes a description of all the domains and the complete dataset and code.
Dataset Splits	No	The paper states 'For each domain, we generated at least 100 benchmarks by randomly selecting the initial state and the set of items that are known to the helper' but does not specify how the data is split into train, validation, and test sets for the experiments.
Hardware Specification	Yes	All evaluations were run on an 11th Gen Intel Core i9-11900F @ 2.50GHz 16.
Software Dependencies	No	The paper mentions software like PDDL, pyperplan, K-planner, Fast-Downward, and Lm-cut heuristic, but does not provide specific version numbers for any of them.
Experiment Setup	Yes	We run each instance on the three approaches with a time limit of 10 minutes, a memory limit of 2548 MB, and 1000 explored nodes.