reproducibilityindex.ai

Learning Causal Dynamics Models in Object-Oriented Environments

Authors: Zhongwei Yu, Jingqing Ruan, Dengpeng Xing

ICML 2024 | Conference PDF | Archive PDF | Plain Text | LLM Run Details

Reproducibility Variable	Result	LLM Response
Research Type	Experimental	Experiments on large-scale tasks indicate that OOCDM outperforms existing CDMs in terms of causal discovery, prediction accuracy, generalization, and computational efficiency.
Researcher Affiliation	Academia	1Institute of Automation, Chinese Academy of Sciences, Beijing, P. R. China 2School of Artificial Intelligence, University of Chinese Academy of Sciences, Beijing, P. R. China.
Pseudocode	Yes	Algorithm 1 Object-oriented causal discovery (Appendix E.3), Algorithm 2 Learning Object-oriented Causal Dynamics Model (Appendix E.4), and Algorithm 3 Planning with Cross Entropy Method (Appendix E.5) are provided.
Open Source Code	Yes	The source code of this work is made available at https://github.com/Ease Onway/oocdm.
Open Datasets	Yes	The Collect-Mineral-Shards (CMS) and Defeat-Zerglings-Baineling (DZB) environments are Star Craft II mini-games (Vinyals et al., 2017). The environment is adapted from the Walker2D environment, which is based on Mujoco, a popular physics simulator in the Open AI Gym platform (Brockman et al., 2016).
Dataset Splits	No	The paper discusses "i.d. (in-distribution) test data" and "o.o.d. (out-of-distribution) test data" for evaluation, but there is no explicit mention of a separate "validation" dataset split.
Hardware Specification	Yes	All models are trained and evaluated using one GPU (NVIDIA TITAN XP).
Software Dependencies	No	The paper mentions using "Python and Py Torch" but does not provide specific version numbers for these software components.
Experiment Setup	Yes	Table 14. The main hyper-parameters used in our experiments. This table lists various specific hyperparameters like de, dk, dv, dh, ε, nplan, H, α, β, λ, γ, k, k', niter, and nbatch for different environments.