reproducibilityindex.ai

Robust Inverse Constrained Reinforcement Learning under Model Misspecification

Authors: Sheng Xu, Guiliang Liu

ICML 2024 | Conference PDF | Archive PDF | Plain Text | LLM Run Details

Reproducibility Variable	Result	LLM Response
Research Type	Experimental	In this section, we empirically evaluate the efficacy of the proposed AR-ICRL algorithm in both discrete and continuous environments under transition dynamics mismatch. and Table 1 shows the evaluation results with large-scale noises.
Researcher Affiliation	Academia	1 School of Data Science, The Chinese University of Hong Kong, Shenzhen, Guangdong, 518172, P.R. China . Correspondence to: Guiliang Liu <liuguiliang@cuhk.edu.cn>.
Pseudocode	Yes	Algorithm 1 Adaptively Robust Inverse Constrained Reinforcement Learning (AR-ICRL) and Algorithm 2 Safety-Robust Policy Iteration and Algorithm 3 Safety-Robust Proximal Policy Optimization
Open Source Code	Yes	The code is available at https: //github.com/Jasonxu1225/AR-ICRL.
Open Datasets	Yes	Based on the ICRL benchmark (Liu et al., 2023), we conduct experiments on three continuous robot control tasks with predefined constraints, including Blocked Half-Cheetah, Blocked Ant, and Crippled Walker.
Dataset Splits	No	The paper discusses training and testing in different environments but does not provide specific training/validation/test dataset splits or their percentages/counts for data within those environments.
Hardware Specification	No	The paper does not provide specific hardware details such as exact GPU/CPU models, processor types, or memory used for running its experiments.
Software Dependencies	No	The paper refers to algorithms and methods like PPO but does not provide specific version numbers for software dependencies or libraries used in implementation.
Experiment Setup	Yes	Table 2. List of the utilized hyperparameters in this work. To ensure equitable comparisons, we maintain consistency in the parameters of the same neural networks across different models. and lists specific hyperparameter values in the table.