reproducibilityindex.ai

REvolveR: Continuous Evolutionary Models for Robot-to-robot Policy Transfer

Authors: Xingyu Liu, Deepak Pathak, Kris Kitani

ICML 2022 | Conference PDF | Archive PDF | Plain Text | LLM Run Details

Reproducibility Variable	Result	LLM Response
Research Type	Experimental	Experiments on a physics simulator show that the proposed continuous evolutionary model can effectively transfer the policy across robots and achieve superior sample efficiency on new robots.
Researcher Affiliation	Academia	1The Robotics Institute, Carnegie Mellon University, Pittsburgh, PA 15213, USA. Correspondence to: Xingyu Liu <xingyul3@cs.cmu.edu>.
Pseudocode	Yes	Algorithm 1 Continuous Robot Evolution Policy Transfer
Open Source Code	Yes	Code is released at https: //github.com/xingyul/revolver.
Open Datasets	Yes	We showcase our REvolve R on three Mu Jo Co Gym environments (Brockman et al., 2016) with dense reward. We also experiment on Hand Manipulation Suite tasks (Rajeswaran et al., 2018) in sparse rewards setting.
Dataset Splits	No	The paper conducts experiments in simulated environments (Mu Jo Co Gym and Hand Manipulation Suite) where policies are trained and evaluated, but it does not describe using pre-defined dataset splits (e.g., train/validation/test percentages or counts) for a fixed dataset in the conventional sense.
Hardware Specification	No	The paper mentions using physics simulators (Mu Jo Co and pyBullet) and running experiments, but it does not provide any specific hardware details such as GPU models, CPU specifications, or memory configurations used for the experiments.
Software Dependencies	No	The paper mentions several software components and algorithms used (e.g., TD3, SAC, Mu Jo Co, pyBullet, NPG) but does not provide specific version numbers for any of them, which is necessary for reproducible software dependencies.
Experiment Setup	Yes	Experiment Hyperparameter Setting We illustrate the hyperparameters of the neural networks used in Gym and Hand Manipulation Suite experiments, including layer size, batch size and learning rate, in Table 3.