reproducibilityindex.ai

Continual Learning of Control Primitives : Skill Discovery via Reset-Games

Authors: Kelvin Xu, Siddharth Verma, Chelsea Finn, Sergey Levine

NeurIPS 2020 | Conference PDF | Archive PDF | Plain Text | LLM Run Details

Reproducibility Variable	Result	LLM Response
Research Type	Experimental	In this section we aim to experimentally answer the following questions: (1) How does our approach compare to prior methods in the reset free domain? (2) How do the skills learned by our approach compare to prior work? Speciﬁcally, can we learn better hierarchical controllers using the primitives learned by our approach (LSR)? We ﬁrst describe our experimental setup, evaluation metrics, and the prior methods to which we will compare.
Researcher Affiliation	Academia	Kelvin Xu 1, Siddharth Verma 1, Chelsea Finn2, Sergey Levine1 1 UC Berkeley, 2 Stanford University
Pseudocode	Yes	Algorithm 1 Learning Skillful Resets (LSR)
Open Source Code	Yes	2code is available at https://github.com/siddharthverma314/adversarial.git
Open Datasets	Yes	To study reset-free learning, we use the three-ﬁngered hand repositioning task proposed by Zhu et al. [54]... To evaluate our second experimental hypothesis, we evaluate skills acquired with reset-free learning for an ant locomotion task...
Dataset Splits	No	The paper describes experimental setups and evaluation metrics for different environments (DClaw and Ant), but does not explicitly provide specific percentages or counts for training, validation, and test data splits.
Hardware Specification	No	The paper does not provide specific details regarding the hardware (e.g., GPU/CPU models, memory specifications) used to conduct the experiments.
Software Dependencies	No	The paper mentions using 'soft actor-critic (SAC) [22]' and 'random network distillation [7]' but does not provide specific version numbers for these or other software dependencies.
Experiment Setup	No	We leave detailed discussion of hyperparameters and environment parameters to the Appendix 6. This indicates that these details are not provided in the main body of the paper provided.