Notice: The reproducibility variables underlying each score are classified using an automated LLM-based pipeline, validated against a manually labeled dataset. LLM-based classification introduces uncertainty and potential bias; scores should be interpreted as estimates. Full accuracy metrics and methodology are described in Coakley et alK. L. Coakley, T. Snelleman, H. Hoos, and O. E. Gundersen, "The embrace of open science: An analysis of a decade of AI research and 56 800 conference papers," Under Review, 2026..
Learning to Discover Skills through Guidance
Authors: HYUNSEUNG KIM, BYUNG KUN LEE, Hojoon Lee, Dongyoon Hwang, Sejik Park, Kyushik Min, Jaegul Choo
NeurIPS 2023 | Venue PDF | LLM Run Details
| Reproducibility Variable | Result | LLM Response |
|---|---|---|
| Research Type | Experimental | Empirical evaluation demonstrates that DISCO-DANCE outperforms other USD baselines in challenging environments, including two navigation benchmarks and a continuous control benchmark. |
| Researcher Affiliation | Collaboration | 1Kim Jaechul Graduate School of AI, KAIST. 2KAKAO Corp. |
| Pseudocode | Yes | Algorithm 1: Skill Discovery through Guidance |
| Open Source Code | Yes | Qualitative visualizations and code of DISCO-DANCE are available at https://mynsng.github.io/discodance/. |
| Open Datasets | Yes | 2D mazes: We used the publicly available environment code in [7]. Ant mazes: We used the publicly available environment code in [8]. Deepmind Control suite: We used the publicly available environment code in [22, 53]. |
| Dataset Splits | No | The paper describes training steps and finetuning steps in RL environments, but does not specify distinct training, validation, and test *dataset splits* with percentages or sample counts. |
| Hardware Specification | Yes | We conducted all experiments using a single RTX 3090 GPU, and each experiment requires approximately up to 2GB of memory. |
| Software Dependencies | No | The paper mentions software like SAC, DIAYN, APS, SMM, and URLB, and specific environment code, but does not provide specific version numbers for these software components or programming languages. |
| Experiment Setup | Yes | Table 4: A common set of hyperparameters used in DISCO-DANCE. Table 5: Per environment sets of hyperparameters used in DISCO-DANCE. |