Task Transfer by Preference-Based Cost Learning

Authors: Mingxuan Jing, Xiaojian Ma, Wenbing Huang, Fuchun Sun, Huaping Liu (pp. 2471-2478)

AAAI 2019 | Conference PDF | Archive PDF | Plain Text | LLM Run Details

Reproducibility Variable | Result | LLM Response
Research Type | Experimental | Extensive simulations on several benchmarks have been conducted to further verify the effectiveness of the proposed method.
Researcher Affiliation | Collaboration | Department of Computer Science and Technology, State Key Lab on Intelligent Technology and Systems, National Lab for Information Science and Technology (TNList), Tsinghua University, Beijing 100084, China; Tencent AI Lab, Shenzhen, Guangdong, China. Emails: {jmx16, maxj14}@mails.tsinghua.edu.cn, {fcsun, hpliu}@tsinghua.edu.cn, hwenbing@126.com
Pseudocode | Yes | Algorithm 1: Preference-based task transfer via Adversarial MaxEnt IRL
Open Source Code | No | The paper does not provide a link or explicit statement about open-sourcing the code for its described methodology.
Open Datasets | No | The paper mentions environments such as MuJoCo and OpenAI Gym and states that "initial demonstrations are generated by a well-trained PPO", but does not provide access information for a publicly available or open dataset.
Dataset Splits | No | The paper does not provide dataset split information (percentages, sample counts, or predefined splits) for training, validation, or testing.
Hardware Specification | No | The paper does not describe the hardware (e.g., CPU/GPU models, memory) used to run its experiments.
Software Dependencies | No | The paper mentions MuJoCo and OpenAI Gym but does not provide version numbers for these or other software dependencies.
Experiment Setup | No | The paper describes the general iterative process of the algorithm but does not provide specific experimental setup details such as hyperparameters (e.g., learning rate, batch size, optimizer settings) in the main text.
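The assessment above notes that the paper includes pseudocode (Algorithm 1, preference-based task transfer via adversarial MaxEnt IRL) and describes an iterative cost-learning / policy-improvement loop. The sketch below is a minimal, hypothetical illustration of preference-based cost learning in that general spirit: a linear cost model fit to pairwise trajectory preferences with a Bradley-Terry objective, alternating with a placeholder policy update. Every name here (featurize, sample_trajectories, CostModel, the toy rollout and preference oracle) is an assumption made for illustration; this is not the authors' Algorithm 1, which uses adversarial MaxEnt IRL with a PPO-trained policy.

```python
# Hypothetical sketch: alternately (1) update a cost function from trajectory
# preferences via a Bradley-Terry / logistic model, and (2) re-optimize the
# policy against the learned cost. Toy environment and names are illustrative.
import numpy as np

rng = np.random.default_rng(0)
FEAT_DIM = 4

def featurize(traj):
    """Trajectory feature expectation (here: mean of per-step features)."""
    return np.mean(traj, axis=0)

def sample_trajectories(policy_w, n=8, horizon=20):
    """Toy rollout: 'states' are random features nudged by the policy weights."""
    return [rng.normal(loc=0.1 * policy_w, size=(horizon, FEAT_DIM)) for _ in range(n)]

class CostModel:
    """Linear cost c(traj) = w^T phi(traj), trained from pairwise preferences."""
    def __init__(self):
        self.w = np.zeros(FEAT_DIM)

    def cost(self, traj):
        return float(self.w @ featurize(traj))

    def update_from_preferences(self, prefs, lr=0.1):
        # prefs: list of (better_traj, worse_traj) pairs.
        # Bradley-Terry: P(better preferred) = sigmoid(c(worse) - c(better)).
        for better, worse in prefs:
            diff = self.cost(worse) - self.cost(better)
            p = 1.0 / (1.0 + np.exp(-diff))
            grad = (1.0 - p) * (featurize(worse) - featurize(better))
            self.w += lr * grad  # gradient ascent on the preference log-likelihood

def improve_policy(policy_w, cost_model, lr=0.05):
    """Stand-in for the RL step (the paper uses PPO): shift the policy so that
    sampled trajectories have lower learned cost under the toy rollout model."""
    trajs = sample_trajectories(policy_w)
    grads = [cost_model.w * 0.1 for _ in trajs]  # approx. d(expected cost)/d(policy_w)
    return policy_w - lr * np.mean(grads, axis=0)

def oracle_preference(t1, t2):
    """Hypothetical preference oracle: prefers trajectories with smaller feature norm."""
    return (t1, t2) if np.linalg.norm(featurize(t1)) < np.linalg.norm(featurize(t2)) else (t2, t1)

policy_w = rng.normal(size=FEAT_DIM)
cost_model = CostModel()
for it in range(10):
    trajs = sample_trajectories(policy_w)
    prefs = [oracle_preference(trajs[i], trajs[i + 1]) for i in range(0, len(trajs) - 1, 2)]
    cost_model.update_from_preferences(prefs)
    policy_w = improve_policy(policy_w, cost_model)
    mean_cost = np.mean([cost_model.cost(t) for t in trajs])
    print(f"iter {it}: mean learned cost of sampled trajectories = {mean_cost:.3f}")
```

In the paper's setting, the toy rollout and the placeholder policy update would be replaced by MuJoCo / OpenAI Gym environments and a PPO learner, and the cost update would follow the adversarial MaxEnt IRL formulation rather than this plain Bradley-Terry gradient step.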