A General Offline Reinforcement Learning Framework for Interactive Recommendation
Authors: Teng Xiao, Donglin Wang (pp. 4512-4520)
AAAI 2021
| Reproducibility Variable | Result | LLM Response |
|---|---|---|
| Research Type | Experimental | We conduct extensive experiments on two public real-world datasets, demonstrating that the proposed methods can achieve superior performance over existing supervised learning and reinforcement learning methods for recommendation. |
| Researcher Affiliation | Academia | Teng Xiao, Donglin Wang Machine Intelligence Lab (MiLAB), AI Division, School of Engineering, Westlake University tengxiao01@gmail.com, wangdonglin@westlake.edu.cn |
| Pseudocode | No | The paper does not contain any clearly labeled pseudocode or algorithm blocks. |
| Open Source Code | No | The paper does not include any statement about releasing source code or provide a link to a code repository for the described methodology. |
| Open Datasets | Yes | We conduct extensive experiments on two public real-world datasets. RecSys: a public dataset released by the RecSys Challenge 2015 (https://recsys.acm.org/recsys15/challenge/) that contains sequences of user purchases and clicks. Kaggle: a dataset from a real-world e-commerce website (https://www.kaggle.com/retailrocket/ecommerce-dataset). |
| Dataset Splits | Yes | We randomly sample 80% sequences as the training set, 10% as validation and the rest as test set. (A split sketch follows this table.) |
| Hardware Specification | No | The paper does not provide specific details about the hardware used to run the experiments (e.g., GPU/CPU models, memory). |
| Software Dependencies | No | The paper does not provide specific version numbers for ancillary software dependencies. |
| Experiment Setup | No | The paper mentions that “Hyperparameters are tuned on validation set” and that methods “are based on the same backbone i.e., recurrent neural networks (RNN)”, but it does not provide specific hyperparameter values (e.g., learning rate, batch size, number of epochs) or detailed system-level training settings within the main text. |
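The paper reports only the 80%/10%/10% sequence-level split; it does not specify the random seed or the tooling used. A minimal sketch of such a split might look like the following (the function name, seed, and toy data are illustrative assumptions, not the authors' code):

```python
import random

def split_sequences(sequences, train_frac=0.8, valid_frac=0.1, seed=0):
    """Shuffle user sequences and split them into train/valid/test sets.

    The seed is an assumption for reproducibility; the paper does not report one.
    """
    rng = random.Random(seed)
    seqs = list(sequences)
    rng.shuffle(seqs)
    n = len(seqs)
    n_train = int(n * train_frac)
    n_valid = int(n * valid_frac)
    train = seqs[:n_train]
    valid = seqs[n_train:n_train + n_valid]
    test = seqs[n_train + n_valid:]  # remaining ~10%
    return train, valid, test

# Usage with 1,000 toy interaction sequences of item IDs
toy = [[i, i + 1, i + 2] for i in range(1000)]
train, valid, test = split_sequences(toy)
print(len(train), len(valid), len(test))  # 800 100 100
```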