The Power of Resets in Online Reinforcement Learning

Authors: Zak Mhammedi, Dylan J Foster, Alexander Rakhlin

NeurIPS 2024 | Conference PDF | Archive PDF | Plain Text | LLM Run Details

Reproducibility Variable Result LLM Response
Research Type Theoretical This paper has only mathematical congtent. There are no experiments in this paper.
Researcher Affiliation Collaboration Zakaria Mhammedi Google Research mhammedi@google.com Dylan J. Foster Microsoft Research dylanfoster@microsoft.com Alexander Rakhlin MIT rakhlin@mit.edu
Pseudocode Yes Algorithm 1 Sim Golf: Global Optimism via Local Simulator Access
Open Source Code No This paper has only mathematical congtent. There are no experiments in this paper.
Open Datasets No This paper has only mathematical congtent. There are no experiments in this paper.
Dataset Splits No This paper has only mathematical congtent. There are no experiments in this paper.
Hardware Specification No This paper has only mathematical congtent. There are no experiments in this paper.
Software Dependencies No This paper has only mathematical congtent. There are no experiments in this paper.
Experiment Setup No This paper has only mathematical congtent. There are no experiments in this paper.