MetaCURL: Non-stationary Concave Utility Reinforcement Learning
Authors: Bianca Marin Moreno, Margaux Brégère, Pierre Gaillard, Nadia Oudjane
NeurIPS 2024 | Conference PDF | Archive PDF | Plain Text | LLM Run Details
| Reproducibility Variable | Result | LLM Response |
|---|---|---|
| Research Type | Theoretical | Justification: This is a theoretical paper, with no experiments. |
| Researcher Affiliation | Collaboration | Bianca Marin Moreno Inria Margaux Brégère Sorbonne Université Pierre Gaillard Inria Nadia Oudjane EDF R&D |
| Pseudocode | Yes | Algorithm 1 Meta CURL with EWA... Algorithm 2 EWA (Exponentially Weighted Average)... Algorithm 3 Online estimation of the probability kernel (ˆpt estimator)... Algorithm 4 Greedy MD-CURL |
| Open Source Code | No | Justification: This is a theoretical paper, with no experiments. The NeurIPS checklist states 'This paper does not release new assets.' and 'The answer NA means that the paper does not include experiments.' |
| Open Datasets | No | Justification: This is a theoretical paper, with no experiments or datasets. The NeurIPS checklist states 'The answer NA means that the paper does not include experiments.' |
| Dataset Splits | No | Justification: This is a theoretical paper, with no experiments or datasets. The NeurIPS checklist states 'The answer NA means that the paper does not include experiments.' |
| Hardware Specification | No | Justification: This is a theoretical paper, with no experiments. The NeurIPS checklist states 'The answer NA means that the paper does not include experiments.' |
| Software Dependencies | No | Justification: This is a theoretical paper, with no experiments. The NeurIPS checklist states 'The answer NA means that the paper does not include experiments.' |
| Experiment Setup | No | Justification: This is a theoretical paper, with no experiments. The NeurIPS checklist states 'The answer NA means that the paper does not include experiments.' |