On the Expressivity of Objective-Specification Formalisms in Reinforcement Learning
Authors: Rohan Subramani, Marcus Williams, Max Heitmann, Halfdan Holm, Charlie Griffin, Joar Max Viktor Skalse
ICLR 2024 | Conference PDF | Archive PDF | Plain Text | LLM Run Details
| Reproducibility Variable | Result | LLM Response |
|---|---|---|
| Research Type | Theoretical | All of our results are theoretical. All proofs can be found in Appendix B. |
| Researcher Affiliation | Academia | Rohan Subramania,b, , Marcus Williamsa, , Max Heitmanna,c, , Halfdan Holma, Charlie Griffina,c, Joar Skalsea,c a AI Safety Hub, b Columbia University, c University of Oxford *Equal contribution. Correspondence to Rohan Subramani at rs4126@columbia.edu. |
| Pseudocode | No | The paper does not contain structured pseudocode or algorithm blocks. |
| Open Source Code | No | The paper does not provide concrete access to source code for the methodology described. |
| Open Datasets | No | All of our results are theoretical. All proofs can be found in Appendix B. No datasets were used for training. |
| Dataset Splits | No | All of our results are theoretical. All proofs can be found in Appendix B. No dataset split information is provided as no experiments were conducted. |
| Hardware Specification | No | All of our results are theoretical. All proofs can be found in Appendix B. No hardware specifications are provided as no experiments were conducted. |
| Software Dependencies | No | All of our results are theoretical. All proofs can be found in Appendix B. No specific ancillary software details are provided as no experiments were conducted. |
| Experiment Setup | No | All of our results are theoretical. All proofs can be found in Appendix B. No experimental setup details are provided as no experiments were conducted. |