reproducibilityindex.ai

LTLf/LDLf Non-Markovian Rewards

Authors: Ronen Brafman, Giuseppe De Giacomo, Fabio Patrizi

AAAI 2018 | Conference PDF | Archive PDF | Plain Text | LLM Run Details

Reproducibility Variable	Result	LLM Response
Research Type	Theoretical	Building on recent progress in temporal logics over ﬁnite traces, we adopt LDLf for specifying non-Markovian rewards and provide an elegant automata construction for building a Markovian model, which extends that of previous work and offers strong minimality and compositionality guarantees. The aim of this paper is to bring to bear developments in the theory of temporal logic over ﬁnite traces to the problem of specifying and solving MDPs with non-Markovian rewards. With these tools, which were unavailable to earlier work, we are able to provide a cleaner, more elegant approach that builds on well understood semantics, much more expressive languages, and enjoys good algorithmic properties.
Researcher Affiliation	Academia	Ronen I. Brafman Ben-Gurion University, Beer-Sheva, Israel brafman@cs.bgu.ac.il Giuseppe De Giacomo, Fabio Patrizi Sapienza Universit a di Roma, Italy {degiacomo,patrizi}@dis.uniroma1.it
Pseudocode	Yes	1: algorithm LDLf 2NFA 2: input LDLf formula ϕ 3: output NFA Aϕ = (2P, Q, q0, δ, F) 4: q0 {ϕ} 5: F { } 6: if ( (ϕ, ϵ) = true) then 7: F F {q0} 8: Q {q0, }, δ 9: while (Q or δ change) do 10: for (q Q) do 11: if (q (ψ q) (ψ, Θ) then 12: Q Q {q } 13: δ δ {(q, Θ, q )} 14: if ( (ψ q ) (ψ, ϵ) = true) then 15: F F {q } (Figure 1: LDLf 2NFA algorithm)
Open Source Code	No	The paper does not contain any statement or link indicating that open-source code for the described methodology is provided.
Open Datasets	No	This is a theoretical paper that does not conduct empirical experiments or use datasets.
Dataset Splits	No	This is a theoretical paper that does not conduct empirical experiments or use datasets with train/validation/test splits.
Hardware Specification	No	This is a theoretical paper that does not describe empirical experiments, and therefore no hardware specifications are mentioned.
Software Dependencies	No	This is a theoretical paper that does not describe empirical experiments or specific software implementations with version numbers.
Experiment Setup	No	This is a theoretical paper that does not describe empirical experiments or their setup details, such as hyperparameters or training configurations.