Formal Explanations of Neural Network Policies for Planning
Authors: Renee Selvey, Alban Grastien, Sylvie Thiébaux
IJCAI 2023 | Conference PDF | Archive PDF | Plain Text | LLM Run Details
| Reproducibility Variable | Result | LLM Response |
|---|---|---|
| Research Type | Experimental | We present experimental results of our implementation of this approach for ASNet policies for classical planning domains. |
| Researcher Affiliation | Academia | 1School of Computing, The Australian National University 2LAAS-CNRS, ANITI, Universit e de Toulouse |
| Pseudocode | Yes | Algorithm 1 Computing a minimal explanation for a sequence of decisions. |
| Open Source Code | Yes | For reproducibility, our repository https://github.com/Renee Selvey/policy-explanations provides our algorithm implementation, benchmarks used, learnt policies, and the scripts to learn them and run the experiments. |
| Open Datasets | Yes | We took all deterministic domains and training instances from the code distributions of [Toyer et al., 2020] and [Steinmetz et al., 2022]. |
| Dataset Splits | No | The paper mentions generating problems for evaluation but does not specify a validation split or its details for the experimental data used in this paper. |
| Hardware Specification | Yes | All experiments were run on a machine with an AMD Ryzen Threadripper 3990X CPU, with 64 cores/128 threads, a clock speed of 2.9 GHz base, 4.3 GHz max boost, and 128 GB of memory of which we used 64 GB. |
| Software Dependencies | Yes | Gurobi version 9.1.2 is the MIP solver used for the experiments. |
| Experiment Setup | Yes | To ensure the model is accurate enough for our experiments, we set the integer feasibility tolerance (Int Feas Tol) to 10 9 and the error for function approximations (Func Piece Error) to 10 6. ... Each explanation problem was run with a time limit of 3h, except for Gripper for which the timeout was 4h. |