Notice: The reproducibility variables underlying each score are classified using an automated LLM-based pipeline, validated against a manually labeled dataset. LLM-based classification introduces uncertainty and potential bias; scores should be interpreted as estimates. Full accuracy metrics and methodology are described in Coakley et alK. L. Coakley, T. Snelleman, H. Hoos, and O. E. Gundersen, "The embrace of open science: An analysis of a decade of AI research and 56 800 conference papers," Under Review, 2026..
Goal Alignment: Re-analyzing Value Alignment Problems Using Human-Aware AI
Authors: Malek Mechergui, Sarath Sreedharan
AAAI 2024 | Venue PDF | LLM Run Details
| Reproducibility Variable | Result | LLM Response |
|---|---|---|
| Research Type | Experimental | For evaluation, we ran our method on a set of problems selected from standard IPC benchmark problems (International Planning Competition 2011). |
| Researcher Affiliation | Academia | Malek Mechergui, Sarath Sreedharan Colorado State University EMAIL |
| Pseudocode | Yes | Algorithm 1: An approximation-based algorithm to find a solution to a HAGL |
| Open Source Code | Yes | The code for the experiments can be found at: https://github.com/HAPILab/Goal Alignment. |
| Open Datasets | Yes | For evaluation, we ran our method on a set of problems selected from standard IPC benchmark problems (International Planning Competition 2011). International Planning Competition. 2011. IPC Competition Domains. https://goo.gl/i35bxc. |
| Dataset Splits | No | The paper discusses using "standard IPC benchmark problems" and creating "goal specification provided to the robot by randomly deleting a predicate from the goal specification" but does not provide specific dataset split information (e.g., train/validation/test percentages or counts) for reproduction. |
| Hardware Specification | Yes | All experiments were run on a linux Alma Linux 8.9 machine with 32GB ram and 16 Intel(R) Xeon(R) 2.60GHz CPUs. |
| Software Dependencies | No | The paper mentions using "Fast Downward planner" and "A-star search with LMcut heuristic" but does not provide specific version numbers for these software dependencies. |
| Experiment Setup | Yes | we set β to one for probability calculation. |