reproducibilityindex.ai

Explain Your Move: Understanding Agent Actions Using Specific and Relevant Feature Attribution

Authors: Nikaash Puri, Sukriti Verma, Piyush Gupta, Dhruv Kayastha, Shripad Deshmukh, Balaji Krishnamurthy, Sameer Singh

ICLR 2020 | Conference PDF | Archive PDF | Plain Text | LLM Run Details

Reproducibility Variable	Result	LLM Response
Research Type	Experimental	We compare SARFA with existing approaches on agents trained to play board games (Chess and Go) and Atari games (Breakout, Pong and Space Invaders). We show through illustrative examples (Chess, Atari, Go), human studies (Chess), and automated evaluation methods (Chess) that SARFA generates saliency maps that are more interpretable for humans than existing approaches. For instance, in our human studies, SARFA improves accuracy of solving chess puzzles by nearly 25% and reduces the time taken by 31% over existing approaches.
Researcher Affiliation	Collaboration	Media and Data Science Research, Adobe Systems Inc., Noida, Uttar Pradesh, India 201301 Indian Institute of Technology Kharagpur, West Bengal, India 721302 Indian Institute of Technology Madras, Chennai, India 600036 Department of Computer Science, University of California, Irvine, California, USA
Pseudocode	No	The paper provides mathematical equations to describe the SARFA method but does not include any pseudocode or algorithm blocks.
Open Source Code	Yes	For the code release and demo videos, see https://nikaashpuri.github.io/sarfa-saliency/. All of our code and more detailed results are available in our Github repository: https://nikaashpuri.github.io/sarfa-saliency/.
Open Datasets	Yes	For experiments on chess, we use the Stockﬁsh agent (https://stockfishchess.org/). For experiments on Go, we use the pre-trained Mini Go RL agent (https://github.com/tensorﬂow/minigo). ... To automatically compare the saliency maps generated by different perturbation-based approaches, we introduce a Chess saliency dataset. ... The complete dataset is available in our Github repository: https://nikaashpuri.github.io/sarfa-saliency/.
Dataset Splits	No	The paper mentions using pre-trained agents and creating a 'Chess saliency dataset' consisting of 100 chess puzzles, but it does not specify any explicit training, validation, or test dataset splits (e.g., percentages, sample counts, or citations to standard splits) needed for reproduction.
Hardware Specification	No	The paper mentions using pre-trained agents (Stockfish, Mini Go, Atari agents) and their associated algorithms, but it does not provide any specific hardware details (e.g., CPU/GPU models, memory, or cloud instance types) for the machines used to conduct the experiments described in the paper.
Software Dependencies	Yes	For experiments on chess, we use the Stockﬁsh 10 agent: https://stockﬁshchess.org/.
Experiment Setup	Yes	For chess and Go, we perturb the board position by removing one piece at a time. We do not remove a piece if the resulting position is illegal. For instance, in chess, we do not remove the king. For Atari, we use the perturbation technique described in Greydanus et al. (2018). The technique perturbs the input image by adding a Gaussian blur localized around a pixel. The blur is constructed using the Hadamard product to interpolate between the original input image and a Gaussian blur.