reproducibilityindex.ai

Solving MDPs with Skew Symmetric Bilinear Utility Functions

Authors: Hugo Gilbert, Olivier Spanjaard, Paolo Viappiani, Paul Weng

IJCAI 2015 | Conference PDF | Archive PDF | Plain Text | LLM Run Details

Reproducibility Variable	Result	LLM Response
Research Type	Experimental	Finally, we present and discuss experimental results where SSB-optimal policies are computed for a popular TV contest according to several instantiations of SSB utility functions.
Researcher Affiliation	Academia	1Sorbonne Universit es, UPMC Univ Paris 06, UMR 7606, LIP6, F-75005, Paris, France 2CNRS, UMR 7606, LIP6, F-75005, Paris, France 3SYSU-CMU Joint Institute of Engineering, Guangzhou, China 4SYSU-CMU Shunde International Joint Research Institute, Shunde, China
Pseudocode	Yes	Algorithm 1: Double Oracle Algorithm
Open Source Code	No	The paper does not provide an explicit link to open-source code for the methodology described.
Open Datasets	Yes	We used the two models of the Spanish 2003 version of the game presented by Perea and Puerto [2007].
Dataset Splits	No	The paper does not provide specific details about training, validation, or test dataset splits.
Hardware Specification	Yes	All times are wall-clock times on a 2,4 GHz Intel Core i5 machine with 8G main memory.
Software Dependencies	Yes	Our implementation is in Python, with an external call to GUROBI version 5.6.3 in order to solve the linear programs required to ﬁnd the Nash equilibria.
Experiment Setup	Yes	We computed the optimal policies for the two models according to several instantiations of the SSB utility function: the expectation (Exp), probabilistic dominance (PD), threshold probability (Th) criteria (threshold set to 2700) and a risk averse SSB utility function (RA) deﬁned by ϕRA(x, y) = (x − y)/(x + y) − 2/3