Individual Planning in Infinite-Horizon Multiagent Settings: Inference, Structure and Scalability

Authors: Xia Qu, Prashant Doshi

NeurIPS 2015 | Conference PDF | Archive PDF | Plain Text | LLM Run Details

Reproducibility Variable Result LLM Response
Research Type Experimental Five variants of EM are evaluated as appropriate: the exact EM inference-based planning (labeled as I-EM); replacing the exact M-step with its greedy variant analogously to the greedy maximization in EM for POMDPs [12] (I-EM-Greedy); iterating EM based on coordinate blocks (I-EM-BCD) and coupled with a greedy M-step (I-EM-BCD-Greedy); and lastly, using forward filtering-backward sampling (I-EM-FFBS). In Fig. 2-I(a-d), we compare the variants on all problems.
Researcher Affiliation Collaboration Xia Qu Epic Systems Verona, WI 53593 quxiapisces@gmail.com Prashant Doshi THINC Lab, Dept. of Computer Science University of Georgia, Athens, GA 30622 pdoshi@cs.uga.edu
Pseudocode No The paper does not contain any sections explicitly labeled 'Pseudocode' or 'Algorithm', nor does it present any structured code-like blocks.
Open Source Code No The paper does not provide any specific links to open-source code repositories, nor does it state that the code for the described methodology is publicly available or included in supplementary materials.
Open Datasets Yes We use 4 problem domains: the noncooperative multiagent tiger problem [13]... A larger noncooperative 2-agent money laundering (ML) problem [14] forms the second domain... We also evaluate a 3-agent UAV reconnaissance problem... [8]. Finally, the recent policing protest problem is used... [15].
Dataset Splits No The paper describes the problem domains and model configurations but does not specify dataset splits (e.g., percentages or sample counts) for training, validation, or testing, nor does it refer to predefined splits from cited sources.
Hardware Specification Yes All experiments were run on Linux with Intel Xeon 2.6GHz CPUs and 32GB RAM.
Software Dependencies No The paper mentions the operating system ('Linux') but does not specify any software dependencies with version numbers (e.g., programming languages, libraries, or solvers).
Experiment Setup Yes We use 4 problem domains: the noncooperative multiagent tiger problem [13] (|S|= 2, |Ai|= |Aj|= 3, |Oi|= |Oj|= 6 for level l 1, |Oj|= 3 at level 0, and γ = 0.9)... using FSCs of sizes 5, 3, 9 and 5, for the 4 domains respectively demonstrated a good balance. We explored various coordinate block configurations eventually settling on 3 equal-sized blocks for both the tiger and ML, 5 blocks for UAV and 2 for policing protest.