Representation with Incomplete Votes

Authors: Daniel Halpern, Gregory Kehne, Ariel D. Procaccia, Jamie Tucker-Foltz, Manuel Wüthrich

AAAI 2023 | Conference PDF | Archive PDF | Plain Text | LLM Run Details

Reproducibility Variable Result LLM Response
Research Type Experimental Finally, an empirical evaluation using real data shows that the proposed algorithm provides representative outcomes in practice.In Section 5 we show empirically (on real datasets from Polis and Reddit) that this extension allows us to find committees satisfying (approximate) JR (and stronger properties) despite access to little information (i.e., few voters, each voting on only a small fraction of the comments).
Researcher Affiliation Academia Daniel Halpern, Gregory Kehne, Ariel D. Procaccia, Jamie Tucker-Foltz and Manuel W uthrich Harvard University
Pseudocode Yes Algorithm 1: (k, t)-α-PAVAlgorithm 2: (k, t)-noisy-α-PAV
Open Source Code No The paper does not provide a statement or link indicating that the source code for the developed algorithms (α-PAV, noisy-α-PAV, ucb-α-PAV) is publicly available.
Open Datasets Yes Polis provides open-use data from real deliberations hosted on their platform.7 These include, for instance, a discussion organized by the government of Taiwan, which led to the successful regulation of Uber.The second dataset we consider consists of Reddit discussions.9 To obtain an interesting dataset, we combined voting data from two subreddits, r/politics and r/Conservative, which are arguably situated at opposite ends of the American political spectrum.
Dataset Splits Yes We split the data into training, validation, and testing as follows: 80% for training, 10% for validation, and 10% for testing.
Hardware Specification Yes All experiments were run on a machine with an AMD Ryzen Threadripper 3970X CPU and a single NVIDIA GeForce RTX 3090 GPU.
Software Dependencies No The paper mentions using 'Lens Kit' as a matrix factorization library but does not provide specific version numbers for it or any other software dependencies crucial for reproducibility.
Experiment Setup Yes For all datasets, we assume that each voter votes on t = 20 comments. Since the total number of comments m ranges from 31 to 1719 across datasets, the percentage of comments each voter votes on, t/m, ranges from 1% to 65%. For each dataset, we run the algorithms with target committee sizes k = 5, 7, 10.For both Algorithm 2 and Algorithm 4 we treat ℓ, the number of times we ask voters about each candidate, as a parameter. In addition, for Algorithm 4, we replace the numerator in the confidence intervals errs with a parameter θ. Both ℓand θ were chosen based on validation on a separate dataset, see Appendix H for details.