reproducibilityindex.ai

A General Framework for Symmetric Property Estimation

Authors: Moses Charikar, Kirankumar Shiragur, Aaron Sidford

NeurIPS 2019 | Conference PDF | Archive PDF | Plain Text | LLM Run Details

Reproducibility Variable	Result	LLM Response
Research Type	Experimental	We performed two different sets of experiments for entropy estimation one to compare performance guarantees and the other to compare running times. In our pseudo PML approach, we divide the samples into two parts. We run the empirical estimate on one (this is easy) and the PML estimate on the other.
Researcher Affiliation	Academia	Moses Charikar Stanford University moses@cs.stanford.edu Kirankumar Shiragur Stanford University shiragur@stanford.edu Aaron Sidford Stanford University sidford@stanford.edu
Pseudocode	Yes	Algorithm 1 General Framework for Symmetric Property Estimation
Open Source Code	Yes	Our code is available at https://github.com/shiragur/Code For Pseudo PML.git
Open Datasets	No	The paper uses synthetic distributions (Mix 2 Uniforms, Zipf(0.5), Zipf(1)) with a specified domain size (N=10^5) for experiments, rather than explicitly referencing a publicly available dataset with concrete access information (link, DOI, formal citation).
Dataset Splits	Yes	In our pseudo PML approach, we divide the samples into two parts. ... Let x2n = (xn/2, xn/2), where xn/2 represent ﬁrst and last n samples of x2n respectively.
Hardware Specification	No	The paper does not provide specific hardware details (e.g., GPU/CPU models, memory, or cloud instance types) used for running the experiments.
Software Dependencies	No	The paper mentions using 'the heuristic algorithm in [PJW17]', but it does not specify any software dependencies with version numbers (e.g., Python, specific libraries, or exact versions of [PJW17]'s implementation).
Experiment Setup	Yes	In our algorithm we pick threshold = 18 (same as [WY16a]) and our set F = [0, 18] (input of Algorithm 1), i.e. we use the PML estimate on frequencies 18 and empirical estimate on the rest.