reproducibilityindex.ai

Tuning-free Estimation and Inference of Cumulative Distribution Function under Local Differential Privacy

Authors: Yi Liu, Qirui Hu, Linglong Kong

ICML 2024 | Conference PDF | Archive PDF | Plain Text | LLM Run Details

Reproducibility Variable	Result	LLM Response
Research Type	Experimental	Through mathematical proofs and extensive numerical testing, we demonstrate that our method achieves uniform and L2 error bounds when estimating the entire CDF curve. Computationally, we demonstrate that our constrained isotonic estimator can be efficiently computed deterministically, eliminating the need for hyperparameters or random optimization.
Researcher Affiliation	Academia	1Department of Mathematical and Statistical Sciences, University of Alberta, Edmonton, Canada 2Center for Statistical Science, Department of Industrial Engineering, Tsinghua University, Beijing, China.
Pseudocode	Yes	Algorithm 1 Constrained isotonic estimation
Open Source Code	No	The paper does not provide any statement or link indicating that the source code for the described methodology is publicly available.
Open Datasets	No	The paper uses data generated from standard statistical distributions (Uniform, Truncated normal, Continuous Bernoulli) for its experiments, rather than pre-existing public datasets with specific access information. Thus, there is no public dataset for which access information would be provided.
Dataset Splits	No	The paper conducts experiments by generating data from specified distributions and varying sample sizes (n spans from 10^3 to 10^7, with a total of 10,000 replications). It does not explicitly define or refer to standard train/validation/test dataset splits in percentages or counts for reproduction.
Hardware Specification	Yes	For n 10^7, it took less than 1s to execute on a single core of an AMD Threadripper PRO 3995WX CPU.
Software Dependencies	No	The paper does not list specific software dependencies with version numbers.
Experiment Setup	Yes	For the Truncated normal distribution, the parameters are set as µ = 1/2 and σ2 = 1/4. In the case of the Continuous Bernoulli distribution, the parameter λ is selected to be 1/4... We consider the truthful response rate r = 0.25, 0.5, 0.9, which means the privacy budget is ϵ = log((1 + r)/(1 r)) corresponding to 0.51, 1.09, 2.94 respectively. The sample size ranges n spans from 10^3 to 10^7, with a total of 10,000 replications.