reproducibilityindex.ai

Streaming Submodular Maximization with Differential Privacy

Authors: Anamay Chaturvedi, Huy Nguyen, Thy Dinh Nguyen

ICML 2023 | Conference PDF | Archive PDF | Plain Text | LLM Run Details

Reproducibility Variable	Result	LLM Response
Research Type	Experimental	Finally, we complement our theoretical analysis with experimental corroboration.In this section, we empirically evaluate our approach on a practical instance of private submodular maximization, k-medians clustering.We compare the performance of Algorithm 3 with Laplace and Gumbel noise on two data sets.In Figure 3 and Figure 6 we graph the clustering cost versus the cardinality constraint k on the Taxi and Synthetic data sets respectively.
Researcher Affiliation	Academia	1Khoury College of Computer Science, Northeastern University, Boston, USA.
Pseudocode	Yes	Algorithm 1 Streaming algorithm for monotone submodular maximization subject to a cardinality constraint, Badanidiyuru et al. (2014)input Monotonic submodular function f, cardinality constraint parameter k, element stream V , approximation parameter θ, estimate O of the optimum cost f(OPT)S for e V do if f(e\|S) O/(2k) and \|S\| < k then S S {e} end if end for output S
Open Source Code	Yes	The code used to run all experiments may be found at www.github.com/thydnguyen/Priv Submodular Opt.
Open Datasets	Yes	First, following Mitrovic et al. (2017); Chaturvedi et al. (2021) we use the Uber data set (Five Thirty Eight, 2019) of Uber cab pick ups in Manhattan for the month of April 2014;... URL https://www.kaggle.com/datasets/fivethirtyeight/uber-pickups-in-new-york-city.
Dataset Splits	No	The paper does not provide specific train/validation/test dataset splits with percentages, counts, or a detailed splitting methodology for their own experiments.
Hardware Specification	Yes	All experiments were performed on a PC with 5.2 GHz i9 chip and 64 GB RAM.
Software Dependencies	No	The paper does not explicitly list specific software dependencies with version numbers (e.g., Python, PyTorch, TensorFlow versions).
Experiment Setup	Yes	For both data sets, we set δ = 1/\|P\|1.5 and θ = 0.2.We set E = min (k log n/ε, \|P\|/2) for the private algorithms, and E = min(maxei V f(ei), k log n/ε, \|P\|/2) for the non-private algorithm.For this experiment, we apply basic composition and set ε = ε/T, δ = δ/T.