Notice: The reproducibility variables underlying each score are classified using an automated LLM-based pipeline, validated against a manually labeled dataset. LLM-based classification introduces uncertainty and potential bias; scores should be interpreted as estimates. Full accuracy metrics and methodology are described in Coakley et alK. L. Coakley, T. Snelleman, H. Hoos, and O. E. Gundersen, "The embrace of open science: An analysis of a decade of AI research and 56 800 conference papers," Under Review, 2026..
Learning-augmented private algorithms for multiple quantile release
Authors: Mikhail Khodak, Kareem Amin, Travis Dick, Sergei Vassilvitskii
ICML 2023 | Venue PDF | LLM Run Details
| Reproducibility Variable | Result | LLM Response |
|---|---|---|
| Research Type | Experimental | We conclude with experiments on challenging tasks demonstrating that learning predictions across one or more instances can lead to large error reductions while preserving privacy. |
| Researcher Affiliation | Collaboration | 1Carnegie Mellon University; work done in part as an intern at Google Research New York. 2Google Research New York. |
| Pseudocode | Yes | Algorithm 2: Approximate Quantiles with predictions |
| Open Source Code | Yes | Code to reproduce our results is available at https://github.com/mkhodak/private-quantiles. |
| Open Datasets | Yes | We evaluate this approach... on Adult (Kohavi, 1996) and Goodreads (Wan & Mc Auley, 2018)... |
| Dataset Splits | Yes | Adult tests the D D1 case, with its train set the public dataset and a hundred samples from test as private. |
| Hardware Specification | No | The paper does not provide specific details about the hardware used for running experiments, such as exact GPU or CPU models, memory configurations, or specific cloud computing instance types. |
| Software Dependencies | No | The paper mentions various software components and packages used, such as COCOB (with a GitHub link), textstat (with a GitHub link), NLTK, and an adaptation of DP-FTRL from Google Research. However, it does not provide specific version numbers for any of these dependencies or the general programming environment (e.g., Python version, PyTorch/TensorFlow versions). |
| Experiment Setup | Yes | We use the following reasonable guesses for locations ν, scales σ, and quantile ranges ra, bs for these distributions: age: ν 40, σ 5, a 10, b 120; hours: ν 40, σ 2, a 0, b 168; rating: ν 2.5, σ 0.5, a 0, b 5; page count: ν 200, σ 25, a 0, b 1000 |