DPZero: Private Fine-Tuning of Language Models without Backpropagation
Authors: Liang Zhang, Bingcong Li, Kiran Koshy Thekumparampil, Sewoong Oh, Niao He
ICML 2024
| Reproducibility Variable | Result | LLM Response |
|---|---|---|
| Research Type | Experimental | The memory efficiency of DPZERO is demonstrated in privately fine-tuning RoBERTa and OPT on several downstream tasks. Our code is available at https://github.com/Liang137/DPZero. |
| Researcher Affiliation | Collaboration | 1) Department of Computer Science, ETH Zurich; 2) Amazon Search; 3) Paul G. Allen School of Computer Science and Engineering, University of Washington. |
| Pseudocode | Yes | Algorithm 1 (DPGD-0th) and Algorithm 2 (DPZERO) are provided. |
| Open Source Code | Yes | Our code is available at https://github.com/Liang137/DPZero. |
| Open Datasets | Yes | We provide empirical results on synthetic problems and private fine-tuning of language models for sentence classification and generation tasks. A thorough description of the experimental settings is available in Appendix B. All experiments are tested on a single NVIDIA GeForce RTX 3090 GPU with 24 GiB memory. Code is available at https://github.com/Liang137/DPZero. |
| Dataset Splits | Yes | We consider the few-shot scenario with 512 samples per class... The test set is also composed of 1000 randomly selected samples from the original test dataset. |
| Hardware Specification | Yes | All experiments are tested on a single NVIDIA GeForce RTX 3090 GPU with 24 GiB memory. |
| Software Dependencies | No | The paper states 'Our implementation of DPZERO utilizes the codebase provided by Malladi et al. (2023)', but does not specify software versions for programming languages, libraries, or frameworks such as Python, PyTorch, or CUDA. |
| Experiment Setup | Yes | We fix the total number of iterations to be 10000, the batch size to be 64, and the smoothing parameter λ = 10⁻³ for both DPZERO and the non-private zeroth-order baseline MeZO (Malladi et al., 2023). ... the learning rate to be 10⁻⁶ ... All results are averaged through three different random seeds {42, 13, 21}... |
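
To connect the pseudocode and experiment-setup rows above, the following is a minimal sketch of a DPZero-style update step, assuming the paper's description of Algorithm 2: a random direction shared across the batch, per-example clipping of the scalar finite difference, and Gaussian noise added to that single scalar. Only the smoothing parameter λ = 10⁻³, the learning rate 10⁻⁶, and the batch size come from the reported setup; `loss_fn`, `clip`, and `sigma` are illustrative placeholders, not the authors' exact implementation.

```python
import torch

def dpzero_step(params, loss_fn, batch, lam=1e-3, lr=1e-6, clip=1.0, sigma=1.0):
    """One DPZero-style update: shared direction, per-example scalar clipping, scalar noise."""
    # Shared random direction z ~ N(0, I) over all parameter tensors.
    z = [torch.randn_like(p) for p in params]

    def perturb(scale):
        # In-place shift of every parameter along the shared direction.
        with torch.no_grad():
            for p, zi in zip(params, z):
                p.add_(scale * lam * zi)

    # Per-example finite differences (zeroth-order directional derivatives).
    diffs = []
    for example in batch:
        perturb(+1)                          # theta + lam * z
        loss_plus = float(loss_fn(example))
        perturb(-2)                          # theta - lam * z
        loss_minus = float(loss_fn(example))
        perturb(+1)                          # back to theta
        diffs.append((loss_plus - loss_minus) / (2 * lam))

    # Clip each scalar difference, average, and add Gaussian noise to the scalar.
    clipped = [max(-clip, min(clip, d)) for d in diffs]
    noisy = (sum(clipped) + sigma * clip * torch.randn(()).item()) / len(batch)

    # Move along the shared direction, scaled by the noisy averaged scalar.
    with torch.no_grad():
        for p, zi in zip(params, z):
            p.add_(-lr * noisy * zi)
```

Because the noise is added to a single scalar rather than to a full gradient vector, the privacy noise does not scale with the model dimension, which is the dimension-independence property the paper emphasizes.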