Byzantine Machine Learning Made Easy By Resilient Averaging of Momentums
Authors: Sadegh Farhadkhani, Rachid Guerraoui, Nirupam Gupta, Rafael Pinot, John Stephan
ICML 2022
| Reproducibility Variable | Result | LLM Response |
|---|---|---|
| Research Type | Experimental | We also present an empirical evaluation of the practical relevance of RESAM. We report on a comprehensive set of experiments evaluating RESAM on benchmark image classification tasks: MNIST, Fashion-MNIST, and CIFAR-10. |
| Researcher Affiliation | Academia | Sadegh Farhadkhani 1 Rachid Guerraoui 1 Nirupam Gupta 1 Rafael Pinot 1 John Stephan 1 1Distributed Computing Laboratory (DCL), School of Computer and Communication Sciences, École Polytechnique Fédérale de Lausanne (EPFL), Switzerland. |
| Pseudocode | Yes | Algorithm 1: Distributed SGD using distributed momentum and an (f, λ)-resilient averaging rule F *(a minimal code sketch follows the table)* |
| Open Source Code | Yes | Additional plots and code base to reproduce our experiments are available in the supplementary material. Our implementation will also be made accessible online. |
| Open Datasets | Yes | We use MNIST (Le Cun & Cortes, 2010), Fashion MNIST (Xiao et al., 2017), and CIFAR-10 (Krizhevsky et al., 2009). |
| Dataset Splits | No | The paper uses standard datasets (MNIST, Fashion-MNIST, CIFAR-10) that come with predefined train/test splits, and it reports 'top-1 cross-accuracy' for evaluation, but the provided text does not explicitly state split percentages or sample counts for training, validation, and testing. |
| Hardware Specification | No | The paper does not provide specific details about the hardware used for running the experiments. |
| Software Dependencies | No | The paper does not provide specific software dependencies (e.g., library or framework names with version numbers) used for the experiments. |
| Experiment Setup | Yes | For MNIST and Fashion-MNIST... we use a constant learning rate γ = 0.75, and a clipping parameter C = 2. We also add an ℓ2-regularization factor of 10⁻⁴. Finally, we use a mini-batch size of b = 25. For CIFAR-10... We set n = 25, γ = 0.25, C = 5, and b = 50. ... Finally, we vary the momentum coefficient β in {0, 0.6, 0.8, 0.9, 0.99, 0.999}. *(restated as a config sketch below)* |
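As a companion to the Pseudocode row above, here is a minimal sketch of distributed SGD with per-worker momentum aggregated by a resilient averaging rule. It assumes coordinate-wise trimmed mean as the (f, λ)-resilient rule F (one of several rules the paper covers), a common exponential-moving-average momentum form, and an in-process simulation of honest workers only; `trimmed_mean`, `resilient_momentum_sgd`, and the worker interface are illustrative names, not the authors' code.

```python
import numpy as np

def trimmed_mean(vectors, f):
    """Coordinate-wise trimmed mean: per coordinate, discard the f
    smallest and f largest values across workers, then average the
    rest. One example of an (f, lambda)-resilient averaging rule."""
    stacked = np.sort(np.stack(vectors), axis=0)  # sort each coordinate over workers
    return stacked[f:len(vectors) - f].mean(axis=0)

def resilient_momentum_sgd(theta, grad_fns, steps, gamma=0.75, beta=0.9, f=3):
    """Sketch of the Algorithm 1 loop: each worker keeps a local
    momentum of its stochastic gradients; the server aggregates the
    momentums with the resilient rule F and takes a step. Byzantine
    workers would send arbitrary vectors instead of their momentums
    (not simulated here)."""
    momentums = [np.zeros_like(theta) for _ in grad_fns]
    for _ in range(steps):
        for i, grad_fn in enumerate(grad_fns):
            g = grad_fn(theta)  # worker i's stochastic gradient at the current model
            momentums[i] = beta * momentums[i] + (1 - beta) * g
        theta = theta - gamma * trimmed_mean(momentums, f)
    return theta
```

For instance, with `grad_fns` built from 25 noisy gradient estimators of a simple quadratic, the trimmed mean tolerates up to f adversarially replaced momentum vectors per step; that tolerance is exactly what the (f, λ)-resilience property formalizes.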
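For reference, the Experiment Setup row can be restated as a structured configuration. The values are taken verbatim from the quoted text; the dictionary layout and key names are our own, and the paper's elided details (the "..." spans) are not filled in.

```python
# Hyperparameters quoted in the Experiment Setup row; key names are
# illustrative, values are the paper's.
EXPERIMENT_SETUP = {
    "mnist_fashion_mnist": {
        "learning_rate": 0.75,      # gamma (constant)
        "clipping": 2.0,            # C
        "l2_regularization": 1e-4,
        "batch_size": 25,           # b
    },
    "cifar10": {
        "num_workers": 25,          # n
        "learning_rate": 0.25,      # gamma
        "clipping": 5.0,            # C
        "batch_size": 50,           # b
    },
    "momentum_coefficients": [0, 0.6, 0.8, 0.9, 0.99, 0.999],  # beta sweep
}
```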