Reproducibility Index

Notice: The reproducibility variables underlying each score are classified using an automated LLM-based pipeline, validated against a manually labeled dataset. LLM-based classification introduces uncertainty and potential bias; scores should be interpreted as estimates. Full accuracy metrics and methodology are described in Coakley et alK. L. Coakley, T. Snelleman, H. Hoos, and O. E. Gundersen, "The embrace of open science: An analysis of a decade of AI research and 56 800 conference papers," Under Review, 2026..

FADRM: Fast and Accurate Data Residual Matching for Dataset Distillation

Authors: Jiacheng Cui, Xinyue Bi, Yaxin Luo, Xiaohan Zhao, Jiacheng Liu, Zhiqiang Shen

NeurIPS 2025 | Venue PDF | LLM Run Details

Reproducibility Variable	Result	LLM Response
Research Type	Experimental	Exhaustive empirical evaluations of our proposed FADRM on CIFAR-100 [15], Tiny-Image Net [16], Image Net-1K [8] and its subset demonstrate that it not only accelerates the dataset distillation process by 50% but also achieves superior accuracy that beats all previous state-of-the-art methods on both accuracy and generation speed.
Researcher Affiliation	Academia	1VILA Lab, MBZUAI 2University of Ottawa
Pseudocode	Yes	Algorithm 1 FADRM: Residual Matching for Dataset Distillation
Open Source Code	Yes	Code: FADRM (Git Hub)
Open Datasets	Yes	We conduct experiments across datasets with varying resolutions, including CIFAR-100 (32 32) [15], Tiny-Image Net (64 64) [16], Image Net-1K (224 224) [8], and their subsets.
Dataset Splits	Yes	Each distilled dataset is used to train a student model from scratch, and the resulting model is evaluated on the original test set of the corresponding dataset (e.g., CIFAR-100, Tiny-Image Net, Image Net-1K).
Hardware Specification	Yes	Total training hours on a single RTX4090 vs. test set accuracy, comparing prior stateof-the-art methods with our proposed framework (+ denotes multi-model distillation).
Software Dependencies	No	For Image Net-1K, we employed the official Py Torch pretrained models, which have been extensively trained on the full Image Net-1K dataset.
Experiment Setup	Yes	Hyperparameters for Training the Original Dataset Optimizer SGD Learning Rate 0.1 Weight Decay 1e-4 Momentum 0.9 Batch Size 128 Loss Function Cross-Entropy Epochs 300 Augmentation Random Resized Crop, Horizontal Flip, Cut Mix Table 14: Hyperparameters for Training the Original Dataset.