reproducibilityindex.ai

A Theory of Label Propagation for Subpopulation Shift

Authors: Tianle Cai, Ruiqi Gao, Jason Lee, Qi Lei

ICML 2021 | Conference PDF | Archive PDF | Plain Text | LLM Run Details

Reproducibility Variable	Result	LLM Response
Research Type	Experimental	We implement the popular consistency-based semi-supervised learning algorithm Fix Match (Sohn et al., 2020) on the subpopulation shift task from BREEDS (Santurkar et al., 2021), and compare it with popular distributional matching methods (Ganin et al., 2016; Zhang et al., 2019). Results show that the consistency-based method outperforms distributional matching methods by over 8%, partially verifying our theory on the subpopulation shift problem. We also show that combining distributional matching methods and consistency-based algorithm can improve the performance upon distributional matching methods on classic unsupervised domain adaptation datasets such as Office31 (Saenko et al., 2010) and Office-Home (Venkateswara et al., 2017).
Researcher Affiliation	Academia	1Princeton University 2Zhongguancun Haihua Institute for Frontier Information Technology. Correspondence to: Jason D. Lee <jasonlee@princeton.edu>.
Pseudocode	No	The paper describes the algorithm (e.g., equation (1) and (2)) in textual and mathematical form but does not include structured pseudocode or an algorithm block.
Open Source Code	No	We use the implementation from Junguang Jiang (2020), which shows that MDD has the best performance among the evaluated methods.
Open Datasets	Yes	We conduct experiments on a dataset that is constructed to simulate natural subpopulation shift. Towards this goal, we constructed an Unsupervised Domain Adaptation (UDA) task using the challenging ENTITY-30 task from BREEDS tasks (Santurkar et al., 2021)...
Dataset Splits	No	The paper mentions training on source and evaluating on target domains but does not explicitly provide details about train/validation/test dataset splits (e.g., percentages or sample counts) in the provided text.
Hardware Specification	No	The paper does not provide any specific details about the hardware used for running the experiments (e.g., GPU models, CPU types, or memory).
Software Dependencies	No	The paper mentions using specific algorithms and libraries like 'Fix Match', 'MDD', and 'SwAV', and references the 'Transferlearning-library', but does not provide specific version numbers for any software dependencies.
Experiment Setup	No	We defer the detailed experimental settings to Appendix C and report the results here.