Notice: The reproducibility variables underlying each score are classified using an automated LLM-based pipeline, validated against a manually labeled dataset. LLM-based classification introduces uncertainty and potential bias; scores should be interpreted as estimates. Full accuracy metrics and methodology are described in [1].
Wasserstein barycenters can be computed in polynomial time in fixed dimension
Authors: Jason M. Altschuler, Enric Boix-Adserà
JMLR 2021 | Venue PDF | LLM Run Details
| Reproducibility Variable | Result | LLM Response |
|---|---|---|
| Research Type | Experimental | While the focus of this paper is theoretical, here we briefly mention that a slight variant of our algorithm can provide high-precision solutions at previously intractable problem sizes. To demonstrate this, we implement our algorithm for dimension d = 2 in Python. The only difference between our numerical implementation and the theoretical algorithm described above is that we use a standard cutting-plane method (see, e.g., (Bertsimas and Tsitsiklis, 1997, §6.3)) for the outer loop in step 1 rather than the Ellipsoid algorithm due to its good practical performance. Code and further implementation details are provided on GitHub. |
| Researcher Affiliation | Academia | Jason M. Altschuler EMAIL, Enric Boix-Adserà EMAIL, Laboratory for Information and Decision Systems, Massachusetts Institute of Technology, Cambridge, MA 02139 |
| Pseudocode | No | The paper describes the algorithm conceptually in Section 3 and its subsections, outlining the steps and theoretical foundations, but it does not include a distinct, structured block of pseudocode or a formally labeled algorithm figure. |
| Open Source Code | Yes | Code and further implementation details are provided on GitHub. |
| Open Datasets | Yes | Specifically, here we compare our barycenter algorithm against state-of-the-art methods on a standard benchmark dataset of images of nested ellipses (Cuturi and Doucet, 2014; Janati et al., 2020). |
| Dataset Splits | No | The paper describes generating 'k = 10 uniform distributions each on n = 20 points randomly drawn from [-1,1]² ⊂ ℝ²' and using 'k = 10 images, each of size 60 × 60' for experiments. However, it does not specify any training, validation, or test dataset splits in the conventional machine learning sense for reproducing experiments. |
| Hardware Specification | Yes | All experiments are run on a standard 2014 Lenovo Yoga 720-13IKB laptop. |
| Software Dependencies | No | The paper states, 'we implement our algorithm for dimension d = 2 in Python.' However, it does not provide specific version numbers for Python or any other libraries or frameworks used in the implementation, which are necessary for reproducible software dependencies. |
| Experiment Setup | No | The paper mentions that for comparison algorithms, 'IBP has an additional parameter: the entropic regularization γ, which significantly impacts the algorithm’s accuracy and numerical stability.' It also states that 'we binary search for the most accurate γ.' However, for the authors' own proposed algorithm, specific hyperparameters, training configurations, or system-level settings are not detailed in the main text. |
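The synthetic setup quoted in the Dataset Splits row (k = 10 uniform distributions, each on n = 20 points drawn from [-1,1]²) is simple to reconstruct. The sketch below is an illustrative reconstruction of that described setup, not the authors' released code:

```python
import numpy as np

rng = np.random.default_rng(0)
k, n = 10, 20  # as described: k = 10 distributions, n = 20 support points each

# Each distribution is uniform over n points drawn i.i.d. from [-1, 1]^2.
point_clouds = [rng.uniform(-1.0, 1.0, size=(n, 2)) for _ in range(k)]
weights = np.full(n, 1.0 / n)  # uniform weight on each support point
```

Any fixed seed reproduces one such random instance; the paper does not report the seed used, so exact point sets cannot be recovered.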
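The Experiment Setup row notes that the authors 'binary search for the most accurate γ' when tuning IBP's entropic regularization. One common reason such a search is well defined is that the entropic kernel exp(−C/γ) underflows to zero for small γ, making small γ numerically unstable while large γ is stable. The sketch below illustrates that bisection; `run_ibp` is a hypothetical stand-in that only mimics this underflow behavior, not an actual IBP solver:

```python
import numpy as np

def run_ibp(gamma, cost):
    """Hypothetical stand-in for an Iterative Bregman Projections run.
    Returns NaN when the entropic kernel exp(-cost/gamma) underflows to
    zero, mimicking the numerical instability small gamma causes."""
    kernel = np.exp(-cost / gamma)
    if not np.all(kernel > 0):  # underflow somewhere -> unstable run
        return float("nan")
    return gamma  # stand-in objective; a real run would return a cost

def smallest_stable_gamma(cost, lo=1e-6, hi=1.0, iters=40):
    """Bisect for the smallest gamma at which the run stays stable
    (smaller gamma generally means a more accurate, less regularized
    solution)."""
    for _ in range(iters):
        mid = (lo + hi) / 2.0
        if np.isnan(run_ibp(mid, cost)):
            lo = mid  # unstable: need more regularization
        else:
            hi = mid  # stable: try a smaller gamma
    return hi
```

A real tuning loop would replace `run_ibp` with a call to an actual solver (e.g. from an optimal-transport library) and bisect on whatever stability or accuracy criterion the experiment uses.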