High-Quality Prediction Intervals for Deep Learning: A Distribution-Free, Ensembled Approach

Authors: Tim Pearce, Alexandra Brintrup, Mohamed Zaki, Andy Neely

ICML 2018

| Reproducibility Variable | Result | LLM Response |
|---|---|---|
| Research Type | Experimental | Benchmark experiments show the method outperforms the current state-of-the-art uncertainty quantification methods, reducing average PI width by over 10%. |
| Researcher Affiliation | Academia | 1 Department of Engineering, University of Cambridge, UK; 2 Alan Turing Institute, UK. Correspondence to: Tim Pearce <tp424@cam.ac.uk>. |
| Pseudocode | Yes | Algorithm 1: Construction of loss function using basic operations. |
| Open Source Code | Yes | Code is made available online: https://github.com/TeaPearce |
| Open Datasets | Yes | Experiments were run across ten open-access datasets. |
| Dataset Splits | No | The paper states that experiments were run "across ten open-access datasets" and mentions a "validation" phase during neural network training, but it does not specify explicit training/validation/test splits (e.g., percentages or counts) needed for reproduction. |
| Hardware Specification | No | The paper does not provide specific details about the hardware used to run the experiments, such as CPU or GPU models. |
| Software Dependencies | No | The paper mentions "modern NN APIs" and hints at Python via the GitHub link, but it does not name specific software with version numbers (e.g., "PyTorch 1.9" or "TensorFlow 2.x"). |
| Experiment Setup | Yes | Setting s = 160 worked well in the experiments in Section 6, requiring no alteration across datasets. Models were asked to output 95% PIs and used five NNs per ensemble. |
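To make the Pseudocode and Experiment Setup rows concrete, below is a minimal NumPy sketch of a quality-driven prediction-interval loss in the spirit of the paper's Algorithm 1: it combines the mean width of intervals that capture their target with a penalty when coverage (PICP) falls below the nominal level. The softening factor `s = 160` mirrors the setting quoted above; the penalty weight `lam` and the exact penalty scaling are illustrative assumptions, not values confirmed by this report.

```python
import numpy as np

def qd_loss(y_lower, y_upper, y_true, alpha=0.05, lam=15.0, s=160.0):
    """Sketch of a quality-driven PI loss.

    y_lower, y_upper : predicted interval bounds (arrays of shape (n,))
    y_true           : observed targets
    alpha            : significance level; target coverage is 1 - alpha (95% PIs)
    lam              : penalty weight (illustrative assumption)
    s                : sigmoid softening factor; s = 160 as quoted in the report
    """
    n = len(y_true)
    # Soft indicator of capture: ~1 when y_lower <= y_true <= y_upper,
    # made differentiable via sigmoids scaled by s.
    k_soft = (1.0 / (1.0 + np.exp(-s * (y_true - y_lower)))) \
           * (1.0 / (1.0 + np.exp(-s * (y_upper - y_true))))
    picp = k_soft.mean()  # prediction interval coverage probability
    # Mean width of the intervals that actually capture their target.
    mpiw_capt = np.sum((y_upper - y_lower) * k_soft) / (np.sum(k_soft) + 1e-6)
    # Penalize only coverage below the (1 - alpha) target.
    penalty = (n / (alpha * (1.0 - alpha))) * max(0.0, (1.0 - alpha) - picp) ** 2
    return mpiw_capt + lam * penalty
```

With full coverage the penalty term vanishes and the loss reduces to the captured interval width, so narrower intervals that still cover the data score lower; the paper's actual implementation builds the same idea from differentiable operations so it can be trained with standard NN APIs.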