Reproducibility Index

Notice: The reproducibility variables underlying each score are classified using an automated LLM-based pipeline, validated against a manually labeled dataset. LLM-based classification introduces uncertainty and potential bias; scores should be interpreted as estimates. Full accuracy metrics and methodology are described in Coakley et alK. L. Coakley, T. Snelleman, H. Hoos, and O. E. Gundersen, "The embrace of open science: An analysis of a decade of AI research and 56 800 conference papers," Under Review, 2026..

Learning End-to-End Scene Flow by Distilling Single Tasks Knowledge

Authors: Filippo Aleotti, Matteo Poggi, Fabio Tosi, Stefano Mattoccia10435-10442

AAAI 2020 | Venue PDF | LLM Run Details

Reproducibility Variable	Result	LLM Response
Research Type	Experimental	Exhaustive experiments show that i) DWARF runs at about 10 FPS on a single high-end GPU and about 1 FPS on NVIDIA Jetson TX2 embedded at KITTI resolution, with moderate drop in accuracy compared to 10 deeper models, ii) learning from many distilled samples is more effective than from the few, annotated ones available. 4 Experimental Results We report extensive experiments aimed at assessing the accuracy and performance of DWARF.
Researcher Affiliation	Academia	Filippo Aleotti, Matteo Poggi, Fabio Tosi, Stefano Mattoccia Department of Computer Science and Engineering (DISI) University of Bologna, Italy
Pseudocode	No	The paper describes the DWARF architecture with block diagrams (Figure 2) and mathematical formulations (e.g., Equation 1 and 2), but it does not include structured pseudocode or algorithm blocks.
Open Source Code	No	The paper does not provide any explicit statement about releasing source code or a link to a code repository for the methodology described.
Open Datasets	Yes	Flying Things 3D. We set α6 = 0.32, α5 = 0.08, α4 = 0.02, α3 = 0.01 and α2 = 0.005, γ = 0.0004 and crosstask weights to ϵ1 = 1, ϵ2 = 1 and ϵ3 = 0.5. Ground truth values are down-scaled to match the resolution of the level and scaled by a factor of 20, as done by Dosovitskiy et al.; Sun et al. (2015; 2017). KITTI 2015. We ﬁne-tuned the network using the 200 training images from the KITTI Scene Flow (Menze, Heipke, and Geiger 2015) dataset with a batch size of 4 for 50K steps.
Dataset Splits	Yes	Flying Things3D. This dataset provides 4248 frames for validation. In Table 1 we report average End-Point-Error (EPE) for the disparity, ﬂow and change (respectively D1, F1 and D2) on 3822 images, obtained by ﬁltering the validation set according to the guidelines. KITTI 2015. For this experiment, we split the KITTI 2015 training set into 160 images for ﬁne-tuning and reserve the last portion of 40 images for validation purposes only.
Hardware Specification	Yes	Exhaustive experiments show that i) DWARF runs at about 10 FPS on a single high-end GPU and about 1 FPS on NVIDIA Jetson TX2 embedded at KITTI resolution. Runtime analysis for different variants of DWARF on NVIDIA Jetson TX2 (using Max-Q, Max-P, Max-N conﬁgurations) and NVIDIA GTX 1080Ti. We gratefully acknowledge the support of NVIDIA Corporation with the donation of the Titan Xp GPU used for this research.
Software Dependencies	No	The paper mentions the use of Adam optimizer and Leaky ReLU activations, but does not specify software dependencies like deep learning frameworks (e.g., PyTorch, TensorFlow) with their version numbers.
Experiment Setup	Yes	Flying Things 3D. We set α6 = 0.32, α5 = 0.08, α4 = 0.02, α3 = 0.01 and α2 = 0.005, γ = 0.0004 and crosstask weights to ϵ1 = 1, ϵ2 = 1 and ϵ3 = 0.5. ... The network has been trained for 1.2M steps with a batch size of 4 randomly selecting crops with size 768 × 384, using Adam optimiser (Kingma and Ba 2014), with β1 = 0.9, β2 = 0.999 and initial learning rate of 10-4, which has been halved after 400K, 600K, 800K and 1M steps.