Reproducibility Index

Notice: The reproducibility variables underlying each score are classified using an automated LLM-based pipeline, validated against a manually labeled dataset. LLM-based classification introduces uncertainty and potential bias; scores should be interpreted as estimates. Full accuracy metrics and methodology are described in Coakley et al. (K. L. Coakley, T. Snelleman, H. Hoos, and O. E. Gundersen, "The embrace of open science: An analysis of a decade of AI research and 56 800 conference papers," under review, 2026).

EmbedMask: Embedding Coupling for Instance Segmentation

Authors: Hui Ying, Zhaojin Huang, Shu Liu, Tianjia Shao, Kun Zhou

IJCAI 2021 | Venue PDF | LLM Run Details

Reproducibility Variable	Result	LLM Response
Research Type	Experimental	Current instance segmentation methods can be categorized into segmentation-based methods and proposal-based methods. ... EmbedMask outperforms the state-of-the-art instance segmentation method Mask R-CNN on the challenging COCO dataset, obtaining more detailed masks at a higher speed. ... 4 Experiments 4.1 Experimental Settings We follow the settings of FCOS [Tian et al., 2019b] in our experiments, which chooses the large-scale detection benchmark COCO, and uses the COCO trainval35k split (115K images) for training, minival split (5K images) for ablation study and test-dev (20K images) for reporting the main results.
Researcher Affiliation	Collaboration	1 State Key Lab of CAD&CG, Zhejiang University; 2 Kuaishou; 3 SmartMore
Pseudocode	No	The paper describes the method using text, figures (network architecture diagrams), and equations, but does not include structured pseudocode or algorithm blocks.
Open Source Code	No	The paper does not provide an explicit statement about releasing its own source code or a direct link to a repository containing its implementation.
Open Datasets	Yes	We follow the settings of FCOS [Tian et al., 2019b] in our experiments, which chooses the large-scale detection benchmark COCO, and uses the COCO trainval35k split (115K images) for training, minival split (5K images) for ablation study and test-dev (20K images) for reporting the main results. ... COCO dataset [Lin et al., 2014]
Dataset Splits	Yes	We follow the settings of FCOS [Tian et al., 2019b] in our experiments, which chooses the large-scale detection benchmark COCO, and uses the COCO trainval35k split (115K images) for training, minival split (5K images) for ablation study and test-dev (20K images) for reporting the main results.
Hardware Specification	Yes	on an NVIDIA GeForce 2080 Ti GPU
Software Dependencies	No	The paper mentions frameworks and optimizers like FCOS, maskrcnn-benchmark, AdelaiDet, and SGD, but does not provide specific version numbers for these or other software dependencies such as Python, PyTorch, or CUDA.
Experiment Setup	Yes	Unless noted, the input images are resized with the shorter side being 800 while the longer side being no longer than 1333. ... We train all the models with SGD using an initial learning rate of 0.01 and batch size of 16, with constant warm-up of 500 iterations. ... The models are trained for 12 epochs (90k iterations) by default, but more epochs are applied when training with the ResNet-101 backbone. ... In the main results, we set embedding dim D = 16. ... λ1 = 0.5 by default. ... We find that when the threshold is 0.52 or 0.53, the mask AP is the highest. Hence we use 0.52 for the ResNet-50 backbone and 0.53 for the ResNet-101 backbone.
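The Experiment Setup row quotes a resize rule, a warm-up schedule, and an iteration budget. The sketch below illustrates how those numbers might be read; it is not the authors' code. The function names are hypothetical, the rounding behavior is an assumption, and the warm-up factor of 1/3 is an assumed value that the quoted text does not state.

```python
def resized_shape(height, width, short_side=800, max_long_side=1333):
    """Aspect-preserving resize: shorter side -> 800, longer side capped at 1333."""
    scale = short_side / min(height, width)
    # If scaling the shorter side to 800 pushes the longer side past the cap,
    # scale by the longer side instead.
    if max(height, width) * scale > max_long_side:
        scale = max_long_side / max(height, width)
    # Rounding to the nearest pixel is an assumption of this sketch.
    return round(height * scale), round(width * scale)


def constant_warmup_lr(iteration, base_lr=0.01, warmup_iters=500, warmup_factor=1 / 3):
    """Constant warm-up: a fixed reduced rate for the first 500 iterations.

    warmup_factor is an assumed value; the quoted text gives only the
    initial learning rate (0.01) and the warm-up length (500 iterations).
    """
    if iteration < warmup_iters:
        return base_lr * warmup_factor
    return base_lr


def approx_epochs(iterations=90_000, batch_size=16, train_images=115_000):
    """Passes over the training split implied by the iteration budget."""
    return iterations * batch_size / train_images


if __name__ == "__main__":
    print(resized_shape(480, 640))
    print(resized_shape(400, 1000))
    print(constant_warmup_lr(0), constant_warmup_lr(500))
    print(approx_epochs())
```

With the rounded figure of 115K training images, 90k iterations at batch size 16 come to roughly 12.5 passes over the data, which is consistent with the quoted "12 epochs (90k iterations)".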