Reproducibility Index

Notice: The reproducibility variables underlying each score are classified using an automated LLM-based pipeline, validated against a manually labeled dataset. LLM-based classification introduces uncertainty and potential bias; scores should be interpreted as estimates. Full accuracy metrics and methodology are described in Coakley et alK. L. Coakley, T. Snelleman, H. Hoos, and O. E. Gundersen, "The embrace of open science: An analysis of a decade of AI research and 56 800 conference papers," Under Review, 2026..

Uncertain Knowledge Graph Completion via Semi-Supervised Confidence Distribution Learning

Authors: Tianxing Wu, Shutong Zhu, Jingting Wang, Ning Xu, Guilin Qi, Haofen Wang

NeurIPS 2025 | Venue PDF | LLM Run Details

Reproducibility Variable	Result	LLM Response
Research Type	Experimental	In this section, we present experiments to show the effectiveness and superiority of ss CDL on the UKG completion tasks of confidence prediction and link prediction. We also analyze the effects of CDL and meta self-training in ss CDL with ablation experiments, and further explore the performance of ss CDL on predicting the confidences of low-confidence triples.
Researcher Affiliation	Academia	1School of Computer Science and Engineering, Southeast University, China 2Key Laboratory of New Generation Artificial Intelligence Technology and its Interdisciplinary Applications (Southeast University), Ministry of Education, China 3College of Design and Innovation, Tongji University, China EMAIL
Pseudocode	Yes	Algorithm 1 gives the details on meta self-training of ss CDL.
Open Source Code	Yes	The source code of ss CDL is publicly available at: https://github.com/seucoin/un KR/tree/main/un KR_ss CDL.
Open Datasets	Yes	We conducted experiments on two widely used UKG datasets, i.e., NL27k extracted from NELL and CN15k extracted from Concept Net (more details are in Table 1).
Dataset Splits	Yes	We followed the setting of UKGE [3] to partition the dataset into 85% for training, 7% for validation, and 8% for testing.
Hardware Specification	Yes	ss CDL was implemented by Pytorch-lightning, and all the experiments were conducted on an RTX3090 GPU card.
Software Dependencies	No	ss CDL was implemented by Pytorch-lightning, and all the experiments were conducted on an RTX3090 GPU card. We used Adam optimizer [13] for SGD training.
Experiment Setup	Yes	The optimal hyper-parameters of ss CDL on NL27k are as follows: the standard deviation of Gaussian distribution σ = 0.6, the weight of pseudo labeled data wp = 0.7, the MSE weight β = 1, the learning rate α = 0.001, the margin γ = 0.1, the threshold for pseudo labeled data selection: 0.03, the batch size: 4096, and the embedding size: 128. The optimal hyper-parameters of ss CDL on CN15k are as follows: the standard deviation of Gaussian distribution σ = 0.6, the weight of pseudo labeled data wp = 0.3, the MSE weight β = 1, the learning rate α = 0.001, the margin γ = 0.1, the threshold for pseudo labeled data selection: 0.015, the batch size: 4096, and the embedding size: 512.