reproducibilityindex.ai

A Unified Framework for 3D Scene Understanding

Authors: Wei Xu, Chunsheng Shi, Sifan Tu, Xin Zhou, Dingkang Liang, Xiang Bai

NeurIPS 2024 | Conference PDF | Archive PDF | Plain Text | LLM Run Details

Reproducibility Variable	Result	LLM Response
Research Type	Experimental	Experiments on three benchmarks, including Scan Net20, Scan Refer, and Scan Net200, demonstrate that the Uni Seg3D consistently outperforms current SOTA methods, even those specialized for individual tasks.
Researcher Affiliation	Academia	Wei Xu , Chunsheng Shi , Sifan Tu, Xin Zhou, Dingkang Liang, Xiang Bai Huazhong University of Science and Technology {wxu2023, csshi, dkliang, xbai}@hust.edu.cn
Pseudocode	No	The paper describes its methodology in Section 3 and illustrates it with Fig. 2, but it does not provide formal pseudocode or algorithm blocks.
Open Source Code	Yes	Code and models are available at https://dk-liang.github.io/Uni Seg3D/.
Open Datasets	Yes	Datasets. We evaluate the Uni Seg3D on three benchmarks: Scan Net20 [6], Scan Refer [1], and Scan Net200 [47].
Dataset Splits	Yes	All models are trained for 512 epochs on a single NVIDIA RTX 4090 GPU and evaluated per 16 epochs on the validation set to find the best-performed model.
Hardware Specification	Yes	All models are trained for 512 epochs on a single NVIDIA RTX 4090 GPU and evaluated per 16 epochs on the validation set to find the best-performed model.
Software Dependencies	No	The paper mentions using a 'frozen CLIP [46] text encoder' but does not specify version numbers for any software dependencies.
Experiment Setup	Yes	We adopt the Adam W optimizer with the polynomial schedule, setting an initial learning rate as 0.0001 and the weight decay as 0.05. All models are trained for 512 epochs on a single NVIDIA RTX 4090 GPU and evaluated per 16 epochs on the validation set to find the best-performed model.