Sparse3D: Distilling Multiview-Consistent Diffusion for Object Reconstruction from Sparse Views

Authors: Zixin Zou, Weihao Cheng, Yan-Pei Cao, Shi-Sheng Huang, Ying Shan, Song-Hai Zhang

AAAI 2024 | Conference PDF | Archive PDF | Plain Text | LLM Run Details

Reproducibility Variable Result LLM Response
Research Type Experimental We conduct experiments on CO3DV2 which is a multi-view dataset of real-world objects. Both quantitative and qualitative evaluations demonstrate that our approach outperforms previous state-of-the-art works on the metrics regarding NVS and geometry reconstruction.
Researcher Affiliation Collaboration Zixin Zou1, Weihao Cheng2, Yan-Pei Cao2, Shi-Sheng Huang3, Ying Shan2, Song-Hai Zhang1 1BNRist, Tsinghua University 2ARC Lab, Tencent PCG 3Beijing Normal University
Pseudocode No The paper describes its methods through text and diagrams (e.g., Figure 2 and 3) but does not include any explicit pseudocode or algorithm blocks.
Open Source Code Yes Ar Xiv version with supplementary materials is available at https://arxiv.org/abs/2308.14078
Open Datasets Yes We conduct experiments on CO3DV2 which is a multi-view dataset of real-world objects. We follow the fewview-train and fewview-dev splits provided by CO3Dv2 dataset (Reizenstein et al. 2021) for training and evaluation purposes, respectively.
Dataset Splits Yes We follow the fewview-train and fewview-dev splits provided by CO3DV2 dataset (Reizenstein et al. 2021) for training and evaluation purposes, respectively.
Hardware Specification Yes Ne RF optimization runs for 10,000 steps, which takes about 45 minutes on a single 3090 GPU.
Software Dependencies Yes For the multiview-consistent model, we adopt the Stable Diffusion model v1.5 as our priors. For Ne RF reconstruction, we adapt the threestudio (Guo et al. 2023), which is a unified framework for 3D content creation from various inputs, to implement the Ne RF reconstruction for specific objects.
Experiment Setup Yes We set the weights of the losses with λp = 100, λc = 10, λr = 1000 and λm = 50. Ne RF optimization runs for 10,000 steps, which takes about 45 minutes on a single 3090 GPU. In our experiment with setting the CFG value as 7.5