Visual Concepts Tokenization

Authors: Tao Yang, Yuwang Wang, Yan Lu, Nanning Zheng

NeurIPS 2022

Reproducibility Variable | Result | LLM Response
Research Type | Experimental | Extensive experiments on several popular datasets verify the effectiveness of VCT on the tasks of disentangled representation learning and scene decomposition. VCT achieves state-of-the-art results by a large margin.
Researcher Affiliation | Collaboration | Tao Yang (1), Yuwang Wang (2), Yan Lu (2), Nanning Zheng (1); yt14212@stu.xjtu.edu.cn, {yuwwang,yanlu}@microsoft.com, nnzheng@mail.xjtu.edu.cn; (1) Xi'an Jiaotong University, (2) Microsoft Research Asia
Pseudocode | No | The paper describes the architecture and training procedure using figures and textual descriptions, but it does not include explicit pseudocode or clearly labeled algorithm blocks. (A hedged sketch of the described tokenizer follows below the table.)
Open Source Code | Yes | https://github.com/thomasmry/VCT
Open Datasets | Yes | Following [36], we conduct the experiments on the public datasets below, which are popular in the disentangled representation literature: Shapes3D [27] is a dataset of 3D shapes generated from 6 factors of variation; MPI3D [17] is a 3D dataset recorded in a controlled environment, defined by 7 factors of variation; and Cars3D [34] is a dataset of color renderings of CAD models generated from 3 factors of variation. (A hedged loading sketch for Shapes3D follows below the table.)
Dataset Splits | Yes | Did you specify all the training details (e.g., data splits, hyperparameters, how they were chosen)? [Yes] Please see Appendix A
Hardware Specification | Yes | Did you include the total amount of compute and the type of resources used (e.g., type of GPUs, internal cluster, or cloud provider)? [Yes] Please see Appendix A
Software Dependencies | No | The paper does not provide specific software names with version numbers (e.g., 'PyTorch 1.9', 'Python 3.8') for ancillary software or dependencies.
Experiment Setup | Yes | We set λ_dis = 1 and adopt VQ-VAE for L_rec in all the experiments. (A hedged sketch of this objective follows below the table.)
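
Since the paper itself provides no pseudocode, the following is a minimal sketch of the tokenizer idea it describes in figures and text: learnable concept tokens gather information from image tokens through cross-attention only, with no self-attention among the concept tokens. The class name, layer count, and dimensions here are illustrative assumptions, not the authors' implementation.

```python
import torch
import torch.nn as nn

class ConceptTokenizer(nn.Module):
    """Hedged sketch: learnable concept tokens read from image tokens via
    cross-attention only; with no self-attention among concept tokens, each
    concept is extracted independently of the others."""

    def __init__(self, num_concepts: int = 20, dim: int = 256,
                 num_layers: int = 4, num_heads: int = 4):
        super().__init__()
        # One learnable query vector per visual concept (sizes are assumptions).
        self.concept_tokens = nn.Parameter(torch.randn(num_concepts, dim))
        self.attn_layers = nn.ModuleList(
            nn.MultiheadAttention(dim, num_heads, batch_first=True)
            for _ in range(num_layers)
        )
        self.norms = nn.ModuleList(nn.LayerNorm(dim) for _ in range(num_layers))

    def forward(self, image_tokens: torch.Tensor) -> torch.Tensor:
        # image_tokens: (batch, n_image_tokens, dim), e.g. from a VQ-VAE encoder.
        batch = image_tokens.size(0)
        concepts = self.concept_tokens.unsqueeze(0).expand(batch, -1, -1)
        for attn, norm in zip(self.attn_layers, self.norms):
            # Queries are concept tokens; keys/values are image tokens.
            update, _ = attn(concepts, image_tokens, image_tokens)
            concepts = norm(concepts + update)  # residual update per layer
        return concepts  # (batch, num_concepts, dim)
```

For example, image tokens of shape (8, 64, 256) would yield concept tokens of shape (8, 20, 256), one token per assumed visual concept.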
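
The three datasets listed above are public. As a hedged example of getting started with one of them, the official DeepMind release of Shapes3D ships as a single 3dshapes.h5 file containing 'images' and 'labels' arrays; the file path below is an assumed local location, not something specified by the paper.

```python
import h5py
import numpy as np

# Assumed local path to the official Shapes3D release.
with h5py.File("3dshapes.h5", "r") as f:
    images = f["images"]   # uint8 images, shape (480000, 64, 64, 3)
    labels = f["labels"]   # 6 ground-truth factor values per image
    # Read a small batch lazily and normalize pixel values to [0, 1].
    batch = np.asarray(images[:16], dtype=np.float32) / 255.0
```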
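
Finally, the Experiment Setup row implies a total objective of the form L = L_rec + λ_dis · L_dis, with λ_dis = 1 and a VQ-VAE reconstruction term. A minimal sketch of that weighted sum follows; the definition of the disentangling term itself is given in the paper and not reproduced here.

```python
import torch

LAMBDA_DIS = 1.0  # λ_dis = 1, as reported in the paper

def vct_objective(rec_loss: torch.Tensor, dis_loss: torch.Tensor) -> torch.Tensor:
    # rec_loss: VQ-VAE reconstruction loss on image tokens
    # dis_loss: concept disentangling loss (see the paper for its definition)
    return rec_loss + LAMBDA_DIS * dis_loss
```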