reproducibilityindex.ai

A Disentangled Recognition and Nonlinear Dynamics Model for Unsupervised Learning

Authors: Marco Fraccaro, Simon Kamronn, Ulrich Paquet, Ole Winther

NeurIPS 2017 | Conference PDF | Archive PDF | Plain Text | LLM Run Details

Reproducibility Variable	Result	LLM Response
Research Type	Experimental	The model is trained end-to-end on videos of a variety of simulated physical systems, and outperforms competing methods in generative and missing data imputation tasks. and KVAEs are tested on videos of a variety of simulated physical systems in section 5.
Researcher Affiliation	Collaboration	Technical University of Denmark Deep Mind
Pseudocode	No	The paper does not contain structured pseudocode or algorithm blocks.
Open Source Code	Yes	Further implementation details can be found in the supplementary material (appendix A) and in the Tensorﬂow [1] code released at github.com/simonkamronn/kvae.
Open Datasets	No	The paper uses simulated data that it generates itself: 'We simulate 5000 sequences of 20 time steps each of a ball moving in a two-dimensional box, where each video frame is a 32x32 binary image.' and 'Training, validation and test set are formed by 500 sequences of 15 frames of 16x16 pixels.' No concrete access information (link, DOI, formal citation for a public dataset) is provided.
Dataset Splits	No	The paper mentions 'Training, validation and test set are formed by 500 sequences of 15 frames of 16x16 pixels.' but does not provide specific dataset split information (percentages, sample counts, or citations to predefined splits) to reproduce the data partitioning.
Hardware Specification	Yes	We thank NVIDIA Corporation for the donation of TITAN X GPUs.
Software Dependencies	No	The paper mentions 'Tensorﬂow [1]' but does not provide specific version numbers for any software dependencies.
Experiment Setup	Yes	The minimum number of latent dimensions that the KVAE requires to model the ball s dynamics are at R2 and zt R4, as at the very least the ball s position in the box s 2d plane has to be encoded in at, and zt has to encode the ball s position and velocity. The dynamics parameter network uses K = 3 to interpolate three modes... and We use a KVAE with at R2, zt R3 and K = 2