ADIOS: Architectures Deep In Output Space
Authors: Moustapha Cisse, Maruan Al-Shedivat, Samy Bengio
ICML 2016 | Conference PDF | Archive PDF | Plain Text | LLM Run Details
| Reproducibility Variable | Result | LLM Response |
|---|---|---|
| Research Type | Experimental | Experiments on several popular and large multi-label datasets demonstrate that our approach not only yields significant improvements, but also helps to overcome trade-offs specific to the multi-label classification setting. |
| Researcher Affiliation | Collaboration | Moustapha Cisse (MOUSTAPHACISSE@FB.COM), Maruan Al-Shedivat (ALSHEDIVAT@CS.CMU.EDU), Samy Bengio (BENGIO@GOOGLE.COM); Facebook AI Research, Carnegie Mellon University, Google Brain |
| Pseudocode | Yes | Algorithm 1: Approximate MBC construction (an illustrative sketch follows the table) |
| Open Source Code | Yes | Our code is available at https://github.com/alshedivat/adios. |
| Open Datasets | Yes | We used three readily available datasets that are popular in the multi-label community: Delicious (text), Mediamill (video) and NUS-WIDE (images). Additionally, we preprocessed and used two other datasets of moderate and large size: image data from SUN2012 (Xiao et al., 2010) and text data from the BioASQ competition of 2015. |
| Dataset Splits | Yes | All the datasets were randomly split into fixed training (60%), testing (20%), and validation (20%) sets (an example split routine follows the table). |
| Hardware Specification | No | The paper does not provide specific details about the hardware used for running experiments, such as GPU models, CPU types, or memory specifications. |
| Software Dependencies | No | The paper mentions software such as the 'Keras machine learning library', the 'Caffe library', and the 'gensim library', but does not provide version numbers for these dependencies. |
| Experiment Setup | Yes | For neural network models, we used hidden layers with 1024 ReLU units and trained the models using Adagrad with 20 to 50% dropout and batch normalization. Additionally, we used L2 regularization on the weight matrices and L1 activity regularization on the output layers when it improved performance (typical values ranged between 10^-4 and 10^-3 for all the datasets). An example model definition follows the table. |
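
The paper's Algorithm 1 greedily builds an approximate Markov Blanket Chain (MBC) over the label set, i.e., a partition of the labels into groups predicted by successive output layers. The sketch below is not a transcription of that algorithm; it is a loose, hypothetical illustration of one greedy partition heuristic driven by pairwise empirical mutual information between binary labels. The names `pairwise_mi` and `greedy_partition` and the budget parameter `k` are our own, not the paper's.

```python
import numpy as np

def pairwise_mi(Y, eps=1e-12):
    """Empirical pairwise mutual information between binary label columns
    of an (n_samples, n_labels) 0/1 matrix Y."""
    _, L = Y.shape
    mi = np.zeros((L, L))
    for i in range(L):
        for j in range(i + 1, L):
            val = 0.0
            for a in (0, 1):
                for b in (0, 1):
                    p_ab = np.mean((Y[:, i] == a) & (Y[:, j] == b)) + eps
                    p_a = np.mean(Y[:, i] == a) + eps
                    p_b = np.mean(Y[:, j] == b) + eps
                    val += p_ab * np.log(p_ab / (p_a * p_b))
            mi[i, j] = mi[j, i] = val
    return mi

def greedy_partition(Y, k):
    """Hypothetical heuristic: place the k labels sharing the most mutual
    information with all other labels into the first output group, so they
    can act as an approximate Markov blanket for the remaining labels."""
    mi = pairwise_mi(Y)
    scores = mi.sum(axis=1)
    first = np.argsort(-scores)[:k]
    second = np.setdiff1d(np.arange(Y.shape[1]), first)
    return first, second
```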
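The 60/20/20 split itself is straightforward to reproduce. A minimal sketch, assuming a simple shuffled index split (the helper name and seed handling are ours, not taken from the paper or its repository):

```python
import numpy as np

def split_indices(n_samples, seed=0):
    """Random 60% train / 20% test / 20% validation split of sample indices."""
    rng = np.random.default_rng(seed)
    idx = rng.permutation(n_samples)
    n_train = int(0.6 * n_samples)
    n_test = int(0.2 * n_samples)
    train = idx[:n_train]
    test = idx[n_train:n_train + n_test]
    val = idx[n_train + n_test:]
    return train, test, val
```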
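For concreteness, here is a minimal Keras sketch of the reported setup: one hidden layer of 1024 ReLU units, batch normalization, dropout in the 20-50% range, L2 regularization on the weight matrices, L1 activity regularization on the output layer, and Adagrad. The sigmoid outputs, binary cross-entropy loss, and specific coefficient values are multi-label defaults we assume here, not details quoted from the paper, whose released code (linked above) uses the original standalone Keras rather than tf.keras.

```python
import tensorflow as tf
from tensorflow.keras import layers, regularizers

def build_model(input_dim, n_labels, dropout=0.3, reg=1e-4):
    """Single 1024-unit ReLU hidden layer with batch norm and dropout;
    L2 weight regularization throughout, L1 activity regularization on
    the (sigmoid) output layer, trained with Adagrad."""
    model = tf.keras.Sequential([
        tf.keras.Input(shape=(input_dim,)),
        layers.Dense(1024, activation="relu",
                     kernel_regularizer=regularizers.l2(reg)),
        layers.BatchNormalization(),
        layers.Dropout(dropout),  # paper reports 20 to 50% dropout
        layers.Dense(n_labels, activation="sigmoid",
                     kernel_regularizer=regularizers.l2(reg),
                     activity_regularizer=regularizers.l1(reg)),
    ])
    model.compile(optimizer=tf.keras.optimizers.Adagrad(),
                  loss="binary_crossentropy")
    return model
```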