reproducibilityindex.ai

Non-Discriminatory Machine Learning Through Convex Fairness Criteria

Authors: Naman Goel, Mohammad Yaghini, Boi Faltings

AAAI 2018 | Conference PDF | Archive PDF | Plain Text | LLM Run Details

Reproducibility Variable	Result	LLM Response
Research Type	Experimental	Our experimental analysis demonstrates the success of the method on popular real datasets including Pro Publica s COMPAS dataset. and 7 Experimental Evaluation As mentioned earlier, the weighted sum of logs technique needs to be tuned for meeting a given p-rule requirement while ensuring as little drop in accuracy as possible. We evaluate three additional techniques that can used for this. ... We work with two datasets in our experimental evaluation. The ﬁrst dataset is the Pro Publica s COMPAS dataset (Larson et al. 2016). ... The second dataset is the popular Adult Income dataset (UCI 1996).
Researcher Affiliation	Academia	Naman Goel, Mohammad Yaghini, Boi Faltings Artiﬁcial Intelligence Laboratory, Ecole Polytechnique F ed erale de Lausanne, Lausanne, Switzerland, 1015 {naman.goel, mohammad.yaghini, boi.faltings}@epﬂ.ch
Pseudocode	No	The paper describes the proposed technique in text and mathematical formulations but does not include any pseudocode or clearly labeled algorithm blocks.
Open Source Code	No	The paper does not provide an explicit statement or link for the open-source code of the methodology described in this paper.
Open Datasets	Yes	The ﬁrst dataset is the Pro Publica s COMPAS dataset (Larson et al. 2016). It contains data from Broward County, Florida, compiled by Pro Publica. ... The second dataset is the popular Adult Income dataset (UCI 1996). This dataset has 45, 222 instances and 14 attributes such as age, occupation, working hours per week and work class etc. ... References: Larson, J.; Mattu, S.; Kirchner, L.; and Angwin, J. 2016. https://github.com/propublica/compas-analysis, 2016. UCI. 1996. Adult income data set. url: https://archive. ics.uci.edu/ml/datasets/adult.
Dataset Splits	No	All the results presented in this paper are k-fold cross-validated. The paper mentions using k-fold cross-validation but does not specify the value of k or provide explicit train/validation/test split percentages or sample counts.
Hardware Specification	No	The paper does not provide any specific hardware details such as GPU or CPU models, memory specifications, or types of computing resources used for the experiments.
Software Dependencies	No	The paper mentions using a 'logistic regression classiﬁer' and states 'Our implementation uses the same set of libraries and code for optimization and cross validation etc.', but it does not specify any particular software names with version numbers.
Experiment Setup	No	The paper discusses hyperparameters like δ and γ conceptually, but it does not provide specific numerical values for these or other experimental setup details such as learning rates, batch sizes, or optimizer configurations used in the experiments.