reproducibilityindex.ai

Fairness Aware Counterfactuals for Subgroups

Authors: Loukas Kavouras, Konstantinos Tsopelas, Giorgos Giannopoulos, Dimitris Sacharidis, Eleni Psaroudaki, Nikolaos Theologitis, Dimitrios Rontogiannis, Dimitris Fotakis, Ioannis Emiris

NeurIPS 2023 | Conference PDF | Archive PDF | Plain Text | LLM Run Details

Reproducibility Variable	Result	LLM Response
Research Type	Experimental	We demonstrate the advantages, the wide applicability, and the efficiency of our approach through a thorough experimental evaluation of different benchmark datasets.
Researcher Affiliation	Academia	Loukas Kavouras IMSI / Athena RC kavouras@athenarc.gr Konstantinos Tsopelas IMSI / Athena RC k.tsopelas@athenarc.gr Giorgos Giannopoulos IMSI / Athena RC giann@athenarc.gr Dimitris Sacharidis ULB dimitris.sacharidis@ulb.be Eleni Psaroudaki NTUA & IMSI / Athena RC epsaroudaki@mail.ntua.gr Nikolaos Theologitis IMSI / Athena RC n.theologitis@athenarc.gr Dimitrios Rontogiannis IMSI / Athena RC dronto@gmail.com Dimitris Fotakis NTUA & Archimedes / Athena RC fotakis@cs.ntua.gr Ioannis Emiris NKUA & IMSI / Athena RC emiris@athenarc.gr
Pseudocode	No	The paper describes the steps of the FACTS framework in narrative text but does not include structured pseudocode or a clearly labeled algorithm block.
Open Source Code	Yes	The code is available at: https://github.com/AutoFairAthenaRC/FACTS.
Open Datasets	Yes	This section presents the experimental evaluation of FACTS on the Adult dataset [1]; more information about the datasets, experimental setting, and additional results can be found in the appendix. The code is available at: https://github.com/AutoFairAthenaRC/FACTS. ... C Datasets Description ... 7https://raw.githubusercontent.com/columbia/fairtest/master/data/adult/adult.csv 8https://aif360.readthedocs.io/en/latest/modules/generated/aif360.sklearn. datasets.fetch_compas.html 10https://raw.githubusercontent.com/samuel-yeom/fliptest/master/exact-ot/ chicago-ssl-clean.csv 11https://developer.ibm.com/exchanges/data/all/bias-in-advertising/ 12https://archive.ics.uci.edu/dataset/144/statlog+german+credit+data
Dataset Splits	No	The resulting dataset was split randomly with a 70:30 split ratio and was used to train and test respectively a logistic regression model (consequently used as the black-box model to audit).
Hardware Specification	Yes	Experiments were run on commodity hardware (AMD Ryzen 5 5600H processor, 8GB RAM).
Software Dependencies	No	To conduct our experiments, we have used the Logistic Regression classification model, where we use the default implementation of the python package scikit-learn. ... Our results can be reproduced using the random seed value 131313 in the data split function (train_test_split from the python package scikit-learn). ... We used the implementation in the Python package mlxtend. ... On the software side, all experiments were run in an isolated conda environment using Python 3.9.16.
Experiment Setup	Yes	Experimental Setting The first step was the dataset cleanup (e.g., removing missing values and duplicate features, creating bins for continuous features like age). The resulting dataset was split randomly with a 70:30 split ratio and was used to train and test respectively a logistic regression model... For the generation of the subgroups and the set of actions we used fp-growth with a 1% support threshold on the test set. We also implemented various cost functions, depending on the type of feature, i.e., categorical, ordinal, and numerical. ... We used two different values arbitrarily, i.e., a relatively low effectiveness level of ϕ = 30% and a relatively high effectiveness level of ϕ = 70%. For the estimation of budget-level values c we followed a more elaborate procedure. ... We have chosen the 30%, 60% and 90% percentiles arbitrarily.