reproducibilityindex.ai

Fairwalk: Towards Fair Graph Embedding

Authors: Tahleen Rahman, Bartlomiej Surma, Michael Backes, Yang Zhang

IJCAI 2019 | Conference PDF | Archive PDF | Plain Text | LLM Run Details

Reproducibility Variable	Result	LLM Response
Research Type	Experimental	Experimental results demonstrate that Fairwalk reduces bias under multiple fairness metrics while still preserving the utility.In this section, we ﬁrst describe our dataset, followed by the set-up of the machine learning model for our experiments. We then present the evaluation results for our Fairwalk using the fairness metrics deﬁned in Section 3 and ﬁnally show the recommendation utility in terms of precision and recall.
Researcher Affiliation	Academia	CISPA Helmholtz Center for Information Security {tahleen.rahman, bartlomiej.surma, backes, yang.zhang}@cispa.saarland
Pseudocode	Yes	Algorithm 1 Fair random walk trace generation
Open Source Code	No	The paper does not include an unambiguous statement about releasing the source code for the described methodology or provide a direct link to a code repository.
Open Datasets	No	We use Instagram data collected from two of the biggest cities in different English speaking countries, namely London and Los Angeles (LA). The data was collected in 2016 using the Instagram API. The paper does not provide any link, DOI, or specific citation with authors and year to indicate public availability.
Dataset Splits	Yes	We iterate our experiments 5 times. To this end we divide our dataset into 5 equal slices. We train a random forest with 100 trees using 4 out of 5 slices, leaving out a different slice each time.
Hardware Specification	No	The paper does not provide any specific details about the hardware (e.g., GPU models, CPU models, memory) used to run the experiments, only general statements about running experiments.
Software Dependencies	No	The paper mentions using a 'random forest classiﬁer' and refers to a 'neural network' but does not specify any software libraries or frameworks with version numbers (e.g., scikit-learn version, TensorFlow version, PyTorch version).
Experiment Setup	Yes	We use the following hyper-parameters (following [Grover and Leskovec, 2016]), i.e., length of each walk: walk len = 80, number of walks starting from each node in the graph: walk num = 20, number of dimensions of the resulting vector space: d = 128. We train a random forest with 100 trees