Robust Loss Functions for Training Decision Trees with Noisy Labels
Authors: Jonathan Wilton, Nan Ye
AAAI 2024
| Reproducibility Variable | Result | LLM Response |
|---|---|---|
| Research Type | Experimental | Lastly, our experiments on multiple datasets and noise settings validate our theoretical insight and the effectiveness of our adaptive negative exponential loss. |
| Researcher Affiliation | Academia | School of Mathematics and Physics, The University of Queensland |
| Pseudocode | Yes | See Algorithm 1 in Appendix J for details. |
| Open Source Code | Yes | Our source code is available at https://github.com/jonathanwilton/RobustDecisionTrees. |
| Open Datasets | Yes | We consider some commonly used datasets from UCI including Covertype (Blackard 1998), 20News (Lang 1995), Mushrooms (Audubon Society Field Guide 1987), as well as the MNIST digits (LeCun et al. 1998), CIFAR-10 (Krizhevsky 2009) and UNSW-NB15 (Moustafa and Slay 2015). |
| Dataset Splits | Yes | The NE impurity hyperparameter λ ∈ {0, 0.25, 0.5, 0.75, 1} was tuned by training on 80% of the noisy training set and validating on the other 20%. |
| Hardware Specification | Yes | Experiments were performed using Python on a computer cluster with Intel Xeon E5 Family CPUs @ 2.20GHz and 192GB memory running CentOS. |
| Software Dependencies | No | The paper mentions 'Python' and 'scikit-learn' but does not provide specific version numbers for these software components, which are required for a reproducible description. |
| Experiment Setup | Yes | Following common practice (Pedregosa et al. 2011), we set no restriction on maximum depth or number of leaf nodes, minimum one sample per leaf node, 100 trees in each RF, each using only √d randomly chosen features, and predictions with RF made by averaging the label distributions over all trees. The NE impurity hyperparameter λ ∈ {0, 0.25, 0.5, 0.75, 1} was tuned by training on 80% of the noisy training set and validating on the other 20%. GCE used q = 0.7 as recommended for NNs (Zhang and Sabuncu 2018), and Credal-C4.5 used s = 1 as recommended in (Mantas and Abellan 2014). |
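The setup described in the last row can be sketched in scikit-learn. Note the caveats: the paper's NE impurity (with parameter λ) is not available in scikit-learn, so the loop below tunes λ over the stated grid purely to illustrate the 80%/20% tuning protocol while a standard Gini-based RF acts as a stand-in model; the synthetic data and random seeds are likewise illustrative assumptions, not the paper's datasets.

```python
import numpy as np
from sklearn.ensemble import RandomForestClassifier
from sklearn.metrics import accuracy_score
from sklearn.model_selection import train_test_split

# Illustrative synthetic data standing in for a noisy training set.
rng = np.random.default_rng(0)
X = rng.normal(size=(400, 10))
y = (X[:, 0] + X[:, 1] > 0).astype(int)

# 80%/20% split of the (noisy) training set, per the table.
X_tr, X_val, y_tr, y_val = train_test_split(
    X, y, test_size=0.2, random_state=0
)

# Grid for the NE impurity hyperparameter lambda, as stated in the table.
# lambda is unused by sklearn's Gini criterion; a faithful run needs the
# authors' NE impurity from their released code.
lambda_grid = [0, 0.25, 0.5, 0.75, 1]
best_lam, best_acc = None, -1.0
for lam in lambda_grid:
    # RF configured per the table: no depth or leaf-count limits,
    # min one sample per leaf, 100 trees, sqrt(d) features per split.
    rf = RandomForestClassifier(
        n_estimators=100,
        max_depth=None,
        min_samples_leaf=1,
        max_features="sqrt",
        random_state=0,
    )
    rf.fit(X_tr, y_tr)
    acc = accuracy_score(y_val, rf.predict(X_val))
    if acc > best_acc:
        best_lam, best_acc = lam, acc
```

With the stand-in criterion every λ scores identically, so the loop keeps the first grid value; swapping in the NE impurity from the authors' repository would make the selection meaningful.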