Notice: The reproducibility variables underlying each score are classified using an automated LLM-based pipeline, validated against a manually labeled dataset. LLM-based classification introduces uncertainty and potential bias; scores should be interpreted as estimates. Full accuracy metrics and methodology are described in [1].
Online Learning from Data Streams with Varying Feature Spaces
Authors: Ege Beyazit, Jeevithan Alagurajah, Xindong Wu3232-3239
AAAI 2019 | Venue PDF | LLM Run Details
| Reproducibility Variable | Result | LLM Response |
|---|---|---|
| Research Type | Experimental | Experiments on 10 datasets with varying feature spaces have been conducted to demonstrate the performance of the proposed OLVF algorithm. Moreover, experiments with trapezoidal data streams on the same datasets have been conducted to show that OLVF performs better than the state-of-the-art learning algorithm (Zhang et al. 2016). |
| Researcher Affiliation | Academia | Ege Beyazit University of Louisiana at Lafayette Lafayette, LA, USA EMAIL Jeevithan Alagurajah University of Louisiana at Lafayette Lafayette, LA, USA EMAIL Xindong Wu University of Louisiana at Lafayette Lafayette, LA, USA EMAIL |
| Pseudocode | Yes | Algorithm 1: The OLVF Algorithm |
| Open Source Code | No | The paper does not provide any statement or link indicating that the source code for the described methodology is publicly available. |
| Open Datasets | Yes | We use 9 different UCI datasets to simulate these scenarios. Additionally, we demonstrate the effectiveness of the proposed sparse strategy. Finally we evaluate the performance of OLVF using the real-world dataset IMDB movie reviews (Maas et al. 2011). |
| Dataset Splits | No | The paper mentions 'average prediction accuracy on 20 random permutations of each dataset' but does not specify exact percentages, sample counts, or a formal cross-validation setup for training/validation/test splits. |
| Hardware Specification | No | The paper does not provide any specific details about the hardware (e.g., CPU, GPU models, memory) used for running the experiments. |
| Software Dependencies | No | The paper does not provide specific software dependencies, such as library names with version numbers, needed to replicate the experiments. |
| Experiment Setup | Yes | The parameters C and C are chosen using grid search. We set the C to 0.1, and C to 10 5. For both algorithms, we set B s to 0.1 and ο¬nd their best setting for the C and C parameters by using grid search. |