reproducibilityindex.ai

A Study of Educational Data Mining: Evidence from a Thai University

Authors: Ruangsak Trakunphutthirak, Yen Cheung, Vincent C. S. Lee734-741

AAAI 2019 | Conference PDF | Archive PDF | Plain Text | LLM Run Details

Reproducibility Variable	Result	LLM Response
Research Type	Experimental	To contribute to this area of research, we employed two datasets such as web-browsing categories and Internet access activity types to select the best outcomes, and compared different weights in the time and frequency domains. We found that the random forest technique provides the best outcome in these datasets to identify those students who are at-risk of failure.
Researcher Affiliation	Academia	Ruangsak Trakunphutthirak, Yen Cheung, Vincent C. S. Lee, SMIEEE Faculty of IT, Clayton Campus, Monash University Melbourne, Australia {ruangsak.trakunphutthirak, yen.cheung, vincent.cs.lee} @monash.edu
Pseudocode	No	The paper describes methods and includes flowcharts (e.g., Figure 1) but does not provide any structured pseudocode or algorithm blocks.
Open Source Code	No	The paper does not contain any statement about making its source code publicly available, nor does it provide a link to a code repository for its methodology.
Open Datasets	No	The university's log file was gathered by recording all internet access activities of students. ... The permission to use the dataset has been approved by the research ethics committee of the university. Due to privacy and security concerns, students identification was also encrypted and de-identified in the dataset used for this study. This indicates a private, internal dataset, not a publicly available one.
Dataset Splits	Yes	This study used 10 folds cross-validation to reduce the bias of a test dataset.
Hardware Specification	No	The paper does not mention any specific hardware used for running the experiments, such as CPU or GPU models, or memory specifications.
Software Dependencies	No	The paper mentions using various machine learning techniques like Decision Tree (J48), Logistic Regression, Naive Bayes, Neural Network, and Random Forest, but it does not specify any software platforms, libraries, or their version numbers used for implementation.
Experiment Setup	No	The paper discusses data preprocessing, attribute selection, and the use of different datasets (APP and CAT) and their combinations. However, it does not specify concrete hyperparameters or system-level training settings for the machine learning models (e.g., learning rates, batch sizes, epochs, optimizer details).