reproducibilityindex.ai

Predicting Soccer Highlights from Spatio-Temporal Match Event Streams

Authors: Tom Decroos, Vladimir Dzyuba, Jan Van Haaren, Jesse Davis

AAAI 2017 | Conference PDF | Archive PDF | Plain Text | LLM Run Details

Reproducibility Variable	Result	LLM Response
Research Type	Experimental	An empirical evaluation on a real-world dataset shows that POGBA outperforms the baseline algorithms in terms of both precision and recall.
Researcher Affiliation	Academia	Tom Decroos, Vladimir Dzyuba, Jan Van Haaren, Jesse Davis KU Leuven, Department of Computer Science, 3001 Leuven, Belgium {tom.decroos, vladimir.dzyuba, jan.vanhaaren, jesse.davis}@cs.kuleuven.be
Pseudocode	Yes	Algorithm 1 provides pseudocode for the full prediction pipeline.
Open Source Code	Yes	More extensive and detailed results are available in the online supplement.3 https://dtai.cs.kuleuven.be/sports/pogba
Open Datasets	No	The dataset consists of play-by-play data for 69 soccer matches from a Belgian professional soccer club. The paper does not provide explicit access information (link, citation, repository) for this dataset, nor does it state that it is a publicly available or open dataset.
Dataset Splits	No	For each match, we train the model on the 68 other matches and compute the precision and recall for the remaining match aggregated over both teams. This describes a leave-one-match-out cross-validation setup, but does not specify a distinct validation set or explicit train/validation/test split percentages/counts typically used for hyperparameter tuning.
Hardware Specification	No	The paper mentions 'Our unoptimized Python implementation classiﬁes a window in approximately three seconds' but does not provide specific details about the hardware used for running experiments (e.g., CPU/GPU models, memory).
Software Dependencies	No	The paper mentions 'Our unoptimized Python implementation' but does not specify the version of Python or any other software dependencies with version numbers.
Experiment Setup	Yes	Based on our preliminary experiments and domain knowledge, we set the number of nearest neighbors k to 100, the exponential smoothing parameter α to 0.2, and the peak detection window w to 18. The choice of w is based on the analysis of the training data. It showed that, on average, a noteworthy event (e.g., a goal attempt) happens approximately every three minutes. Hence, we set the peak detection window to 18, which corresponds to 90 seconds, as this yields roughly one detected peak per three minutes. In the main experiment, we used ERT.