Predicting Soccer Highlights from Spatio-Temporal Match Event Streams

Authors: Tom Decroos, Vladimir Dzyuba, Jan Van Haaren, Jesse Davis

AAAI 2017 | Conference PDF | Archive PDF | Plain Text | LLM Run Details

Reproducibility Variable Result LLM Response
Research Type Experimental An empirical evaluation on a real-world dataset shows that POGBA outperforms the baseline algorithms in terms of both precision and recall.
Researcher Affiliation Academia Tom Decroos, Vladimir Dzyuba, Jan Van Haaren, Jesse Davis KU Leuven, Department of Computer Science, 3001 Leuven, Belgium {tom.decroos, vladimir.dzyuba, jan.vanhaaren, jesse.davis}@cs.kuleuven.be
Pseudocode Yes Algorithm 1 provides pseudocode for the full prediction pipeline.
Open Source Code Yes More extensive and detailed results are available in the online supplement.3 https://dtai.cs.kuleuven.be/sports/pogba
Open Datasets No The dataset consists of play-by-play data for 69 soccer matches from a Belgian professional soccer club. The paper does not provide explicit access information (link, citation, repository) for this dataset, nor does it state that it is a publicly available or open dataset.
Dataset Splits No For each match, we train the model on the 68 other matches and compute the precision and recall for the remaining match aggregated over both teams. This describes a leave-one-match-out cross-validation setup, but does not specify a distinct validation set or explicit train/validation/test split percentages/counts typically used for hyperparameter tuning.
Hardware Specification No The paper mentions 'Our unoptimized Python implementation classifies a window in approximately three seconds' but does not provide specific details about the hardware used for running experiments (e.g., CPU/GPU models, memory).
Software Dependencies No The paper mentions 'Our unoptimized Python implementation' but does not specify the version of Python or any other software dependencies with version numbers.
Experiment Setup Yes Based on our preliminary experiments and domain knowledge, we set the number of nearest neighbors k to 100, the exponential smoothing parameter α to 0.2, and the peak detection window w to 18. The choice of w is based on the analysis of the training data. It showed that, on average, a noteworthy event (e.g., a goal attempt) happens approximately every three minutes. Hence, we set the peak detection window to 18, which corresponds to 90 seconds, as this yields roughly one detected peak per three minutes. In the main experiment, we used ERT.