Learning Mixtures of Gaussian Processes through Random Projection

Authors: Emmanuel Akeweje, Mimi Zhang

ICML 2024 | Conference PDF | Archive PDF | Plain Text | LLM Run Details

Reproducibility Variable Result LLM Response
Research Type Experimental Extensive experiments on synthetic and real datasets confirm the superiority of our method over existing techniques.
Researcher Affiliation Academia 1School of Computer Science and Statistics, Trinity College Dublin, Ireland 2I-Form Advanced Manufacturing Research Centre, Science Foundation Ireland, Ireland.
Pseudocode Yes The pseudo code of the clustering method is given in Algorithm 1.
Open Source Code Yes Our Python package1 GPmix offers a range of options. 1https://github.com/EAkeweje/GPmix
Open Datasets Yes We evaluated the seven algorithms on 10 real datasets from the UEA & UCR Time Series Classification Repository... Details of these simulation scenarios are provided in Appendix G.2.
Dataset Splits No The paper mentions using "training and testing datasets" for real data but does not provide specific details on dataset splits (e.g., percentages, sample counts, or explicit cross-validation methodology) for training, validation, or testing.
Hardware Specification Yes All experiments were conducted on a PC with a 3.20GHz processor, 16 CPU cores, and 32GB of RAM.
Software Dependencies No The paper mentions software like "Python package GPmix" and various "R package" names (fun FEM, fun HDDC, Funclustering, fdapace, fdasrvf, FADPclust) but does not specify exact version numbers for any programming languages or libraries, which are required for reproducibility.
Experiment Setup Yes For each algorithm, the argument for the number of clusters is set to the true cluster number in the dataset. FEM (from R package fun FEM): The other arguments are set to model = "all", crit = "bic", init = "kmeans", maxit = 50, eps = 1e-06. Table 5 gives the configuration of the GPmix algorithm for the ten real datasets and 12 simulation scenarios.