Notice: The reproducibility variables underlying each score are classified using an automated LLM-based pipeline, validated against a manually labeled dataset. LLM-based classification introduces uncertainty and potential bias; scores should be interpreted as estimates. Full accuracy metrics and methodology are described in Coakley et alK. L. Coakley, T. Snelleman, H. Hoos, and O. E. Gundersen, "The embrace of open science: An analysis of a decade of AI research and 56 800 conference papers," Under Review, 2026..

Taxonomy of reduction matrices for Graph Coarsening

Authors: Antonin Joly, Nicolas Keriven, Aline Roumy

NeurIPS 2025 | Venue PDF | LLM Run Details

Reproducibility Variable Result LLM Response
Research Type Experimental We compare these different choices, both in terms of RSA and performance when used within GNNs trained on coarsened graphs. In this section, we evaluate numerically the performance of these examples, both in terms of RSA constant and used within GNNs.
Researcher Affiliation Academia Antonin Joly CNRS, IRISA, Rennes, FRANCE EMAIL Nicolas Keriven CNRS, IRISA, Rennes, FRANCE EMAIL Aline Roumy INRIA, Rennes, FRANCE EMAIL
Pseudocode Yes Algorithm 1 Loukas Algorithm
Open Source Code Yes The code is available at https://gitlab.inria.fr/anjoly/taxonomy-coarsening-matrices
Open Datasets Yes We consider the two classical medium-scale graphs Cora [30], and Citeseer [15], and use the public split from [39] for training the GNNs.
Dataset Splits Yes We consider the two classical medium-scale graphs Cora [30], and Citeseer [15], and use the public split from [39] for training the GNNs. Each training is averaged on 10 random split, using the same experimental setting as in [23], and the hyperparameters are provided in App. G.1. Table 2: Characteristics of Cora and Cite Seer Datasets Dataset # Nodes # Edges # Train Nodes # Val Nodes # Test Nodes Cora 2,708 10,556 140 500 1,000 Cora PCC 2,485 10,138 122 459 915 Citeseer 3,327 9,104 120 500 1,000 Citeseer PCC 2,120 7,358 80 328 663
Hardware Specification No handling larger graphs presents challenges. Indeed, in the optimizations we propose, the RSA requires computing the square root of the original Laplacian, which is not sparse in general and, for graphs like Reddit [16], cannot be stored on modern GPUs.
Software Dependencies No It only uses open-source Python libraries.
Experiment Setup Yes G.1 Hyperparameters for Tab. 1 For the GCN, for both Cora and Cite Seeer we have 3 convolutional layers with the hidden dimensions [256, 128]. We use an Adam Optimizer with a learning rate lr = 0.01 and a weight decay wd = 0.001. For the SGC model on Cora and Citeseer we make 2 propagations as preprocessing for the features. We use an Adam Optimizer with a learning rate lr = 0.1 and a weight decay wd = 0.001.