reproducibilityindex.ai

Revisiting Causal Discovery from a Complexity-Theoretic Perspective

Authors: Robert Ganian, Viktoriia Korchemna, Stefan Szeider

IJCAI 2024 | Conference PDF | Archive PDF | Plain Text | LLM Run Details

Reproducibility Variable	Result	LLM Response
Research Type	Theoretical	This paper investigates the complex relationship between the graph structure and the efficiency of constraint-based causal discovery algorithms. Our main contributions include (i) a near-tight characterization of which causal graphs admit a small d-separating set for each pair of vertices and thus can potentially be efficiently recovered by a constraintbased causal discovery algorithm, (ii) the explicit construction of a sequence of causal graphs on which the influential PC algorithm might need exponential time, although there is a small d-separating set between every pair of variables, and (iii) the formulation of a new causal discovery algorithm which achieves fixed-parameter running time by considering the maximum number of edge-disjoint paths between variables in the (undirected) super-structure as the parameter. A distinguishing feature of our investigation is that it is carried out within a more fine-grained model which more faithfully captures the infeasibility of performing accurate independence tests for large sets of conditioning variables.
Researcher Affiliation	Academia	Robert Ganian , Viktoriia Korchemna , Stefan Szeider Algorithms and Complexity Group, TU Wien, Vienna, Austria {rganian,vkorchemna,sz}@ac.tuwien.ac.at
Pseudocode	Yes	The PC algorithm... This phase consists of the following sequence of procedures [Glymour et al., 2019]: 1. Construct a complete undirected graph over the provided set of variables. 2. Eliminate edges between variables that are either unconditionally independent or are known to be independent due to expert knowledge (non-adjacent in the super-structure). 3. For each pair of variables {X, Y } having an edge between them, and for each variable Z with an edge connected to either of them, eliminate the edge between X and Y if X is conditionally independent of Y given Z. 4. For each pair of variables {X, Y } having an edge between them, and for each pair of variables {Z, W} with edges both connected to X or both connected to Y , eliminate the edge between X and Y if X Y \| {Z, W}. 5. Continue checking independencies conditional on subsets of variables of size 3, . . . , i until there are no more adjacent pairs X, Y , such that there is a subset of variables of size i such that all of the variables in the subset are adjacent to X or all adjacent to Y .
Open Source Code	No	The paper does not provide any link or explicit statement about the availability of source code for the methodology described.
Open Datasets	No	This is a theoretical paper focusing on complexity analysis and algorithm design, not empirical evaluation on datasets. Therefore, no public dataset access information is provided.
Dataset Splits	No	This is a theoretical paper and does not involve empirical experiments with dataset splits for training, validation, or testing.
Hardware Specification	No	This is a theoretical paper focusing on algorithm design and complexity analysis. It does not describe any empirical experiments or the hardware used to run them.
Software Dependencies	No	This is a theoretical paper and does not describe empirical experiments or specific software dependencies with version numbers for reproducibility.
Experiment Setup	No	This is a theoretical paper and does not provide details on an experimental setup, hyperparameters, or training configurations.