CS224W-Machine Learning with Graph-Link Prediction and Causality

The 3 rungs of the ladder of causation

Rung 1: Associational

Background: Inverse Transform Sampling

Rung 2: Interventional (cont)

Imagine two hypothetical data generators for

do (X=x)(X=x) changes fxf_x to a constant in data generation

Rung 3: Counterfactual

Imagine two hypothetical data generators for

Now assume we know X=xX=x', Y=yY=y'. This knowledge changes distribution of UxU_x  and UyU_y

Casual DAG

The above data generation can be described by an execution graph, called the causal Directed Acyclic Graph (DAG):

Causality Challenge: Identifiability

Link prediction for decision-making interventions (e.g., search & recommendations) tends to be causal

P(Accept(i,j))=yesdo(show recommendation=j to user=i)P(Accept(i,j)) = yes | do (show \ recommendation = j \ to \ user = i )

Importance of Causality in Decision-making

Zillow House Offer Example

Biomedical Experiment Causal Graph

Other Applications of Causality in Graph Learning

(Out-of-distribution Graph Tasks)

Consider an out-of-distribution graph classification task

Differences between In-distribution and Out-of-distribution tasks

Out-of-distribution tasks are a mix of associational and counterfactual tasks

Data: (Xtr,Ytr)(X^{tr},Y^{tr})

Task: Predict YteY ^{te} given XteX^{te} under P(YtrXtr)=P(YteXte)P(Y^{tr} | X^{tr}) = P(Y^{te}| X^{te})

Why is Graph ODD Learning a Counterfactual Task?

Example:

A Causal Mechanism for Graph Sizes

Link Prediction

Temporal Graph Representation Learning is Observational

Time-then-graph more expressive than Time-and-graph (when using MP-GNNs)

Causality & Link Prediction

Link Prediction as an exposure

Task: Link Creation outcome

Causal Identifiability

Graph Formation Process Key to Understand Effect of Exposures

Graph Formation Processes

Latent Factor Graph Formation