Observational Bias and Inductive Link Prediction

Abstract

Link prediction is essential in graph machine learning, with applications in domains such as social networks and drug-target interaction networks. The transductive setting, in which both test nodes are observed during training, is the most common scenario. When one or both nodes are unobserved, the task becomes semi-inductive or inductive link prediction. Recent research has uncovered how machine learning models exploit graph topology as a shortcut: simple algorithms achieve performance comparable to computationally expensive deep neural networks. We show that transductive link prediction relies heavily on higher-degree nodes, which are frequently observed and encountered during training, a phenomenon we term *observational bias*. Basic node features such as degrees and nearest-neighbor degrees serve as strong predictors, eliminating the need for latent graph representation learning in the transductive setting. To evaluate the true capabilities of machine learning models in link prediction, we design inductive tests that remove topological shortcuts and observational bias. Our findings reveal that state-of-the-art models struggle in these inductive tests, highlighting the importance of leveraging node attributes independent of graph topology.

This work focuses on advancements and challenges in link prediction, exploring the roles of graph topology, observational bias, and node attributes in transductive and inductive scenarios. Our research aims to provide insights for the development of more effective link prediction models.
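
As an illustration of the degree-based predictors discussed above, the following is a minimal, hypothetical sketch (not code from this repository) that scores candidate links using only endpoint degrees and mean nearest-neighbor degrees; the synthetic graph and the logistic-regression scorer are stand-ins for illustration.

```python
# Hypothetical sketch: link prediction from degree features alone.
# Not code from this repository; networkx, numpy, and scikit-learn are assumed.
import networkx as nx
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import roc_auc_score

def edge_features(G, u, v):
    """Endpoint degrees and mean nearest-neighbor degrees."""
    ku, kv = G.degree(u), G.degree(v)
    knn_u = np.mean([G.degree(w) for w in G.neighbors(u)]) if ku else 0.0
    knn_v = np.mean([G.degree(w) for w in G.neighbors(v)]) if kv else 0.0
    return [ku, kv, knn_u, knn_v]

G = nx.barabasi_albert_graph(1000, 5, seed=0)  # stand-in graph
rng = np.random.default_rng(0)
pos = list(G.edges())
neg = []
while len(neg) < len(pos):  # sample an equal number of non-edges
    u, v = rng.integers(0, G.number_of_nodes(), size=2)
    if u != v and not G.has_edge(u, v):
        neg.append((int(u), int(v)))

X = np.array([edge_features(G, u, v) for u, v in pos + neg])
y = np.array([1] * len(pos) + [0] * len(neg))
clf = LogisticRegression(max_iter=1000).fit(X, y)
# In-sample AUROC, purely to illustrate how informative degree features are.
print("AUROC:", roc_auc_score(y, clf.predict_proba(X)[:, 1]))
```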

Reproducing the Results

Requirements

We use the OGB benchmark to develop and run the experiments. Please refer to the OGB documentation and setup instructions before executing the experiments listed below: https://github.com/snap-stanford/ogb
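
As a quick sanity check that the OGB setup works, a minimal sketch (assuming the ogb package is installed; the dataset name is just an example):

```python
# Minimal sanity check for the OGB setup (dataset name is illustrative).
from ogb.linkproppred import LinkPropPredDataset

dataset = LinkPropPredDataset(name="ogbl-ddi")
split_edge = dataset.get_edge_split()  # default transductive splits
print(split_edge["train"]["edge"].shape)  # training edges as a NumPy array
```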

Merge the files from /Inductive-tests/ogb/ into the extracted ogb folder.

For setting up the state-of-the-art link prediction model PLNLP, please refer to: https://github.com/zhitao-wang/plnlp

Topological Shortcuts in Transductive Tests

Configuration models, traditional and duplex: /Topological-Shortcuts/maximumentropymodels.py (a short sketch of the underlying idea follows below)

Transductive link prediction on ogbl-ddi: /Topological-Shortcuts/ogbl-ddi-configuration-model.ipynb
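
For intuition on how a configuration model can act as a topological shortcut, here is a minimal, hypothetical sketch (not the repository's implementation, which covers both traditional and duplex maximum-entropy models) that scores a candidate pair by its expected connection probability under a degree-preserving null model:

```python
# Hypothetical sketch: under the traditional configuration model, the
# expected number of edges between u and v is roughly k_u * k_v / (2m),
# which can be used directly as a link-prediction score.
import networkx as nx

def configuration_model_score(G: nx.Graph, u, v) -> float:
    two_m = 2.0 * G.number_of_edges()
    return G.degree(u) * G.degree(v) / two_m

G = nx.karate_club_graph()  # stand-in graph
print(configuration_model_score(G, 0, 33))  # hub pairs score highest
```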

Inductive Tests on State-of-the-art PLNLP with random node split

ogbl-ddi: /Inductive-tests/PLNLP/main_ddi_node_split.py

ogbl-ppa: /Inductive-tests/PLNLP/main_node_split_ppa.py

ogbl-collab: /Inductive-tests/PLNLP/main_node_split_collab.py
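
The essence of a random node split, as used in these inductive tests, is to partition nodes (rather than edges) into disjoint train/validation/test sets and keep only edges that fall entirely within one set, so test nodes are never observed during training. A minimal, hypothetical sketch (not the repository's exact code):

```python
# Hypothetical sketch of a random node split for inductive evaluation.
# Cross-partition edges are discarded, which is the source of "edges lost"
# in inductive splits.
import numpy as np

def random_node_split(edges, num_nodes, frac_train=0.8, frac_valid=0.1, seed=0):
    rng = np.random.default_rng(seed)
    perm = rng.permutation(num_nodes)
    n_tr = int(frac_train * num_nodes)
    n_va = int(frac_valid * num_nodes)
    part = np.empty(num_nodes, dtype=int)
    part[perm[:n_tr]] = 0             # train nodes
    part[perm[n_tr:n_tr + n_va]] = 1  # validation nodes
    part[perm[n_tr + n_va:]] = 2      # test nodes
    same = part[edges[:, 0]] == part[edges[:, 1]]
    kept = edges[same]
    splits = [kept[part[kept[:, 0]] == s] for s in (0, 1, 2)]
    edges_lost = len(edges) - len(kept)
    return splits, edges_lost
```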

Inductive tests using MLPs and pre-trained node attributes

ogbl-ddi: /Inductive-tests/ogb/examples/linkproppred/ddi/

ogbl-ppa: /Inductive-tests/ogb/examples/linkproppred/ppa/

ogbl-collab: /Inductive-tests/ogb/examples/linkproppred/collab/

Supplementary Material

Topological Shortcuts in DGL

We compare the performance of GraphSAGE [1] with the traditional configuration model and observe that the deep model leverages topological shortcuts: the two achieve comparable AUROC on several of the DGL benchmark datasets below. We use a random edge split to create the train-validation-test graphs.
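
For contrast with the inductive node split above, a minimal, hypothetical sketch of a random edge split: edges, not nodes, are shuffled into train/validation/test, so the endpoints of test edges typically remain visible in the training graph.

```python
# Hypothetical sketch of a transductive random edge split.
import numpy as np

def random_edge_split(edges, frac_train=0.8, frac_valid=0.1, seed=0):
    rng = np.random.default_rng(seed)
    perm = rng.permutation(len(edges))
    n_tr = int(frac_train * len(edges))
    n_va = int(frac_valid * len(edges))
    return (edges[perm[:n_tr]],            # train edges
            edges[perm[n_tr:n_tr + n_va]], # validation edges
            edges[perm[n_tr + n_va:]])     # test edges
```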

Datasets are available at: https://docs.dgl.ai/en/0.4.x/api/python/data.html

| Dataset | GraphSAGE AUROC | Configuration Model AUROC |
|---|---|---|
| CitationGraphDataset - pubmed | 0.912 | 0.881 |
| CoraDataset | 0.864 | 0.829 |
| CoraFull | 0.889 | 0.857 |
| AmazonCoBuy - computers | 0.499 | 0.891 |
| AmazonCoBuy - photo | 0.500 | 0.897 |
| Coauthor - cs | 0.721 | 0.854 |
| Coauthor - physics | 0.861 | 0.851 |

Description of the OGB MLP Architectures for Reproducibility

| Dataset | Layers | Parameters | Hidden Channels | Dropout | Batch Size | Learning Rate | Epochs |
|---|---|---|---|---|---|---|---|
| ogbl-ppa | 3 | 113,921 | 256 | 0.1 | 65,536 | 0.01 | 20 |
| ogbl-collab | 3 | 99,073 | 256 | 0.1 | 65,536 | 0.01 | 200 |
| ogbl-ddi | 3 | 99,073 | 256 | 0.1 | 65,536 | 0.01 | 100 |
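
A minimal, hypothetical PyTorch sketch matching the hyperparameters in the table (3 layers, 256 hidden channels, dropout 0.1); the repository's exact implementation may differ, and the Hadamard combination of endpoint attributes is an assumption here.

```python
# Hypothetical sketch of an MLP link predictor over pre-trained node
# attributes: endpoint features are combined with a Hadamard product
# and scored by a small MLP.
import torch
import torch.nn.functional as F

class LinkMLP(torch.nn.Module):
    def __init__(self, in_channels, hidden_channels=256, num_layers=3, dropout=0.1):
        super().__init__()
        self.lins = torch.nn.ModuleList()
        self.lins.append(torch.nn.Linear(in_channels, hidden_channels))
        for _ in range(num_layers - 2):
            self.lins.append(torch.nn.Linear(hidden_channels, hidden_channels))
        self.lins.append(torch.nn.Linear(hidden_channels, 1))
        self.dropout = dropout

    def forward(self, x_u, x_v):
        h = x_u * x_v  # Hadamard product of endpoint attributes
        for lin in self.lins[:-1]:
            h = F.relu(lin(h))
            h = F.dropout(h, p=self.dropout, training=self.training)
        return torch.sigmoid(self.lins[-1](h))  # link probability
```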

Benchmark Graph Datasets are insufficient for Inductive Tests

The Open Graph Benchmark (OGB) [2] provides a useful benchmark for comparing link prediction models, but it is limited to the transductive setting, and its provided train-validation-test splits are inadequate for inductive tests. OGB offers large-scale graph datasets from domains such as social networks, biological networks, and molecular graphs, with splits specifically tailored to test generalization based on properties of each graph. For example, in ogbl-ppa (a protein-protein interaction network), the training graph consists of interactions obtained via high-throughput technology or text mining; this method is cost-effective but produces low-confidence data. The validation and test edges come from low-throughput, resource-intensive experiments, so they are high-confidence and provide a challenging generalization scenario. However, the large overlap between the nodes in the train, validation, and test graphs prevents us from creating node-disjoint train and test datasets for an inductive setting: when we remove the training edges that share nodes with the test set, we lose the majority of the training edges. Running an inductive test using the default OGB splits is therefore infeasible. The tables below summarize the node overlaps between the train-validation-test splits of the OGB link prediction datasets (a sketch of this measurement follows the tables).

The benchmark train-validation-test graphs in OGB have overlapping nodes, which hinder the creation of inductive tests:

| Dataset | Train Nodes | Validation Nodes | Test Nodes | Train − Test Nodes | Test − Train Nodes |
|---|---|---|---|---|---|
| ogbl-ppa | 576,289 | 276,199 | 576,071 | 0 | 0 |
| ogbl-collab | 235,868 | 144,942 | 143,679 | 0 | 0 |
| ogbl-ddi | 3,967 | 3,995 | 1,737 | 86 | 0 |

Inductive train-validation-test split using random edge split on the OGB benchmark:

| Dataset | Train Nodes | Validation Nodes | Test Nodes | Train Edges | Validation Edges | Test Edges | Edges Lost |
|---|---|---|---|---|---|---|---|
| ogbl-ppa | 3,991 | 36,778 | 461,288 | 2,626 | 25,973 | 1,213,051 | 29,084,623 |
| ogbl-collab | 36,593 | 62,561 | 60,023 | 27,482 | 51,419 | 42,680 | 1,281,488 |
| ogbl-ddi | 0 | 3,759 | 508 | 0 | 53,396 | 5 | 445,109 |
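
A minimal, hypothetical sketch of how the node overlaps and lost edges above can be measured from the default OGB splits (the dataset name is illustrative; this is not the repository's exact code):

```python
# Measure node overlap between splits and count training edges that would
# be lost in a node-disjoint (inductive) setting.
import numpy as np
from ogb.linkproppred import LinkPropPredDataset

dataset = LinkPropPredDataset(name="ogbl-ddi")
split = dataset.get_edge_split()
train, test = split["train"]["edge"], split["test"]["edge"]

train_nodes = set(np.unique(train))
test_nodes = set(np.unique(test))
print("test nodes unseen in train:", len(test_nodes - train_nodes))

# Training edges touching any test node must be removed for an inductive test.
touches_test = np.isin(train, list(test_nodes)).any(axis=1)
print("train edges lost:", int(touches_test.sum()))
```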

References

[1] William L. Hamilton, Rex Ying, and Jure Leskovec. 2017. Inductive Representation Learning on Large Graphs. https://arxiv.org/abs/1706.02216

[2] Weihua Hu, Matthias Fey, Marinka Zitnik, Yuxiao Dong, Hongyu Ren, Bowen Liu, Michele Catasta, and Jure Leskovec. 2020. Open Graph Benchmark: Datasets for Machine Learning on Graphs. https://arxiv.org/abs/2005.00687
