GPSE Implementation #9018

Open · semihcanturk wants to merge 10 commits into master
Conversation

semihcanturk commented Mar 5, 2024

Graph Positional and Structural Encoder (GPSE) implementation as per #8310, adapted from the original repository: https://github.com/G-Taxonomy-Workgroup/GPSE. This version is a standalone implementation decoupled from GraphGym, and thus aims for better accessibility and smoother integration into PyG. While the priority of this PR is to enable loading and using pre-trained models in a plug-and-play fashion, it also includes the custom loss function used to train the model. Nevertheless, it might be easier to use the original repository for pre-training and fine-tuning new GPSE models for the time being.

This PR includes the following:

  • GPSE: the main GPSE module, which generates learned positional and structural encodings for input graphs.
  • Several helper classes (FeatureEncoder, GNNStackStage, IdentityHead, GNNInductiveHybridMultiHead, ResGatedGCNConvGraphGymLayer, Linear, MLP, GeneralMultiLayer, GeneralLayer, BatchNorm1dNode, BatchNorm1dEdge, VirtualNodePatchSingleton) and wrapper functions (GNNPreMP, GNNLayer), all adapted from their GraphGym counterparts for compatibility and to enable loading weights pre-trained with the GraphGym/original version.
  • The class method GPSE.from_pretrained(), which returns a model with pre-trained weights from the original repository/Zenodo files (see the first sketch after this list).
  • GPSENodeEncoder, a helper linear/MLP encoder that takes the GPSE encodings precomputed as batch.pestat_GPSE in the input graphs, maps them to a desired dimension, and appends them to the node features.
  • precompute_GPSE, a function that takes a GPSE model and a dataset and precomputes GPSE encodings for that dataset in-place using the helper function gpse_process_batch.
  • The transform AddGPSE, which, in similar fashion to AddLaplacianEigenvectorPE and AddRandomWalkPE, adds GPSE encodings to a given graph using the helper function gpse_process (see the second sketch after this list).
  • The testing modules test/test_gpse.py and test/test_add_gpse.py.
  • The loss function gpse_loss and helper functions cosim_col_sep and process_batch_idx used in GPSE training.
  • A comprehensive example in examples/gpse.py using the ZINC dataset. Two different ways of using GPSE to generate encodings are demonstrated: (a) as a function that adds encodings in-place through precompute_GPSE, and (b) as a pre-transform. [EDIT: To be added as a separate PR]
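
A minimal sketch of the precompute route, for illustration only: the import path follows the file added in this PR (torch_geometric/nn/models/gpse.py), while the pre-trained model name ('molpcba'), the GPSENodeEncoder constructor arguments, and its forward call convention are assumptions not confirmed by this description:

```python
from torch_geometric.datasets import ZINC
from torch_geometric.loader import DataLoader
from torch_geometric.nn.models.gpse import (GPSE, GPSENodeEncoder,
                                            precompute_GPSE)

# Load a model with pre-trained weights (the model name is illustrative).
model = GPSE.from_pretrained('molpcba')

# Precompute GPSE encodings in-place: every graph in the dataset gains a
# `pestat_GPSE` attribute holding its node-level encodings.
dataset = ZINC('data/ZINC', subset=True, split='train')
precompute_GPSE(model, dataset)

# Map the precomputed encodings to a desired dimension and append them to
# the node features (argument names and the forward call are assumptions).
encoder = GPSENodeEncoder(dim_emb=64, dim_pe_in=32, dim_pe_out=16,
                          expand_x=False)

loader = DataLoader(dataset, batch_size=32, shuffle=True)
for batch in loader:
    # ZINC's categorical atom features would normally be embedded first;
    # `x` would then be passed to a downstream GNN together with
    # `batch.edge_index`.
    x = encoder(batch.x, batch.pestat_GPSE)
```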
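
And a sketch of the pre-transform route: by analogy with AddLaplacianEigenvectorPE and AddRandomWalkPE, AddGPSE is assumed to live in torch_geometric.transforms and to wrap a (pre-trained) GPSE model; the constructor call is an assumption:

```python
from torch_geometric.datasets import ZINC
from torch_geometric.nn.models.gpse import GPSE
from torch_geometric.transforms import AddGPSE

# Wrap a pre-trained GPSE model in the transform so that encodings are
# attached to every graph when the dataset is first processed.
model = GPSE.from_pretrained('molpcba')  # model name is illustrative
dataset = ZINC('data/ZINC', subset=True, split='train',
               pre_transform=AddGPSE(model))

print(dataset[0].pestat_GPSE.shape)  # one encoding vector per node
```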

codecov bot commented Mar 27, 2024

Codecov Report

Attention: Patch coverage is 64.41860%, with 153 lines in your changes missing coverage. Please review.

Project coverage is 89.23%. Comparing base (ed17034) to head (f24acbc).

❗ The current head f24acbc differs from the pull request's most recent head 9f000d0. Consider uploading reports for commit 9f000d0 to get more accurate results.

Files                               Patch %   Lines
torch_geometric/nn/models/gpse.py   62.68%    153 Missing ⚠️
Additional details and impacted files
@@            Coverage Diff             @@
##           master    #9018      +/-   ##
==========================================
+ Coverage   88.46%   89.23%   +0.76%     
==========================================
  Files         470      472       +2     
  Lines       30189    30594     +405     
==========================================
+ Hits        26708    27301     +593     
+ Misses       3481     3293     -188     

