GitHub - akensert/molgraph: Graph neural networks for molecular machine learning. Implemented and compatible with TensorFlow and Keras.

Graph Neural Networks with TensorFlow and Keras. Focused on Molecular Machine Learning.

Currently, Keras 3 does not support extension types. As soon as it does, it is hoped that MolGraph will migrate to Keras 3.

Highlights

Build a Graph Neural Network with Keras' Sequential API:

from molgraph import GraphTensor
from molgraph import layers
from tensorflow import keras

g = GraphTensor(node_feature=[[4.], [2.]], edge_src=[0], edge_dst=[1])

model = keras.Sequential([
    keras.layers.Input(type_spec=g.spec),
    layers.GINConv(units=32),
    layers.GINConv(units=32),
    layers.Readout(),
    keras.layers.Dense(units=1),
])

pred = model(g)

Paper

See arXiv

Documentation

See readthedocs

Overview

Implementations

Graph tensor (GraphTensor)
- A composite tensor holding graph data.
- Has a ragged state (multiple graphs) and a non-ragged state (single disjoint graph).
- Can conveniently go between both states (merge(), separate()).
- Can propagate node states (features) based on edges (propagate()).
- Can add, update and remove graph data (update(), remove()).
- Compatible with TensorFlow's APIs (including Keras). For instance, graph data (encoded as a GraphTensor) can now seamlessly be used with keras.Sequential, keras.Functional, tf.data.Dataset, and tf.saved_model APIs.
Layers
- Convolutional
  - GCNConv (GCNConv)
  - GINConv (GINConv)
  - GCNIIConv (GCNIIConv)
  - GraphSageConv (GraphSageConv)
- Attentional
  - GATConv (GATConv)
  - GATv2Conv (GATv2Conv)
  - GTConv (GTConv)
  - GMMConv (GMMConv)
  - GatedGCNConv (GatedGCNConv)
  - AttentiveFPConv (AttentiveFPConv)
- Message-passing
  - MPNNConv (MPNNConv)
  - EdgeConv (EdgeConv)
- Distance-geometric
  - DTNNConv (DTNNConv)
  - GCFConv (GCFConv)
- Pre- and post-processing
  - In addition to the aforementioned GNN layers, there are also several other layers which improves model-building. See readout/, preprocessing/, postprocessing/, positional_encoding/.
Models
- Although model building is easy with MolGraph, there are some built-in GNN models:
  - GIN
  - MPNN
  - DMPNN
- And models for improved interpretability of GNNs:
  - SaliencyMapping
  - IntegratedSaliencyMapping
  - SmoothGradSaliencyMapping
  - GradientActivationMapping (Recommended)

Requirements/dependencies

Python (version >= 3.10)
- TensorFlow (version 2.15.*)
- RDKit (version 2023.9.*)
- Pandas
- IPython

Installation

For CPU users:

pip install molgraph

For GPU users:

pip install molgraph[gpu]

Now run your first program with MolGraph:

from tensorflow import keras
from molgraph import chemistry
from molgraph import layers
from molgraph import models

# Obtain dataset, specifically ESOL
esol = chemistry.datasets.get('esol')

# Define molecular graph encoder
atom_encoder = chemistry.Featurizer([
    chemistry.features.Symbol(),
    chemistry.features.Hybridization(),
    # ...
])

bond_encoder = chemistry.Featurizer([
    chemistry.features.BondType(),
    # ...
])

encoder = chemistry.MolecularGraphEncoder(atom_encoder, bond_encoder)

# Obtain graphs and associated labels
x_train = encoder(esol['train']['x'])
y_train = esol['train']['y']

x_test = encoder(esol['test']['x'])
y_test = esol['test']['y']

# Build model via Keras API
gnn_model = keras.Sequential([
    keras.layers.Input(type_spec=x_train.spec),
    layers.GATConv(units=32, name='gat_conv_1'),
    layers.GATConv(units=32, name='gat_conv_2'),
    layers.Readout(),
    keras.layers.Dense(units=1024, activation='relu'),
    keras.layers.Dense(units=y_train.shape[-1])
])

# Compile, fit and evaluate
gnn_model.compile(optimizer='adam', loss='mae')
gnn_model.fit(x_train, y_train, epochs=50)
scores = gnn_model.evaluate(x_test, y_test)

# Compute gradient activation maps
gam_model = models.GradientActivationMapping(
    model=gnn_model, layer_names=['gat_conv_1', 'gat_conv_2'])

maps = gam_model(x_train.separate())

Name		Name	Last commit message	Last commit date
Latest commit History 260 Commits
.github/workflows		.github/workflows
docker		docker
docs		docs
examples		examples
molgraph		molgraph
tests		tests
.gitignore		.gitignore
.readthedocs.yaml		.readthedocs.yaml
CHANGELOG.md		CHANGELOG.md
LICENSE		LICENSE
README.md		README.md
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

.github/workflows

.github/workflows

docker

docker

docs

docs

examples

examples

molgraph

molgraph

tests

tests

.gitignore

.gitignore

.readthedocs.yaml

.readthedocs.yaml

CHANGELOG.md

CHANGELOG.md

LICENSE

LICENSE

README.md

README.md

setup.py

setup.py

Repository files navigation

Highlights

Paper

Documentation

Overview

Implementations

Requirements/dependencies

Installation

About

Languages

License

akensert/molgraph

Folders and files

Latest commit

History

Repository files navigation

Highlights

Paper

Documentation

Overview

Implementations

Requirements/dependencies

Installation

About

Topics

Resources

License

Stars

Watchers

Forks

Languages