torchogonal

Promise:

Make your deep neural network Dynamically Isometric. Say farewell to vanishing and exploding gradients

What does this repo actually do?

  • It purely makes your model weight tensors `w` orthogonal (`w.t() @ w = Identity`). This preserves the norm of the input tensor. Since every eigenvalue of an orthogonal matrix has norm equal to 1, gradients can easily flow even through a very deep network.
  • Using SVD (NOT QR decomposition with Gram-Schmidt, which causes many issues). This ensures that even non-square (MxN) matrices will be orthogonalized, and that the orthogonalization stays as close as possible to the current weights; see the sketch after this list.
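
A minimal sketch of the SVD route described above; `orthogonalize_` is an illustrative name, not this repo's internal API. Given `w = U S Vᵀ`, replacing the singular values with ones yields `U Vᵀ`, the closest (semi-)orthogonal matrix to `w` in the Frobenius norm (the orthogonal Procrustes solution), and it works for non-square matrices too:

```python
import torch

@torch.no_grad()
def orthogonalize_(w: torch.Tensor) -> None:
    """Replace w in-place with its closest (semi-)orthogonal matrix."""
    original_shape = w.shape
    w2d = w.reshape(original_shape[0], -1)   # flatten conv kernels to 2D
    u, _, vh = torch.linalg.svd(w2d, full_matrices=False)
    # Dropping the singular values leaves u @ vh: for a tall MxN matrix
    # w.t() @ w = I; for a wide one, w @ w.t() = I.
    w.copy_((u @ vh).reshape(original_shape))

# Quick check of the property from the first bullet:
w = torch.randn(128, 64)
orthogonalize_(w)
assert torch.allclose(w.t() @ w, torch.eye(64), atol=1e-5)
```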

How to use?

  • Import it:
    `from torchogonal import orthogonlize_module`
  • When initializing the model, run:
    `orthogonlize_module(model)`

IMPORTANT! This is an in-place function: it does not return a new model, it modifies the input model itself. After the call, 'model' weights are orthogonal.

  • After each optimizer step, run this line again to re-orthogonalize the model:
    `orthogonlize_module(model)`

The weights will be orthogonalized as close as possible to the updated weights (paper #2).
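
Putting it together, a minimal training-loop sketch of the pattern above. The toy model, data, and loss are placeholders; only the `orthogonlize_module` calls are the usage this README describes:

```python
import torch
import torch.nn as nn
from torchogonal import orthogonlize_module

model = nn.Sequential(nn.Linear(64, 64), nn.ReLU(), nn.Linear(64, 10))
orthogonlize_module(model)               # in-place: weights are now orthogonal

optimizer = torch.optim.SGD(model.parameters(), lr=1e-2)
loss_fn = nn.CrossEntropyLoss()

for step in range(100):
    x = torch.randn(32, 64)              # dummy batch
    y = torch.randint(0, 10, (32,))      # dummy labels
    optimizer.zero_grad()
    loss_fn(model(x), y).backward()
    optimizer.step()
    orthogonlize_module(model)           # re-orthogonalize after every step
```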

Tests and requirements

  • Tested on Python 3.8 and PyTorch 1.10. The functions used in this repo are basic, so it should work with even older versions.

Issues and future work

  • Even though orthogonalization is extremely important in RNNs, they require a different orthogonalization approach (paper #3).

Inspired by the papers:

  1. Dynamical Isometry and a Mean Field Theory of CNNs: How to Train 10,000-Layer Vanilla Convolutional Neural Networks (https://arxiv.org/pdf/1806.05393.pdf)
  2. projUNN: efficient method for training deep networks with unitary matrices (https://arxiv.org/pdf/2203.05483.pdf)
  3. Dynamical Isometry and a Mean Field Theory of RNNs: Gating Enables Signal Propagation in Recurrent Neural Networks (https://arxiv.org/pdf/1806.05394.pdf)
