Higher Distributed

This is the supporting code repository for the article Higher-order gradients in PyTorch, Parallelized by Sanyam Kapoor and Ramakrishna Vedantam.

Setup

(Optional) Setup a new Python environment via conda as:

conda env create -n <name>

Install CUDA-compiled PyTorch version from here. The codebase has been tested with PyTorch version 1.13 on CUDA 11.8.

pip install 'torch<2' torchvision --extra-index-url https://download.pytorch.org/whl/cu118

Finally, in the same target environment (e.g. the one setup above), run to setup all the dependencies.

pip install -e .

Run

We will use CUDA_VISIBLE_DEVICES environment variable to mask the number of GPUs available for use.

For instance, to use four GPUs:

CUDA_VISIBLE_DEVICES=0,1,2,3 accelerate launch --multi_gpu train_toy.py

The default parameters should not need changing for the demo.

NOTE: The device IDs may need to change as per hardware availability.

License

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
environment.yml		environment.yml
requirements.txt		requirements.txt
train_toy.py		train_toy.py
train_toy.sh		train_toy.sh
viz_toy.ipynb		viz_toy.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

.gitignore

.gitignore

LICENSE

LICENSE

README.md

README.md

environment.yml

environment.yml

requirements.txt

requirements.txt

train_toy.py

train_toy.py

train_toy.sh

train_toy.sh

viz_toy.ipynb

viz_toy.ipynb

Repository files navigation

Higher Distributed

Setup

Run

License

About

Languages

License

activatedgeek/higher-distributed

Folders and files

Latest commit

History

Repository files navigation

Higher Distributed

Setup

Run

License

About

Topics

Resources

License

Stars

Watchers

Forks

Languages