Repository implementing Cross-Gradient Aggregation (CGA).
Paper accepted at the 38th International Conference on Machine Learning (ICML 2021).
In the proposed CGA algorithm,
- each agent computes gradients of model parameters on its own data set;
- each agent sends its model parameters to its neighbors;
- each agent computes the gradients of its neighbors' models on its own data set and sends the cross gradients back to the respective neighbors;
- cross gradients and local gradients are projected into a single aggregated gradient via Quadratic Programming, which is then used to
- update the model parameters.
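The projection step above can be sketched as a small quadratic program: find the minimum-norm point in the convex hull of the local and cross gradients. The sketch below solves that QP with Frank-Wolfe as a simplified, illustrative stand-in; the repo's actual QP formulation and solver may differ.

```python
import numpy as np

def qp_aggregate(grads, iters=200):
    """Aggregate local and cross gradients by approximately solving
    min_w ||G^T w||^2  s.t.  w >= 0, sum(w) = 1,
    i.e. the minimum-norm point in the convex hull of the gradients.
    Frank-Wolfe sketch only; not the repo's exact QP solver.
    """
    G = np.stack(grads)                 # (k, d): k gradient vectors
    k = G.shape[0]
    w = np.full(k, 1.0 / k)             # uniform starting weights
    GG = G @ G.T                        # Gram matrix of the gradients
    for t in range(iters):
        grad_w = GG @ w                 # gradient of 0.5 * ||G^T w||^2
        s = np.zeros(k)
        s[np.argmin(grad_w)] = 1.0      # linear-minimization oracle on simplex
        gamma = 2.0 / (t + 2.0)         # standard Frank-Wolfe step size
        w = (1.0 - gamma) * w + gamma * s
    return G.T @ w                      # aggregated gradient direction

# Two conflicting gradients: the aggregate balances both directions
# instead of following either one alone.
g_local = np.array([1.0, 0.0])
g_cross = np.array([0.0, 1.0])
agg = qp_aggregate([g_local, g_cross])
```

The aggregated direction decreases the loss on every agent's data simultaneously when such a direction exists, which is why CGA tolerates non-IID data splits better than plain parameter averaging.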
Example run:
python -m torch.distributed.launch --nnodes 1 --nproc_per_node 5 main.py --data_dist non-iid --opt CGA --epochs 5 --experiment 1 -log 5 --data CIFAR10 --model CNN --scheduler --momentum 0.5
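The launcher starts `--nproc_per_node` copies of `main.py` on one node, one process per agent. Each process can identify itself from the variables the launcher sets; a minimal sketch (with single-agent fallbacks for a direct run; the repo's `main.py` may instead take a `--local_rank` argument from the launcher):

```python
import os

# torch.distributed.launch spawns one process per agent and exposes the
# process identity through environment variables (assumption: LOCAL_RANK
# and WORLD_SIZE are set by the launcher; defaults cover a bare run).
rank = int(os.environ.get("LOCAL_RANK", 0))
world_size = int(os.environ.get("WORLD_SIZE", 1))
print(f"agent {rank} of {world_size}")
```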
Supported network topologies:
- Fully Connected
- Ring
- Bipartite
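Each topology corresponds to a mixing matrix over the agents that weights how neighbors' contributions are combined. As an illustration, a doubly stochastic mixing matrix for a ring, in which every agent averages equally with itself and its two neighbors (a generic sketch; the repo builds its own topology matrices):

```python
import numpy as np

def ring_mixing_matrix(n):
    """Doubly stochastic mixing matrix for a ring of n agents (n >= 4):
    each agent weights itself and its two ring neighbors by 1/3."""
    W = np.zeros((n, n))
    for i in range(n):
        for j in (i - 1, i, i + 1):
            W[i, j % n] = 1.0 / 3.0     # wrap around at the ends
    return W

W = ring_mixing_matrix(5)   # 5 agents, matching the example run above
```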
Implemented algorithms (`--opt`):
- CGA: Cross-Gradient Aggregation
- CompCGA: Compressed Cross-Gradient Aggregation
- CDSGD: Consensus Based Distributed Stochastic Gradient Descent
- CDMSGD: Consensus Based Distributed Momentum Stochastic Gradient Descent
- SGP: Stochastic Gradient Push
- SGA
- SwarmSGD
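CompCGA reduces communication cost by compressing the cross gradients before they are sent back to neighbors. A generic top-k sparsification sketch (the compressor actually used in the repo may differ):

```python
import numpy as np

def top_k_compress(grad, k):
    """Keep only the k largest-magnitude entries of a gradient vector.
    Generic sparsification sketch, not CompCGA's exact compressor."""
    idx = np.argsort(np.abs(grad))[-k:]   # indices of the k largest entries
    out = np.zeros_like(grad)
    out[idx] = grad[idx]                  # everything else is zeroed
    return out

g = np.array([0.1, -2.0, 0.05, 1.5])
cg = top_k_compress(g, 2)   # keeps -2.0 and 1.5, zeros the rest
```

In practice only the k surviving values and their indices need to be transmitted, shrinking each message from d floats to roughly 2k numbers.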
Supported models (`--model`):
- LR
- FCN
- CNN (CNN, Big_CNN, stl10_CNN, mnist_CNN)
- VGG (VGG11, VGG13, VGG16, VGG19)
- ResNet (resnet20, resnet32, resnet44, resnet56, resnet110, resnet1202, WideResNet28x10, PreResNet110)
Please cite our paper in your publications if it helps your research:
@article{esfandiari2021cross,
  title={Cross-Gradient Aggregation for Decentralized Learning from Non-IID data},
  author={Esfandiari, Yasaman and Tan, Sin Yong and Jiang, Zhanhong and Balu, Aditya and Herron, Ethan and Hegde, Chinmay and Sarkar, Soumik},
  journal={arXiv preprint arXiv:2103.02051},
  year={2021}
}


