
Improving Generalization in Federated Learning by Seeking Flat Minima


This repository contains the official implementation of

Caldarola, D., Caputo, B., & Ciccone, M. Improving Generalization in Federated Learning by Seeking Flat Minima, European Conference on Computer Vision (ECCV) 2022.

Abstract

Models trained in federated settings often suffer from degraded performance and fail to generalize, especially when facing heterogeneous scenarios. In this work, we investigate such behavior through the lens of the geometry of the loss surface and the Hessian eigenspectrum, linking the model's lack of generalization capacity to the sharpness of the solution. Motivated by prior studies connecting the sharpness of the loss surface to the generalization gap, we show that i) training clients locally with Sharpness-Aware Minimization (SAM) or its adaptive version (ASAM) and ii) averaging stochastic weights (SWA) on the server side can substantially improve generalization in Federated Learning and help bridge the gap with centralized models. By seeking parameters in neighborhoods with uniformly low loss, the model converges towards flatter minima and its generalization improves significantly in both homogeneous and heterogeneous scenarios. Empirical results demonstrate the effectiveness of these optimizers across a variety of benchmark vision datasets (e.g., CIFAR-10/100, Landmarks-User-160k, IDDA) and tasks (large-scale classification, semantic segmentation, domain generalization).
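
For readers unfamiliar with the client-side ingredient, the sketch below illustrates the generic two-step SAM update (ascend to the locally worst-case weights within a radius rho, then descend from that perturbed point). This is a minimal PyTorch illustration, not the repository's implementation; the function name and the default rho value are placeholders.

import torch

def sam_step(model, loss_fn, x, y, base_optimizer, rho=0.05):
    """One generic SAM update (minimal sketch); rho is the neighborhood radius."""
    # 1) Ascent: compute gradients and perturb weights toward the worst-case direction
    loss_fn(model(x), y).backward()
    params = [p for p in model.parameters() if p.grad is not None]
    grads = [p.grad.detach().clone() for p in params]
    grad_norm = torch.norm(torch.stack([g.norm(p=2) for g in grads]), p=2)
    with torch.no_grad():
        eps = [rho * g / (grad_norm + 1e-12) for g in grads]
        for p, e in zip(params, eps):
            p.add_(e)  # w -> w + e(w)
    base_optimizer.zero_grad()
    # 2) Descent: gradient taken at the perturbed point, applied to the original weights
    loss_fn(model(x), y).backward()
    with torch.no_grad():
        for p, e in zip(params, eps):
            p.sub_(e)  # restore w before the optimizer step
    base_optimizer.step()
    base_optimizer.zero_grad()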

Setup

Environment

  • Install conda environment (preferred): conda env create -f environment.yml
  • Install with pip (alternative): pip3 install -r requirements.txt

Weights and Biases

The code logs metrics with Weights & Biases (WANDB). To set up your profile, see the WANDB quickstart documentation. Insert your WANDB API key where indicated in the code. WANDB_MODE is set to "online" by default; switch it to "offline" if no internet connection is available.
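
As a quick reference, a standard WANDB setup looks like the following (the project name and metric names are placeholders, not this repository's configuration):

import os
import wandb

os.environ["WANDB_MODE"] = "online"            # set to "offline" without internet access
wandb.login(key=os.environ["WANDB_API_KEY"])   # API key from wandb.ai/authorize
run = wandb.init(project="fedsam-experiments") # placeholder project name
run.log({"round": 1, "test_accuracy": 0.42})   # example metric logging
run.finish()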

Resources

All experiments run on a single NVIDIA GTX 1070. If needed, you can specify the GPU ID in the code.
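
If you prefer to pin the GPU outside the code, a common PyTorch pattern is the following (the device ID 0 is a placeholder):

import os

# Restrict CUDA visibility to one GPU; set this before the first torch import
os.environ["CUDA_VISIBLE_DEVICES"] = "0"  # placeholder GPU ID

import torch

device = torch.device("cuda" if torch.cuda.is_available() else "cpu")
print(device)  # "cuda" -> the selected GPU, "cpu" -> fallback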

Data

Execute the following commands to set up the datasets:

conda activate torch10
cd data
chmod +x setup_datasets.sh
./setup_datasets.sh

Datasets

  1. CIFAR-100
  • Overview: Image Dataset based on CIFAR-100 and Federated Vision Datasets
  • Details: 100 users with 500 images each; client splits follow a Dirichlet distribution over classes, so different degrees of heterogeneity are possible (a sketch follows this list)
  • Task: Image Classification over 100 classes
  2. CIFAR-10
  • Overview: Image Dataset based on CIFAR-10 and Federated Vision Datasets
  • Details: 100 users with 500 images each; client splits follow a Dirichlet distribution over classes, so different degrees of heterogeneity are possible
  • Task: Image Classification over 10 classes
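
To make the Dirichlet-based splits above concrete, here is a minimal NumPy sketch of label-skewed partitioning (the function name and the concentration value alpha are illustrative, not this repository's code); smaller alpha produces more heterogeneous clients:

import numpy as np

def dirichlet_partition(labels, num_clients=100, alpha=0.5, seed=0):
    """Split sample indices across clients with Dirichlet label skew (sketch)."""
    rng = np.random.default_rng(seed)
    client_indices = [[] for _ in range(num_clients)]
    for c in np.unique(labels):
        idx = np.flatnonzero(labels == c)
        rng.shuffle(idx)
        # Per-class proportions over clients; smaller alpha -> stronger skew
        proportions = rng.dirichlet(alpha * np.ones(num_clients))
        cuts = (np.cumsum(proportions)[:-1] * len(idx)).astype(int)
        for client, part in enumerate(np.split(idx, cuts)):
            client_indices[client].extend(part.tolist())
    return client_indices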

Running experiments

Example commands for reproducing the paper experiments (FedAvg/FedSAM/FedASAM, with and without SWA) can be found in fedsam/paper_experiments. For example, for CIFAR-10 run:

cd paper_experiments
chmod +x cifar10.sh
./cifar10.sh
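
For context on the "w/ SWA" variants: server-side SWA keeps a running average of the global models produced after each round, once averaging kicks in. The helper below is a minimal sketch under that reading, not this repository's API (function and variable names are placeholders). Note that BatchNorm running statistics are buffers, not parameters, so standard SWA practice recomputes them with a separate forward pass.

import copy
import torch

@torch.no_grad()
def update_swa_model(swa_model, global_model, n_averaged):
    """Maintain a running average of global-model weights (SWA sketch)."""
    if swa_model is None:
        # First averaged round: start from a copy of the current global model
        return copy.deepcopy(global_model), 1
    for swa_p, p in zip(swa_model.parameters(), global_model.parameters()):
        # new_avg = (old_avg * n + p) / (n + 1)
        swa_p.mul_(n_averaged / (n_averaged + 1.0)).add_(p / (n_averaged + 1.0))
    return swa_model, n_averaged + 1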

Bibtex

@inproceedings{caldarola2022improving,
  title={Improving generalization in federated learning by seeking flat minima},
  author={Caldarola, Debora and Caputo, Barbara and Ciccone, Marco},
  booktitle={European Conference on Computer Vision},
  pages={654--672},
  year={2022},
  organization={Springer}
}