Skip to content

2-norm guided FP32 truncation for heterogeneous deep learning training

Notifications You must be signed in to change notification settings

zhuangsc/bitpack

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

What is this

This repository contains everything needed to train various networks (Alexnet, VGG, Resnet) with Bitpack and Bitunpack on Tensorflow.

Bitpack and Bitunpack routines are machine-specific hence two folders are provided for the x86 and POWER versions.

What do you need to run

  • An x86/POWER machine with GPUs
  • Tensorflow
  • gcc on x86 and XLC on POWER
  • CUDA
  • ImageNet ILSVRC-2012 dataset in TFRecord format

How to run

  1. Depends on the CPU model go to either bitpack_x86 or bitpack_power9, issue command "make".
  2. Choose either one of the three available networks to run (Alexnet, VGG, Resnet). An example job script file is provided.

Reference: Sicong Zhuang, Cristiano Malossi and Marc Casas. 2020. Reducing Data Motion to Accelerate the Training of Deep Neural Networks. CoRR abs/2004.02297. https://arxiv.org/abs/2004.02297

Sicong Zhuang sicong.zhuang@gmail.com

About

2-norm guided FP32 truncation for heterogeneous deep learning training

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published