Skip to content

EmilianoGagliardiEmanueleGhelfi/CNN-compression-performance

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Introduction

For real world application, convolutional neural network(CNN) model can take more than 100MB of space and can be computationally too expensive. Therefore, there are multiple methods to reduce this complexity in the state of art. Ristretto is a plug-in to Caffe framework that employs several model approximation methods. For this projects, first a CNN model will be trained for Cifar-10 dataset with Caffe, then Ristretto will be use to generate multiple approximated version of the trained model using different schemes. The goal of this projects is comparison of the models in terms of execution performance, model size and cache utilizations in the test or inference phase. The same steps are done with Tensorflow and Quantisation tool. The quantisation schemes of Tensorflow and Ristretto are then compared.

Folder description

  • ristretto: contains the tools for caffe ristretto quantization
  • tf_quantize: contains the tools for tensorflow quantization

A more detailed description is inside the readme of each folder.

Results

Cache Misses

Authors

  • Emanuele Ghelfi
  • Emiliano Gagliardi

About

A python script that automatise the training of a CNN, compress it through tensorflow (or ristretto) plugin, and compares the performance of the two networks

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published