
The Hierarchical Mixture of Bigots

This is my implementation of an approach to the neural network interpretability problem described in the paper "Distilling a Neural Network Into a Soft Decision Tree" by Nicholas Frosst and Geoffrey Hinton.

epoch.gif

Contents

My attempt to replicate the results reported in the paper, along with a demonstration of how this implementation can be used on the MNIST dataset to train a NN model, distill it into a Soft Binary Decision Tree (SBDT) model, and visualize it, can be found in mnist.ipynb.

The remaining content is documented in the table below (and, hopefully, some of it also documents itself).

| Location | Content description |
| --- | --- |
| models/tree.py | Implementation of the SBDT model in tf.keras, with all details as stated in the paper. Some parts, such as the calculation of the loss regularization term, are done in pure TensorFlow; the rest is encapsulated in keras custom layers (a schematic sketch of a single inner node is given below this table).<br><br>Due to keras' limited flexibility, the SBDT model cannot be saved with keras' serialization methods, so tf.train.Saver is used instead. This also means that the keras ModelCheckpoint callback won't work with this implementation (unless the model is rewritten to avoid using tf.Tensor objects as keras Layer arguments); see the checkpointing sketch below the table.<br><br>Due to the use of moving averages in the calculation of the penalty terms, a custom two-step initialization of model parameters is required and model training (evaluation of the tree.loss TensorFlow graph op) is batch_size-dependent. This also means that train_data_size % batch_size == 0 must hold, otherwise a shape mismatch occurs at the end of each training epoch (keras feeds the remainder as a smaller minibatch); see the truncation sketch below the table. |
| models/convnet.py | Implementation of a basic convolutional NN model in tf.keras, as given by the keras MNIST-CNN basic example. |
| models/utils.py | Utility functions for resetting the TensorFlow session and visualizing model parameters. |
| makegif.sh | Converts a directory of images into an animation and labels frames based on the folder name and file names. See mnist.ipynb for example usage. |
| assets | Saved model checkpoints (for easier reproducibility) and illustrative images / animations. |
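For orientation, here is a minimal, hypothetical sketch (not the actual models/tree.py code) of how a single inner node of a soft decision tree can be written as a keras custom layer: the node learns one logistic filter whose output is the probability of routing an example to its right subtree, as described in the paper.

```python
import tensorflow as tf

class InnerNode(tf.keras.layers.Layer):
    """Sketch of one SBDT inner node: p(right) = sigmoid(beta * (x.w + b))."""

    def __init__(self, beta=1.0, **kwargs):
        super(InnerNode, self).__init__(**kwargs)
        self.beta = beta  # inverse temperature that sharpens the routing decision

    def build(self, input_shape):
        dim = int(input_shape[-1])
        self.w = self.add_weight('w', shape=(dim, 1), initializer='glorot_uniform')
        self.b = self.add_weight('b', shape=(1,), initializer='zeros')
        super(InnerNode, self).build(input_shape)

    def call(self, x):
        # Probability of taking the right branch, one value per example in the batch.
        return tf.sigmoid(self.beta * (tf.matmul(x, self.w) + self.b))
```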
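Because keras serialization and the ModelCheckpoint callback don't apply here, checkpoints can be handled directly with tf.train.Saver on the session that backs the keras model. A minimal sketch, with an illustrative checkpoint path (see assets and mnist.ipynb for the paths actually used):

```python
import tensorflow as tf

sess = tf.keras.backend.get_session()  # session holding the model's variables
saver = tf.train.Saver()

# Save after (or periodically during) training.
saver.save(sess, 'assets/tree.ckpt')

# Restore into a freshly built model of the same architecture.
saver.restore(sess, 'assets/tree.ckpt')
```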
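The train_data_size % batch_size == 0 constraint can be satisfied by truncating the training set to a multiple of the batch size before fitting; a small sketch (x_train / y_train as loaded from the keras MNIST dataset):

```python
batch_size = 4
n = (x_train.shape[0] // batch_size) * batch_size  # largest multiple of batch_size
x_train, y_train = x_train[:n], y_train[:n]        # now train_data_size % batch_size == 0
```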

Requirements

Git Large File Storage (git-lfs) is used to store model checkpoints and large gif animation files in the repo. To install it, run

git lfs install
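If the repository was cloned before git-lfs was set up, the stored checkpoints and animations can still be fetched afterwards with

git lfs pull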

The code was tested on Python 3.5 with TensorFlow 1.10.0 and matplotlib 2.1.0.

To install the required Python packages, run

pip3 install -r requirements.txt

For GPU support, use

pip3 install -r requirements_gpu.txt

Results

The table below summarizes the results produced and presented in mnist.ipynb. No exhaustive hyperparameter search was performed, so there is room for improvement.

| Model | Depth | Labels | Batch size | Epochs | Accuracy |
| --- | --- | --- | --- | --- | --- |
| ConvNet | - | hard | 16 | 12 | 99.29% |
| Tree (normal) | 4 | hard | 4 | 40 | 80.88% |
| Tree (distill) | 4 | soft | 4 | 40 | 90.71% |
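The soft labels in the last row are the class probabilities predicted by the trained ConvNet, used as distillation targets for the tree. A minimal sketch of this step, with illustrative variable names rather than the notebook's exact code:

```python
# Class probabilities from the trained ConvNet become the tree's training targets.
soft_labels = convnet.predict(x_train, batch_size=128)  # shape: (num_examples, 10)

# The tree is then fit on (x_train, soft_labels) instead of the one-hot labels.
tree.fit(x_train, soft_labels, batch_size=4, epochs=40)
```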

Interpretability

Just a quick taste of what's inside. Detailed instructions on how to read these visualizations are in the mnist.ipynb notebook.

Inference

infer.gif

Learning

sample.gif
