rbm_dbn

Open In Colab

Restricted Boltzmann Machines (RBMs) and Deep Belief Networks (DBNs) from scratch for representation learning on the MNIST dataset.

All of the code is based on "A Practical Guide to Training Restricted Boltzmann Machines" by Geoffrey Hinton and "A fast learning algorithm for deep belief nets" by Geoffrey Hinton et al. Both papers can be found in literature/.
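Both papers center on training RBMs with contrastive divergence. As a quick orientation, here is a minimal, self-contained sketch of a single CD-1 update for a binary-binary RBM. It is an illustration written for this README, not the code in rbm.py, and all names in it are illustrative.

```python
import numpy as np

rng = np.random.default_rng(0)

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def cd1_step(W, b_v, b_h, v0, lr=0.01):
    """One CD-1 update for a binary-binary RBM (Hinton's practical guide).

    W: (n_visible, n_hidden) weights; b_v, b_h: visible/hidden biases;
    v0: (batch, n_visible) binary data batch.
    """
    # Positive phase: sample hidden states from the data.
    p_h0 = sigmoid(v0 @ W + b_h)
    h0 = (rng.random(p_h0.shape) < p_h0).astype(float)

    # Negative phase: one Gibbs step back down and up again. Following the
    # guide, probabilities (not samples) are used for the reconstruction
    # statistics to reduce sampling noise.
    p_v1 = sigmoid(h0 @ W.T + b_v)
    p_h1 = sigmoid(p_v1 @ W + b_h)

    # Gradient estimate: <v h>_data - <v h>_reconstruction, batch-averaged.
    batch = v0.shape[0]
    W += lr * (v0.T @ p_h0 - p_v1.T @ p_h1) / batch
    b_v += lr * (v0 - p_v1).mean(axis=0)
    b_h += lr * (p_h0 - p_h1).mean(axis=0)
    return W, b_v, b_h
```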

The Sphinx-generated documentation of the code is located in docs/. cd to docs/, run make html, then open index.html at docs/_build/html in your favorite browser.

Some of the code is credited to the TAs and lecturers of DD2437 Artificial Neural Networks and Deep Architectures, who delivered an amazing set of lectures and laboratories. Thanks for everything, it was really educational! The code was part of a laboratory, the description of which (with much of the theoretical background) is located in literature/.

Files

  1. util.py - Utility file containing activation functions, sampling methods, load/save files, etc.
  2. rbm.py - Contains the Restricted Boltzmann Machine class.
  3. dbn.py - Contains the Deep Belief Network class.
  4. data/train-images-idx3-ubyte - MNIST training images
  5. data/train-labels-idx1-ubyte - MNIST training labels
  6. data/t10k-images-idx3-ubyte - MNIST test images
  7. data/t10k-labels-idx1-ubyte - MNIST test labels
  8. trained_rbm/ - Directory to store the trained RBM model.
  9. single_rbm/ - Directory to store figures of the reconstruction losses of different RBMs.
  10. rbm_viz/ - Directory to store the learned weights of different RBMs.
  11. rbm_dbn.ipynb - Notebook for a walkthrough and demo.
  12. literature/ - Papers and documents that the code is based on.
  13. dbn_mp4/ - Directory to store an animation of generating digits from the trained DBN.
  14. docs/ - Sphinx docs of the code.

TODO

Implement the wake-sleep algorithm to fine-tune all the parameters of the DBN. A structural sketch of the algorithm follows the list below.

  1. dbn.train_wakesleep_finetune() - main method for wake-sleep learning
  2. rbm.update_generate_params() - updates the generative parameters (directed)
  3. rbm.update_recognize_params() - updates the recognition parameters (directed)
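The sketch below shows the structure of one wake-sleep update for a single directed layer pair, following "A fast learning algorithm for deep belief nets". It is a standalone illustration, not the repo's code: in the full DBN the hidden fantasy comes from Gibbs sampling in the top-level RBM, and the two update blocks correspond roughly to what rbm.update_generate_params() and rbm.update_recognize_params() would do.

```python
import numpy as np

rng = np.random.default_rng(0)

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def sample(p):
    return (rng.random(p.shape) < p).astype(float)

def wake_sleep_step(v, R, G, b_h, b_v, lr=0.01):
    """One wake-sleep update for a directed layer pair.

    R: recognition weights (visible -> hidden), shape (n_v, n_h)
    G: generative weights (hidden -> visible), shape (n_h, n_v)
    """
    batch = v.shape[0]

    # Wake phase: recognize real data bottom-up, then make the generative
    # weights better at reconstructing v from the recognized hidden states
    # (roughly rbm.update_generate_params()).
    h_wake = sample(sigmoid(v @ R + b_h))
    v_pred = sigmoid(h_wake @ G + b_v)
    G += lr * h_wake.T @ (v - v_pred) / batch
    b_v += lr * (v - v_pred).mean(axis=0)

    # Sleep phase: dream a fantasy top-down (a random hidden state stands in
    # here for the top-level RBM's Gibbs samples), then make the recognition
    # weights better at inferring the hidden cause of the fantasy
    # (roughly rbm.update_recognize_params()).
    h_sleep = (rng.random((batch, G.shape[0])) < 0.5).astype(float)
    v_sleep = sample(sigmoid(h_sleep @ G + b_v))
    h_pred = sigmoid(v_sleep @ R + b_h)
    R += lr * v_sleep.T @ (h_sleep - h_pred) / batch
    b_h += lr * (h_sleep - h_pred).mean(axis=0)
    return R, G, b_h, b_v
```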

Implement a momentum update in rbm.update_params(v_0, h_0, v_k, h_k) for more efficient gradient-based optimization; a sketch follows.
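Hinton's practical guide recommends a momentum of around 0.5 at the start of training, raised to about 0.9 later on. A minimal sketch, assuming one velocity array per parameter; the helper name and the grad_W expression in the comment are illustrative, not the actual rbm.py internals.

```python
import numpy as np

def momentum_update(param, velocity, grad, lr=0.01, momentum=0.9):
    """In-place momentum step: velocity <- momentum * velocity + lr * grad,
    then param <- param + velocity."""
    velocity *= momentum
    velocity += lr * grad
    param += velocity
    return param, velocity

# Inside a hypothetical rbm.update_params(v_0, h_0, v_k, h_k), the CD weight
# gradient would be the usual difference of batch-averaged statistics:
#   grad_W = (v_0.T @ h_0 - v_k.T @ h_k) / v_0.shape[0]
#   self.W, self.vel_W = momentum_update(self.W, self.vel_W, grad_W)
```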

Improve weight initialization in rbm and dbn - right now it is just a random normal with a fixed random seed. A sketch of the initialization recommended in Hinton's guide follows.
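Hinton's practical guide suggests small zero-mean Gaussian weights (std 0.01), zero hidden biases, and visible biases of log(p_i / (1 - p_i)), where p_i is the fraction of training vectors in which visible unit i is on. A minimal sketch; the function name and parameter layout are illustrative, not the repo's actual API.

```python
import numpy as np

def init_rbm_params(n_visible, n_hidden, train_data=None, seed=0):
    """Initialization from Hinton's practical guide: small Gaussian weights,
    zero hidden biases, and visible biases set from pixel-on frequencies."""
    rng = np.random.default_rng(seed)
    W = rng.normal(0.0, 0.01, size=(n_visible, n_hidden))
    b_h = np.zeros(n_hidden)
    if train_data is not None:
        # p_i = fraction of training cases in which visible unit i is on;
        # clip away from 0 and 1 so the log-odds stay finite.
        p = np.clip(train_data.mean(axis=0), 1e-3, 1 - 1e-3)
        b_v = np.log(p / (1.0 - p))
    else:
        b_v = np.zeros(n_visible)
    return W, b_v, b_h
```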

Time to run

A rough estimate of the running time to expect for each section of the notebook.

Training a single RBM takes on the order of 10-20 minutes for the whole training set. Training a DBN, which involves training three separate RBMs, takes roughly three times as long as training a single RBM, so on the order of 30 to 90 minutes. The wake-sleep fine-tuning will (when completed) take around 30 to 60 minutes.