
HyperGAN

Uncertainty Estimation with HyperNetworks

This project stems from a small idea: finding a generative neural network that generates other networks. It turns out these are called HyperNetworks [1], and they have found multiple use cases across different areas of Machine Learning. One thing I thought they would be particularly good at is Uncertainty Estimation, that is, learning to estimate the epistemic uncertainty of a model. A first step in a Bayesian approach to uncertainty estimation is to place a distribution over the model parameters and infer the posterior. Finding this distribution is hard since neural networks may contain thousands of parameters. Previous approaches have used approximations such as Variational Inference or even Dropout [2] to estimate this posterior.
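In symbols (a standard formulation, not specific to this repo): given data $\mathcal{D}$, the Bayesian approach seeks the posterior over the weights $\theta$ and marginalizes predictions over it,

$$
p(\theta \mid \mathcal{D}) = \frac{p(\mathcal{D} \mid \theta)\, p(\theta)}{p(\mathcal{D})},
\qquad
p(y^* \mid x^*, \mathcal{D}) = \int p(y^* \mid x^*, \theta)\, p(\theta \mid \mathcal{D})\, d\theta,
$$

and it is this intractable integral over thousands of parameters that forces the approximations above.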

However, modeling complex distributions is something Neural Networks are very good at. In the same way that GANs need nothing more than a simple discriminator to generate very realistic images, one could presume that they could also generate "samples" of complex neural networks. The only problem is finding a good discriminator. The discriminator measures how close a generated sample is to the true distribution. If we define the "true distribution" as a network that solves some regression or classification task, the discriminator simply becomes how well the generated network performs on that task, i.e., the output of the objective function!

We thus train a generative network to produce the weights of a main network, which performs a forward pass and evaluates the loss function. Trained this way, the generator can produce a large ensemble of neural networks, one per forward pass, as sketched below.
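To make this concrete, here is a minimal sketch of the training setup, assuming a toy 1-D regression task; all names and sizes (`Z_DIM`, `target_forward`, the two-layer target MLP) are illustrative, not the repository's exact API:

```python
# A generator maps noise z to the flat weight vector of a small target MLP,
# the target net runs a functional forward pass with those weights, and the
# task loss itself acts as the "discriminator".
import torch
import torch.nn as nn
import torch.nn.functional as F

Z_DIM = 32                       # hypothetical noise dimension
IN_DIM, HIDDEN, OUT_DIM = 1, 16, 1

# Shapes of the target network's parameters: a one-hidden-layer MLP.
shapes = [(HIDDEN, IN_DIM), (HIDDEN,), (OUT_DIM, HIDDEN), (OUT_DIM,)]
n_params = sum(torch.Size(s).numel() for s in shapes)

generator = nn.Sequential(       # the hypernetwork
    nn.Linear(Z_DIM, 128), nn.ReLU(),
    nn.Linear(128, n_params),
)

def target_forward(x, flat_w):
    """Functional forward pass of the target MLP using generated weights."""
    params, i = [], 0
    for s in shapes:
        n = torch.Size(s).numel()
        params.append(flat_w[i:i + n].view(s))
        i += n
    w1, b1, w2, b2 = params
    return F.linear(torch.relu(F.linear(x, w1, b1)), w2, b2)

opt = torch.optim.Adam(generator.parameters(), lr=1e-3)
x = torch.linspace(-1, 1, 64).unsqueeze(1)           # toy data
y = torch.sin(3 * x) + 0.1 * torch.randn_like(x)

for step in range(1000):
    z = torch.randn(Z_DIM)                           # one noise sample -> one network
    flat_w = generator(z)
    loss = F.mse_loss(target_forward(x, flat_w), y)  # loss plays the discriminator role
    opt.zero_grad(); loss.backward(); opt.step()
```

Each noise sample z corresponds to one fully specified network, so sampling an ensemble is just a batch of noise vectors.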

Ensemble methods are most powerful when the constituent models are as diverse and performant as possible. To encourage diversity between our models, we borrow another trick from the GAN literature: we add a measure of mutual information between the generated output and the noise samples used as input. By keeping this mutual information high, we see greater diversity among the generated networks, and with it higher and more robust performance on our toy dataset.
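The sketch below illustrates how a MINE-style term [3] could be wired in, using the Donsker-Varadhan lower bound on mutual information; `StatisticsNet`, `mi_lower_bound`, and `LAMBDA_MI` are hypothetical names, not the actual interface of mine-pytorch [4]:

```python
# A statistics network T scores joint pairs (z, w) against shuffled (marginal)
# pairs; the Donsker-Varadhan bound E[T(z,w)] - log E[exp(T(z,w'))] is a lower
# bound on I(z; w), maximized to keep the generated networks diverse.
import math
import torch
import torch.nn as nn

class StatisticsNet(nn.Module):
    def __init__(self, z_dim, w_dim, hidden=128):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(z_dim + w_dim, hidden), nn.ReLU(),
            nn.Linear(hidden, 1),
        )

    def forward(self, z, w):
        return self.net(torch.cat([z, w], dim=-1))

def mi_lower_bound(T, z, w):
    """Donsker-Varadhan estimate of I(z; w) over a batch of (z, w) pairs."""
    joint = T(z, w).mean()                        # samples from p(z, w)
    w_shuffled = w[torch.randperm(w.size(0))]     # break pairing -> p(z) p(w)
    marginal = torch.logsumexp(T(z, w_shuffled), dim=0) - math.log(z.size(0))
    return joint - marginal

# During training, with a batch of noise vectors and generated weight vectors:
#   mi = mi_lower_bound(T, z_batch, flat_w_batch)
#   total_loss = task_loss - LAMBDA_MI * mi       # encourage diverse networks
```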

Full write-up available at: https://gtegner.github.io/uncertainty/estimation/2020/01/06/hyper-gan.html

Setup

This project depends on my implementation of Mutual Information Neural Estimation (MINE) [3][4] for mutual information estimation.

pip install -r requirements.txt
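After training, uncertainty estimation comes from ensemble disagreement. A hedged sketch, reusing the hypothetical `generator` and `target_forward` names from the snippet above:

```python
# One forward pass per noise sample yields one network; the spread of the
# ensemble's predictions gives the epistemic uncertainty (illustrative only).
import torch

with torch.no_grad():
    preds = torch.stack([
        target_forward(x, generator(torch.randn(Z_DIM)))
        for _ in range(100)                      # 100 sampled networks
    ])
mean, epistemic_std = preds.mean(0), preds.std(0)
```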

References

[1] HyperNetworks. https://arxiv.org/abs/1609.09106

[2] Dropout as a Bayesian Approximation. https://arxiv.org/abs/1506.02142

[3] Mutual Information Neural Estimation. https://arxiv.org/abs/1801.04062

[4] https://github.com/gtegner/mine-pytorch
