Try flow hmc in covtype dataset #277

Open · fehiepsi wants to merge 51 commits into master

Conversation

@fehiepsi (Member) commented Aug 8, 2019

Resolves #417. This PR tracks the progress of using flow HMC on the covtype dataset.

Problem setting

  • Data is randomly split into a train set of 400,000 data points and a test set of 181,012 data points (about 31%). Each data point has 55 features, which are normalized.
  • The 400,000 training points are divided into 40 shards of 10,000 data points each.
  • The model is logistic regression (a minimal sketch follows this list).
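
The model code itself isn't shown in this thread; as a hedged sketch, a covtype logistic regression in NumPyro (the names `model`, `features`, and `labels` are mine) could look like:

```python
import jax.numpy as jnp
import numpyro
import numpyro.distributions as dist

def model(features, labels=None):
    # One coefficient per (normalized) feature, with a standard normal prior.
    dim = features.shape[-1]
    coefs = numpyro.sample("coefs", dist.Normal(jnp.zeros(dim), jnp.ones(dim)))
    logits = features @ coefs
    # Binary labels, as in the usual binarized covtype task.
    numpyro.sample("obs", dist.Bernoulli(logits=logits), obs=labels)
```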

Some observations

  • With the full training data, NUTS takes 5s per sample on GPU and 30s on CPU (on 1 device). Hence it is infeasible to run NUTS on CPU. On GPU it would take about a day to get 10,000 samples, so I will defer that run until later. Even so, this is about 200x faster than the embarrassingly parallel paper (which used Stan in 2013 and took 15 minutes per sample on CPU; today Stan might take 2-3 minutes per sample, estimated from the Edward paper). To my knowledge, our JAX-based NUTS implementation is the fastest one for this dataset.
  • Using the subposterior method, I can get all subposteriors (4 chains of 2,500 samples for each of the 40 shards) in just over an hour on CPU (with 4 cores). This shows a huge benefit of subposterior methods. With 40 CPU cores, getting all subposteriors would take just 10 minutes. :D (See the sketch after this list.)
  • Both the consensus and parametric methods give 77.1% accuracy on the test set. This is better than the result in the embarrassingly parallel paper (about 75.5% accuracy). The only other benchmark I could find is libsvm's, which also reports 77.1% accuracy. To be fair, it is probably better to just compare HMC / NeutraHMC / ParallelHMC / FlowHMC.
  • The code is very simple to write, and predicting with vmap is fast and convenient.
  • I don't expect flow HMC to give a better mixing rate for this dataset, but it might help with merging subposteriors (via consensus/parametric). Although our caching mechanism will help a lot, I hope the IAF transform will not add much overhead when running MCMC.
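
For reference, here is a sketch (not this PR's actual code) of the per-shard NUTS runs, the consensus/parametric merging, and the vmap-based prediction mentioned above, written against the current NumPyro API. `train_feats`, `train_labels`, `test_feats`, and `test_labels` are hypothetical names; `consensus` and `parametric_draws` are the helpers in `numpyro.infer.hmc_util`:

```python
import jax.numpy as jnp
from jax import random, vmap
from numpyro.infer import MCMC, NUTS
from numpyro.infer.hmc_util import consensus, parametric_draws

# Split the 400,000 training points into 40 shards of 10,000 each.
num_shards = 40
shard_feats = jnp.split(train_feats, num_shards)
shard_labels = jnp.split(train_labels, num_shards)

# Run NUTS independently on each shard (4 chains x 2,500 samples). In a full
# embarrassingly parallel setup the per-shard prior would also be tempered
# (raised to the power 1/num_shards); that detail is omitted here.
subposteriors = []
for i in range(num_shards):
    mcmc = MCMC(NUTS(model), num_warmup=500, num_samples=2500, num_chains=4)
    mcmc.run(random.PRNGKey(i), shard_feats[i], shard_labels[i])
    subposteriors.append(mcmc.get_samples())

# Merge the subposteriors with either method.
merged = consensus(subposteriors, num_draws=10000, rng_key=random.PRNGKey(1))
# merged = parametric_draws(subposteriors, 10000, rng_key=random.PRNGKey(1))

# Predict with every posterior draw in parallel via vmap, then majority-vote.
preds = vmap(lambda c: test_feats @ c > 0.0)(merged["coefs"])
accuracy = jnp.mean((preds.mean(axis=0) > 0.5) == test_labels)
```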

Tasks

  • Get subposteriors and report the result: 77.1% accuracy
  • Train AutoIAFNormal
  • Get NeuTra samples and compare the result. FIXME: using the IAF transform in HMC makes NUTS pretty slow; it took 10 minutes to get samples from one shard, which is very slow compared to vanilla HMC. My last bet is to implement BNAF to see if it helps. (A NeuTra sketch follows this list.)
  • Run NUTS (on GPU) with the full training dataset and compare the result
  • Organize the results in a single notebook, remove summary tables, and add additional metrics such as cross-entropy loss.
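
A hedged sketch of the AutoIAFNormal/NeuTra tasks above, following the pattern in current NumPyro's `NeuTraReparam` docs (the 2019 API differed); the shard variables are reused from the earlier sketch:

```python
from jax import random
import numpyro
from numpyro.infer import MCMC, NUTS, SVI, Trace_ELBO
from numpyro.infer.autoguide import AutoIAFNormal
from numpyro.infer.reparam import NeuTraReparam

# Fit an IAF guide to one shard with SVI...
guide = AutoIAFNormal(model)
svi = SVI(model, guide, numpyro.optim.Adam(1e-3), Trace_ELBO())
svi_result = svi.run(random.PRNGKey(0), 10000, shard_feats[0], shard_labels[0])

# ...then run NUTS on the model reparameterized through the learned flow.
neutra = NeuTraReparam(guide, svi_result.params)
mcmc = MCMC(NUTS(neutra.reparam(model)), num_warmup=500, num_samples=2500)
mcmc.run(random.PRNGKey(1), shard_feats[0], shard_labels[0])

# Map the warped draws back to the original coefficient space.
zs = mcmc.get_samples()["auto_shared_latent"]
samples = neutra.transform_sample(zs)
```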

@fehiepsi fehiepsi added the WIP label Aug 8, 2019
@martinjankowiak (Collaborator) commented

@fehiepsi this is awesome, such compact elegant code!

some suggestions for future iterations:

  • also compute test LLs
  • it might be nice to plot histograms of some of the subposterior sufficient statistics. e.g. coefs[21] seems to hit two different modes in the different subposteriors; similarly with coefs[28], which seems to revert to the prior in some subposteriors but not others
  • it'd be interesting to see what happens when you make the logistic regression a small bayesian neural network, e.g. 55 -> 5 -> 1 instead of 55 -> 1
  • it'd be nice to see how results vary with the number of shards
  • when you do subposteriors + flows it'd be interesting to compare doing the merging in the warped space (which we expect to be more normal) versus doing the merging in the unwarped space (a rough sketch follows this list)
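
Picking up that last suggestion: a rough sketch of merging in the warped space, assuming per-shard `AutoIAFNormal` guides fitted as in the sketches above and that `guide.get_transform(params)` returns the learned flow (`shard_guides`, `shard_params`, and `shard_samples` are hypothetical names):

```python
from jax import random
from numpyro.infer.hmc_util import consensus

# Pull each shard's coefficient draws back into the flow's base space,
# where the subposteriors should look more Gaussian.
warped = []
for guide, params, samples in zip(shard_guides, shard_params, shard_samples):
    flow = guide.get_transform(params)  # base space -> latent space
    warped.append({"coefs": flow.inv(samples["coefs"])})

merged_warped = consensus(warped, num_draws=10000, rng_key=random.PRNGKey(2))

# Push the merged draws forward through one shard's flow; which flow to use
# here is itself a design choice worth comparing.
flow0 = shard_guides[0].get_transform(shard_params[0])
merged = {"coefs": flow0(merged_warped["coefs"])}
```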

@fehiepsi (Member, Author) commented Aug 8, 2019

Thanks a lot @martinjankowiak! All your suggestions are reasonable and wouldn't take much effort to incorporate. I'll address them after finishing the tasks. :)

@fehiepsi (Member, Author) commented

Although this experiment shows that FlowHMC significantly improves the ESS/s of parallel methods, we will aim for a small example instead. This is more or less research work, so I would like to close it for now.

@fehiepsi fehiepsi closed this Jan 14, 2020
@fehiepsi fehiepsi reopened this Sep 1, 2020
@fehiepsi fehiepsi added this to the 0.5.1 milestone Jan 16, 2021
@fehiepsi fehiepsi mentioned this pull request Mar 1, 2021
@fehiepsi fehiepsi modified the milestones: 0.5.1, 0.7 Mar 7, 2021
@fehiepsi fehiepsi removed this from the 0.7 milestone Jul 8, 2021
Linked issue: Tutorial or example on embarrassingly parallel/consensus MCMC