Fix VNNGP with batches #2375

Open · LuhuanWu wants to merge 4 commits into master

Conversation

LuhuanWu (Contributor) commented on Jul 9, 2023:

  • Fix VNNGP in batch settings (issue #2300, "VNNGP with Batches"); see the usage sketch after this list.
  • In addition, set the default jitter value to 1e-3 and the initial variational stddev to 1e-2.
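For context, here is a minimal sketch of the batched VNNGP setup this PR targets, following the model pattern from the GPyTorch VNNGP tutorial. The class name, data shapes, and hyperparameter values (k, training_batch_size, the leading batch size) are illustrative assumptions, not code from this PR; with the fix here, constructing and training a model like this with a leading batch dimension is the scenario being exercised.

import torch
import gpytorch
from gpytorch.models import ApproximateGP
from gpytorch.variational import MeanFieldVariationalDistribution, NNVariationalStrategy

class BatchedVNNGP(ApproximateGP):
    # Illustrative batched VNNGP: a leading batch of independent outputs over M inducing
    # points in d dimensions. In VNNGP the inducing points are the training inputs themselves.
    def __init__(self, inducing_points, k=8, training_batch_size=16):
        batch_shape = inducing_points.shape[:-2]  # e.g. torch.Size([2])
        M = inducing_points.size(-2)
        variational_distribution = MeanFieldVariationalDistribution(M, batch_shape=batch_shape)
        # NNVariationalStrategy builds a nearest-neighbor index at construction time
        # (requires gpytorch's nearest-neighbor backend, e.g. faiss, to be installed)
        variational_strategy = NNVariationalStrategy(
            self, inducing_points, variational_distribution, k=k, training_batch_size=training_batch_size
        )
        super().__init__(variational_strategy)
        self.mean_module = gpytorch.means.ZeroMean(batch_shape=batch_shape)
        self.covar_module = gpytorch.kernels.MaternKernel(nu=2.5, batch_shape=batch_shape)

    def forward(self, x):
        return gpytorch.distributions.MultivariateNormal(self.mean_module(x), self.covar_module(x))

    def __call__(self, x, prior=False, **kwargs):
        # VNNGP routes calls through the variational strategy; calling with x=None during
        # training makes the strategy draw its own minibatch of inducing points
        return self.variational_strategy(x=x, prior=prior, **kwargs)

# usage sketch: a two-batch model over 100 three-dimensional training points
train_x = torch.randn(2, 100, 3)
model = BatchedVNNGP(inducing_points=train_x, k=8, training_batch_size=16)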

@LuhuanWu mentioned this pull request on Jul 9, 2023
@@ -87,8 +90,6 @@ def __init__(
         super().__init__(
             model, inducing_points, variational_distribution, learn_inducing_locations=False, jitter_val=jitter_val
         )
-        # Make sure we don't try to initialize variational parameters - because of minibatching
-        self.variational_params_initialized.fill_(1)

Member commented:

Why did this line get deleted?

         )
+        # initialize with a small variational stddev for quicker conv. of kl divergence
+        self._variational_distribution._variational_stddev.data.copy_(torch.tensor(1e-2))
+        self.variational_params_initialized.fill_(1)

Member commented:

Ahhh okay, I see you've added a new initialization scheme.

LuhuanWu (Contributor, author) commented on Jul 17, 2023:

Yes. In practice I found that the variational standard deviation tends to shrink towards 0 by the end of training, since the inducing points coincide with the data points. If it is initialized with ones, the KL term is far larger than the log-likelihood term, so the ELBO takes a long time to converge. Initializing with a smaller value speeds up convergence.
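As a usage note: because the constructor marks the variational parameters as initialized (the fill_(1) line in the diff above), the strategy should not overwrite them later, so a user who prefers a different starting scale can set it manually right after building the model. A minimal sketch, assuming a VNNGP model exposing the same attributes touched in this diff (e.g. the illustrative model from the PR description above):

# overwrite the 1e-2 default chosen in this PR with another starting stddev, e.g. 0.1;
# this mirrors the initialization line added in the diff above
strategy = model.variational_strategy
strategy._variational_distribution._variational_stddev.data.fill_(0.1)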

Member commented:

For the future, it's good to list all of these little changes in the PR description as well :)

@@ -266,78 +298,121 @@ def _firstk_kl_helper(self) -> Tensor:
         variational_inducing_covar = DiagLinearOperator(variational_covar_fisrtk)
 
         variational_distribution = MultivariateNormal(inducing_values, variational_inducing_covar)
         kl = torch.distributions.kl.kl_divergence(variational_distribution, prior_dist)  # model_batch_shape
+        with settings.max_preconditioner_size(0):

Member commented:

Why is this setting necessary? Can you add a comment in code?

LuhuanWu (Contributor, author) commented on Jul 17, 2023:

I was following the KL computation in _variational_strategy, see this line. What comment do you think is suitable here? Or do you suggest removing this line?

Member commented:

Let's get rid of it for now; I don't think it's necessary. We might eventually want to remove it from _variational_strategy as well, but that's not needed for this PR.

@@ -359,5 +434,7 @@ def _compute_nn(self) -> "NNVariationalStrategy":
         with torch.no_grad():
             inducing_points_fl = self.inducing_points.data.float()
             self.nn_util.set_nn_idx(inducing_points_fl)
-            self.nn_xinduce_idx = self.nn_util.build_sequential_nn_idx(inducing_points_fl)
+            if self.k < self.M:
+                self.nn_xinduce_idx = self.nn_util.build_sequential_nn_idx(inducing_points_fl)

Member commented:

Do we have a test in code for the k < M case? If not, can you add one?

LuhuanWu (Contributor, author) commented:

Yeah, I think k = 3 in the test code, which is smaller than M. I could add a k = M case later; that case is currently missing.
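For reference, one possible shape for the missing k = M case, as a hedged sketch: the test class and setup below are illustrative, not the repo's actual test scaffolding. The idea is simply to build a VNNGP model with k equal to the number of inducing points, so that _compute_nn skips build_sequential_nn_idx, and check that a training iteration still runs.

import unittest
import torch
import gpytorch
from gpytorch.models import ApproximateGP
from gpytorch.variational import MeanFieldVariationalDistribution, NNVariationalStrategy

class TestVNNGPKEqualsM(unittest.TestCase):  # hypothetical test class, not in the repo
    def test_training_iteration_k_equals_m(self):
        M, d = 16, 2
        train_x = torch.randn(M, d)
        train_y = torch.randn(M)

        class VNNGPModel(ApproximateGP):
            def __init__(self, inducing_points, k, training_batch_size):
                dist = MeanFieldVariationalDistribution(inducing_points.size(-2))
                strategy = NNVariationalStrategy(
                    self, inducing_points, dist, k=k, training_batch_size=training_batch_size
                )
                super().__init__(strategy)
                self.mean_module = gpytorch.means.ZeroMean()
                self.covar_module = gpytorch.kernels.MaternKernel(nu=2.5)

            def forward(self, x):
                return gpytorch.distributions.MultivariateNormal(self.mean_module(x), self.covar_module(x))

            def __call__(self, x, prior=False, **kwargs):
                return self.variational_strategy(x=x, prior=prior, **kwargs)

        model = VNNGPModel(train_x, k=M, training_batch_size=M)  # k == M: the currently untested branch
        likelihood = gpytorch.likelihoods.GaussianLikelihood()
        mll = gpytorch.mlls.VariationalELBO(likelihood, model, num_data=M)

        model.train()
        output = model(x=None)  # VNNGP draws its own training minibatch of inducing points
        idx = model.variational_strategy.current_training_indices
        loss = -mll(output, train_y[idx])
        loss.backward()  # should run without raising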

@@ -115,7 +115,7 @@ def _training_iter(
         return output, loss
 
     def _eval_iter(self, model, cuda=False):
-        inducing_batch_shape = model.variational_strategy.inducing_points.shape[:-2]
+        inducing_batch_shape = model.variational_strategy._inducing_batch_shape

Member commented:

Can you add a unit test that runs VNNGP with batches? AFAIK, this unit test still only runs for non-batched VNNGP.

LuhuanWu (Contributor, author) commented:

I think test_training_iteration_batch_model at line 194 already tests the batch model.

Member commented:

@LuhuanWu why wasn't test_training_iteration_batch_model failing before this PR then?
Ideally we want to have a test case that (1) would've failed before this PR was added (capturing the behavior described in #2300) but (2) does not fail with the new code in this PR.

LuhuanWu (Contributor, author) commented:

Good point. I'll look into that.
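One possible shape for that regression test, again as a hedged sketch with illustrative names: it just pushes a batched VNNGP model (batched inducing points and variational distribution) through a training iteration, which per this thread is expected to fail on master before this PR and to pass with it.

import unittest
import torch
import gpytorch
from gpytorch.models import ApproximateGP
from gpytorch.variational import MeanFieldVariationalDistribution, NNVariationalStrategy

class TestVNNGPBatchRegression(unittest.TestCase):  # hypothetical test class, not in the repo
    def test_batched_training_iteration(self):
        b, M, d, k = 2, 32, 3, 4
        train_x = torch.randn(b, M, d)
        train_y = torch.randn(b, M)

        class BatchVNNGP(ApproximateGP):
            def __init__(self, inducing_points):
                batch_shape = inducing_points.shape[:-2]
                dist = MeanFieldVariationalDistribution(inducing_points.size(-2), batch_shape=batch_shape)
                strategy = NNVariationalStrategy(self, inducing_points, dist, k=k, training_batch_size=8)
                super().__init__(strategy)
                self.mean_module = gpytorch.means.ZeroMean(batch_shape=batch_shape)
                self.covar_module = gpytorch.kernels.MaternKernel(nu=2.5, batch_shape=batch_shape)

            def forward(self, x):
                return gpytorch.distributions.MultivariateNormal(self.mean_module(x), self.covar_module(x))

            def __call__(self, x, prior=False, **kwargs):
                return self.variational_strategy(x=x, prior=prior, **kwargs)

        model = BatchVNNGP(train_x)
        likelihood = gpytorch.likelihoods.GaussianLikelihood(batch_shape=torch.Size([b]))
        mll = gpytorch.mlls.VariationalELBO(likelihood, model, num_data=M)

        model.train()
        output = model(x=None)
        # assumes the same minibatch indices are shared across the batch dimension
        idx = model.variational_strategy.current_training_indices
        loss = -mll(output, train_y[..., idx]).sum()  # the ELBO carries a batch dimension here
        loss.backward()  # should fail before this PR and pass with it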
