marg_MAP option to use LBFGS hessians #48

kimmywu · 2021-01-21T07:49:37Z

This PR adds the option for marg_MAP to use LBFGS Hessians updates at each step, as well as terminating the MAP estimate either at nstep or when ϕtol is reached.

kimmywu · 2021-01-21T07:54:23Z

src/maximization.jl

@@ -311,6 +311,14 @@ function MAP_marg(
            diffϕ=sum(unbatch(norm(LowPass(1000) * (sqrt(ds.Cϕ) \ (ϕ - lastϕ))) / sqrt(2length(ϕ))))
        end   

+        push!(history, select((;g,ϕ,lastHg=Map(lastHg),diffϕ), history_keys))


@marius311: For some unknown reason, converting to Map using Map(lastHg) is needed for the output lastHg to be scaled correctly when hess_method="lbfgs-hessian." Otherwise, it is orders of magnitude off (and looks like it's coming from a scale factor) when passed to history. It has the correct amplitude when applied to ϕ in the code.

Hmm I don't really remember tbh but glancing at

CMBLensing.jl/src/maximization.jl

Lines 200 to 212 in ec4bd10

ϕ, = @⌛ optimize(

objective,

Map(ϕ),

OptimKit.LBFGS(

lbfgs_rank;

maxiter = nsteps,

verbosity = verbosity[1],

linesearch = OptimKit.HagerZhangLineSearch(verbosity=verbosity[2], maxiter=5)

);

finalize!,

inner = (_,ξ1,ξ2)->sum(unbatch(dot(ξ1,ξ2))),

precondition = (_,η)->Map(Hϕ⁻¹*η),

)

looks like I also have some Maps. I think in theory you could get rid of that by defining more of the things OptimKit needs like retract, inner (that one you already are), scale!, add!, and transport! as mentioned in their readme, although my guess is performance-wise the extra Map don't really matter so its probably fine.

I did see that you pass in Map-converted field variables in MAP_joint for the optimization in places. I tested both and in my case, they yield the same results with or without passing a Map-converted field and figure to just go without to reduce the back-and-forth.

Agreed that it doesn't slow down the code. But it does mean it's storing a larger vector (Map vs Fourier), and it's not the same (Fourier)type as the rest of the return keys. So I want to see if you already know of similar peculiar behavior. Or if this is a corner case, that I run into.

kimmywu added 2 commits January 20, 2021 22:25

use lbfgs-hessian for MAP_marg

3c6a359

move history; workaround of lastHg bug in history

6ab01bf

kimmywu commented Jan 21, 2021

View reviewed changes

marius311 force-pushed the master branch from 85a63d8 to 4fc14e9 Compare April 14, 2021 08:36

marius311 force-pushed the master branch from 43316b9 to a445f9a Compare April 28, 2021 18:47

marius311 force-pushed the master branch 2 times, most recently from bb06adc to 9f016a7 Compare January 6, 2023 01:43

marius311 force-pushed the master branch from 0374568 to 9b821d4 Compare January 20, 2023 18:03

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

marg_MAP option to use LBFGS hessians #48

marg_MAP option to use LBFGS hessians #48

kimmywu commented Jan 21, 2021

kimmywu Jan 21, 2021 •

edited

marius311 Jan 21, 2021

kimmywu Jan 21, 2021

	ϕ, = @⌛ optimize(
	objective,
	Map(ϕ),
	OptimKit.LBFGS(
	lbfgs_rank;
	maxiter = nsteps,
	verbosity = verbosity[1],
	linesearch = OptimKit.HagerZhangLineSearch(verbosity=verbosity[2], maxiter=5)
	);
	finalize!,
	inner = (_,ξ1,ξ2)->sum(unbatch(dot(ξ1,ξ2))),
	precondition = (_,η)->Map(Hϕ⁻¹*η),
	)

marg_MAP option to use LBFGS hessians #48

Are you sure you want to change the base?

marg_MAP option to use LBFGS hessians #48

Conversation

kimmywu commented Jan 21, 2021

kimmywu Jan 21, 2021 • edited

Choose a reason for hiding this comment

marius311 Jan 21, 2021

Choose a reason for hiding this comment

kimmywu Jan 21, 2021

Choose a reason for hiding this comment

kimmywu Jan 21, 2021 •

edited