Adaptive-depth anderson #939
base: master
Conversation
Force-pushed from de1f480 to a9727e1.
From a few experiments, just limiting the window size using adaptive-depth Anderson is rarely a good idea, as the obtained sizes can get too large to be practical. Thus we for sure need both a maximal […]. For now I've left […]
Yeah, I think mmax is always going to make sense. I have to reread the paper, but I think I remember that it's just an alternative way to monitor the condition number (both are essentially measuring the size of the extrapolation coefficients). I think the version of the code in master is basically sound. @msdupuy might know better. One thing I wanted to explore was to annotate each residual vector with an error bar, coming additively both from the approximate eigensolve (which we can try to estimate from the eigenvalue residuals) and from some estimate of the nonlinear effects (which we could maybe try to model as an isotropic quadratic), and take that into account in the least squares. I never got around to it, but do ping me if you're interested in exploring this kind of thing further.
So you suggest we don't do this PR at all? I mean, what I like about it is that it catches the issues before computing the considerably more costly dot products, and it overall seems to do a better job of shrinking the history when needed (from the few experiments I did).
Hmm. Are you sure that would be worth it, given that it probably turns out to be quite crude (especially closer to convergence)? Or was your thinking to only employ this for the initial phase, to avoid Anderson putting too much trust into a potentially sketchy result? For that it could be useful, given that the bounds are not completely random. How do you want to get the insight on the isotropic quadratic — just from the points the SCF follows anyway? Happy to discuss at some point.
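One way the error-bar idea floated above could be prototyped: treat each history column's assumed error estimate as a Tikhonov penalty on its coefficient, so the least squares trusts noisy residuals less. This is only an illustrative sketch in Python/NumPy (the package itself is Julia); the function name and the per-column `sigma` estimates are hypothetical, not part of any existing API:

```python
import numpy as np

def error_weighted_coeffs(M, r, sigma):
    """Anderson coefficients from min_beta ||M beta + r||^2 + ||diag(sigma) beta||^2.

    M:     (n, m) matrix of residual differences; r: (n,) current residual.
    sigma: (m,) assumed per-column error bars (hypothetical estimates).
    A large sigma[i] pushes beta[i] toward 0, i.e. that history entry
    is distrusted by the extrapolation.
    """
    m = M.shape[1]
    A = np.vstack([M, np.diag(sigma)])       # augmented least-squares system
    b = np.concatenate([r, np.zeros(m)])
    beta, *_ = np.linalg.lstsq(A, -b, rcond=None)
    return beta
```

With `sigma = 0` this reduces to the plain least squares already in the code; with large `sigma` the corresponding coefficients are damped toward zero, which is one concrete way to get "something in between" keep and discard.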
Force-pushed from 5330a6d to a2b18e9.
@antoine-levitt Thoughts on the above?
# We need to solve 0 = M' Pfxₙ + M'M βs <=> βs = -(M'M)⁻¹ M' Pfxₙ
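For what it's worth, the identity in that comment is just the normal equations of a least-squares problem, so βs can be obtained without ever forming M'M. A quick NumPy check (illustrative names, not the actual Julia code):

```python
import numpy as np

rng = np.random.default_rng(0)
M = rng.standard_normal((6, 3))    # stand-in for the residual-difference matrix
Pfx = rng.standard_normal(6)       # stand-in for the preconditioned residual Pfxₙ

# Normal-equation form from the comment: βs = -(M'M)⁻¹ M' Pfxₙ
beta_normal = -np.linalg.solve(M.T @ M, M.T @ Pfx)

# Equivalent, better-conditioned form: least squares on M βs ≈ -Pfxₙ
beta_lstsq, *_ = np.linalg.lstsq(M, -Pfx, rcond=None)
```

Forming M'M explicitly squares the condition number, which is one reason the maxcond check operates on a QR factorization of M directly.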
# Ensure the condition number of M stays below maxcond, else prune the history
# TODO This is too be tested, but in theory the adaptive-depth DIIS mechanism
typo
# Ensure the condition number of M stays below maxcond, else prune the history
# TODO This is too be tested, but in theory the adaptive-depth DIIS mechanism
# we implement, should ensure the condition number to stay bounded as well.
no comma
# we implement, should ensure the condition number to stay bounded as well.
Mfac = qr(M)
while size(M, 2) > 1 && cond(Mfac.R) > anderson.maxcond
    M = M[:, 2:end] # Drop oldest entry in history
view?
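As quoted, the loop drops columns but doesn't show the QR factorization being recomputed after each drop; the intended mechanism, with the factorization refreshed each iteration, looks roughly like this NumPy transcription (illustrative, not the DFTK source):

```python
import numpy as np

def prune_oldest(M, maxcond):
    """Drop the oldest (leftmost) history columns until cond(R) <= maxcond,
    keeping at least one column. Mirrors the quoted Julia loop, but
    recomputes the QR factorization after every drop."""
    while M.shape[1] > 1:
        R = np.linalg.qr(M, mode='r')
        if np.linalg.cond(R) <= maxcond:
            break
        M = M[:, 1:]   # drop oldest entry in history
    return M
```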
Merge the two mechanisms? I.e. drop the vectors with the biggest errors (instead of the oldest ones) until the conditioning decreases below the acceptable threshold. For safety, don't drop, say, the last iterate and the one before.
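A sketch of that merged mechanism (illustrative Python/NumPy, not DFTK code; `errors` is an assumed per-column error estimate, and the two newest columns are protected as suggested):

```python
import numpy as np

def prune_worst(M, errors, maxcond):
    """Drop the column with the largest error estimate (never the two newest,
    assumed to be the rightmost columns) until cond(M) <= maxcond."""
    errors = list(errors)
    while M.shape[1] > 2 and np.linalg.cond(M) > maxcond:
        worst = int(np.argmax(errors[:-2]))  # protect the two newest entries
        M = np.delete(M, worst, axis=1)
        errors.pop(worst)
    return M
```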
Are they really expensive? What I liked in the current version is its simplicity, but if dropping based on residual norm is better then sure, go ahead. Also it might be better to monitor the norm of the extrapolation coefficients rather than the conditioning. Idk, there are so many possibilities it scares me.
The point is to have something a bit more principled than the binary keep/discard mechanism; instead of having a weight of 1 (keep) or 0 (discard), have something in between. But it's pretty hazy how to do this properly, so more research is needed (tm).
Supersedes #719.
Todo: