-
Notifications
You must be signed in to change notification settings - Fork 174
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Jacobian chunking in VMC_SRt
#1740
base: master
Are you sure you want to change the base?
Conversation
What is wrong with #1590? It works well modulo a typo in (On a side note, I was thinking just now that maybe a better name would be
I would be much happier if this was based of #1590, and |
Codecov ReportAttention: Patch coverage is
Additional details and impacted files@@ Coverage Diff @@
## master #1740 +/- ##
==========================================
- Coverage 82.73% 82.68% -0.05%
==========================================
Files 299 299
Lines 18271 18293 +22
Branches 3480 3487 +7
==========================================
+ Hits 15116 15126 +10
- Misses 2476 2487 +11
- Partials 679 680 +1 ☔ View full report in Codecov by Sentry. |
It's not merged. Also, it only implements a different backprop chunk size for
You have to discuss with @chrisrothUT
That would be nice, but in the meanwhile it doesn't hurt to make
No, |
Yes and no.
It does not, because it is not finished, but it introduces a standard way, consistent with what is there already to set the chunk size of backpropagations. It is trivial then to default to
|
Ah okay, I thought you had some problem with doing chunking in general. I don't have time to finish #1590, but it would be important to support chunking in BTW, I think |
I don't have time for this either, but we should think about whether this is actually feasible/worth it. |
Some neural networks require much more memory for backpropagation, which makes it useful to allow chunking (with smaller chunks) when computing the Jacobian. (cf. #1347, #1590)
This PR therefore adds Jacobian chunking to
VMC_SRt
. IIUC all this takes is to add an extra argument tonkjax.jacobian
and copying some logic fromMCState
to set safe chunk sizes.Outstanding questions:
VMC_SRt
take the vstate's chunk size (like the QGT objects) or not chunk at all?