Reduce memory usage in gKDR #244

ots22 · 2022-09-05T15:53:56Z

Fixes #242.

The current implementation of gKDR uses array computations exclusively. A drawback is that this creates some large temporary arrays (the largest has size N^2 M^2, where N is the number of observations and M is the number of features).

This PR replaces one of these array computations with an explicit loop, reducing the space used to O(M^2 + M N^2).
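As a rough illustration of the trade-off (not the actual gKDR code), the sketch below computes a weighted sum of outer products in two ways: once with broadcasting, which materialises an (N, N, M, M) temporary, and once with an explicit loop that only accumulates into an (M, M) array. The function names, `D`, and `W` are hypothetical stand-ins chosen for this example.

```python
import numpy as np

def weighted_outer_sum_broadcast(D, W):
    """All-at-once version: builds (N, N, M, M) temporaries."""
    outer = D[:, :, :, None] * D[:, :, None, :]        # shape (N, N, M, M)
    return (W[:, :, None, None] * outer).sum(axis=(0, 1))

def weighted_outer_sum_loop(D, W):
    """Loop version: extra storage is an (M, M) accumulator plus (N, M) temporaries."""
    N, _, M = D.shape
    R = np.zeros((M, M))
    for i in range(N):
        # D[i] has shape (N, M) and W[i] has shape (N,), so this adds
        # sum_j W[i, j] * outer(D[i, j], D[i, j]) to the accumulator.
        R += (W[i][:, None] * D[i]).T @ D[i]
    return R

# Hypothetical data: N observations with M features.
rng = np.random.default_rng(0)
N, M = 20, 4
X = rng.normal(size=(N, M))
D = X[:, None, :] - X[None, :, :]    # pairwise differences, shape (N, N, M)
W = rng.normal(size=(N, N))          # stand-in weights

R_broadcast = weighted_outer_sum_broadcast(D, W)
R_loop = weighted_outer_sum_loop(D, W)
```

Both functions return the same (M, M) result. The loop keeps the (N, N, M) input around but never allocates anything of size N^2 M^2, which is the kind of saving described above.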

It also replaces two calls to np.linalg.solve with calls to cho_solve (and a single Cholesky decomposition).

- Possible since the regularized Gram matrix is Hermitian
- Use some more descriptive variable names
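A minimal sketch of the solver change, assuming a stand-in Gram matrix and a hypothetical regularization constant `eps` (neither is taken from the gKDR source): the matrix is factorized once with `scipy.linalg.cho_factor`, and the factorization is reused for two right-hand sides via `cho_solve`. Note that Cholesky factorization needs the matrix to be positive definite as well as Hermitian; adding a positive multiple of the identity to a Gram matrix provides that.

```python
import numpy as np
from scipy.linalg import cho_factor, cho_solve

rng = np.random.default_rng(0)
N = 50

# Stand-in Gram matrix: symmetric positive semi-definite by construction.
A = rng.normal(size=(N, 5))
K = A @ A.T

# Hypothetical regularization; makes the matrix positive definite.
eps = 1e-2
K_reg = K + eps * np.eye(N)

# Two right-hand sides that share the same system matrix.
B1 = rng.normal(size=(N, 3))
B2 = rng.normal(size=(N, 3))

# General solver: each call factorizes K_reg from scratch.
X1_general = np.linalg.solve(K_reg, B1)
X2_general = np.linalg.solve(K_reg, B2)

# Cholesky: factorize once, then reuse the factor for both solves.
factor = cho_factor(K_reg)
X1_chol = cho_solve(factor, B1)
X2_chol = cho_solve(factor, B2)
```

`cho_factor` returns the factor together with a flag saying whether it is lower or upper triangular, and that pair can be passed straight to `cho_solve`.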

edaub

Looks good. If you merge this into main, please bump the version number and also merge into devel. (If the fix is less urgent feel free to merge into devel instead and we'll make it part of the next release.)

ots22 · 2022-10-03T11:53:16Z

@edaub - thanks for the review - I've now finally resolved your comments above. I've left the target of the PR as main, and left the version number for you to bump, as discussed.

edaub · 2022-11-04T15:16:32Z

I've decided to put the three currently outstanding bugfixes into a single bugfix release, 0.7.2, so I'm changing the base branch to collect them all on a single temporary branch prior to merging. I'll also merge this into devel. I don't know if this will fix our issue with main and devel not knowing their common origin, but I'll try and figure this out separately.

* added bugfix to delete GPU .so file committed to devel to bugfix release on main
* Update documentation to reflect patsy requirement (#245)

  A previous major upgrade added patsy as a requirement rather than an optional dependency. The documentation (readme and installation page on the sphinx docs) now have been updated to reflect this.
* Reduce memory usage in gKDR (#244)
* Test that a larger gKDR example can run
* Reduce memory footprint in gKDR with an explicit loop
* Use cholesky/cho_solve instead of general linear solver
  - Possible since the regularized Gram matrix is Hermitian
  - Use some more descriptive variable names
* Add explanatory docstring
* Generate deterministic test data for 'large' gKDR test
* Additional docstrings
* Use cho_factor instead of cholesky, which simplifies immediate calls to cho_solve

Co-authored-by: ots22 <ots22@users.noreply.github.com>

ots22 added 3 commits September 5, 2022 15:38

Test that a larger gKDR example can run

07ae00d

Reduce memory footprint in gKDR with an explicit loop

b76265f

Use cholesky/cho_solve instead of general linear solver

d42b003

- Possible since the regularized Gram matrix is Hermitian
- Use some more descriptive variable names

ots22 requested a review from edaub September 5, 2022 15:54

edaub approved these changes Sep 7, 2022

mogp_emulator/tests/test_DimensionReduction.py (resolved)

mogp_emulator/DimensionReduction.py (outdated, resolved)

ots22 added 4 commits October 3, 2022 12:40

Add explanatory docstring

fe60400

Generate deterministic test data for 'large' gKDR test

85f1fd2

Additional docstrings

c3c89e5

Use cho_factor instead of cholesky, which simplifies immediate calls to cho_solve

68aae20

edaub changed the base branch from main to v0.7.2rc November 4, 2022 15:13

edaub merged commit 5c3c132 into v0.7.2rc Nov 4, 2022

edaub deleted the gkdr-mem branch November 4, 2022 15:16
