
Linear interpolators updates: refactor and new functionality #4710

Open · wants to merge 11 commits into main
Conversation

chrishavlin (Contributor):

I started using the linear interpolators yesterday and for my purposes it'd be a tad easier to re-use the interpolator objects but swap out different table data, so this PR adds that functionality. To do that, though, it was easier to refactor the interpolators to have a common base class (and cut out some repetitive code structures in the process). Everything here should be fully backwards compatible. I'm also happy to split this PR to pull the refactor out from the new functionality (but the new functionality is pretty simple; most of the PR is the refactor).
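
A minimal sketch of the re-use pattern this PR is after. The construction below follows yt's existing public interpolator API; the table-swapping call at the end uses a hypothetical method name to illustrate the new functionality, not the PR's confirmed API:

```python
import numpy as np
from yt.utilities.linear_interpolators import UnilinearFieldInterpolator

x_bins = np.linspace(0.0, 1.0, 64)
table_a = np.sin(2 * np.pi * x_bins)

# build once: bins, field name, and truncation behavior are fixed here
fi = UnilinearFieldInterpolator(table_a, (0.0, 1.0), "x", truncate=True)
vals_a = fi({"x": np.array([0.25, 0.5, 0.75])})

# with this PR, the idea is to keep the same setup and only swap in a new
# table, e.g. (hypothetical method name):
# fi.update_table(np.cos(2 * np.pi * x_bins))
```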

@chrishavlin added labels enhancement (Making something better) and refactor (improve readability, maintainability, modularity) on Oct 20, 2023
chrishavlin (Contributor, Author) commented on Oct 20, 2023:

Note that the test for the 4D interpolator is kind of slow because it uses quite a big array (random_data = np.random.random((64, 64, 64, 64))). I'm inclined to drop that down in size to speed it up; I can do that after an initial review (and after the tests pass as-is).

matthewturk (Member):

Hmm, how slow is it? Is it a lot slower than doing a 3D interpolator on (256, 256, 256) data?

chrishavlin (Contributor, Author):

Will check if it's slower than the 3D case, but 4D with (64, 64, 64, 64) took maybe 4-6 seconds, if I remember correctly.

chrishavlin (Contributor, Author):

@matthewturk so the 3D and 4D scale at the same rate with the total array size:
[figure: lin_interp_scaling, interpolation time vs. total array size for the 3D and 4D interpolators]

Code is here: https://gist.github.com/chrishavlin/9370e2a4a1895745a40c46f39d6c44f4

The full test times (pytest --durations=10 yt/utilities/tests/test_interpolators.py):

```
3.20s call     yt/utilities/tests/test_interpolators.py::test_linear_interpolator_4d
0.31s call     yt/utilities/tests/test_interpolators.py::test_ghost_zone_extrapolation
0.25s call     yt/utilities/tests/test_interpolators.py::test_get_vertex_centered_data
0.05s setup    yt/utilities/tests/test_interpolators.py::test_linear_interpolator_1d
0.04s call     yt/utilities/tests/test_interpolators.py::test_linear_interpolator_3d
```

So only ~3 seconds, not as bad as I remembered, but two orders of magnitude longer than the other interpolator tests (and I don't see a particular reason to test with N=64 vs., say, N=16).
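
A minimal timing sketch in the spirit of the gist: compare one 3D and one 4D interpolation at equal total table size (256**3 == 64**4). This assumes yt's public interpolator classes (including QuadrilinearFieldInterpolator for 4D); the number of query points is illustrative, not taken from the actual tests:

```python
import time

import numpy as np
from yt.utilities.linear_interpolators import (
    QuadrilinearFieldInterpolator,
    TrilinearFieldInterpolator,
)

# equal total element counts: 256**3 == 64**4 == 16_777_216
data3 = np.random.random((256, 256, 256))
data4 = np.random.random((64, 64, 64, 64))

fi3 = TrilinearFieldInterpolator(data3, (0.0, 1.0) * 3, ["x", "y", "z"], truncate=True)
fi4 = QuadrilinearFieldInterpolator(data4, (0.0, 1.0) * 4, ["x", "y", "z", "w"], truncate=True)

# the same query points for both; the 3D interpolator ignores the "w" key
queries = {ax: np.random.random(100_000) for ax in "xyzw"}
for label, fi in (("3D", fi3), ("4D", fi4)):
    t0 = time.perf_counter()
    fi(queries)
    print(f"{label}: {time.perf_counter() - t0:.3f} s")
```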

matthewturk (Member) previously approved these changes on Nov 2, 2023, commenting:

I think as long as the tests pass, and the API is consistent/unbroken if not unchanged, this is good.

neutrinoceros (Member) left a comment:

Overall this looks sound. I left a couple of minor questions and suggestions.

(Several review threads on yt/utilities/tests/test_interpolators.py and yt/utilities/linear_interpolators.py are now outdated and resolved.)
Review comment on yt/utilities/linear_interpolators.py:

```
def __init__(self, table, boundaries, field_names, truncate=False):
class _LinearInterpolator(abc.ABC):
    _ndim: int
    _dim_i_type = "int32"
```
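
A rough sketch of the common-base-class pattern under discussion. Only the names visible in the diff above (_LinearInterpolator, _ndim, _dim_i_type) come from the PR; everything else here is an assumed illustration of the idea, not the PR's actual implementation:

```python
import abc

import numpy as np

class _LinearInterpolator(abc.ABC):
    _ndim: int             # set by each concrete subclass (1, 2, 3, or 4)
    _dim_i_type = "int32"  # dtype of the index arrays handed to the cython layer

    def __init__(self, table, truncate=False):
        # shared validation that previously would have been repeated
        # in each interpolator class
        self.table = np.asarray(table, dtype="float64")
        if self.table.ndim != self._ndim:
            raise ValueError(f"Expected a {self._ndim}D table, got {self.table.ndim}D")
        self.truncate = truncate

class TrilinearExample(_LinearInterpolator):
    _ndim = 3
```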
neutrinoceros (Member):

Wow, I'm sure this is just your refactor making it more obvious, but do you have any idea why we are not using the same dtype in all interpolators?

chrishavlin (Contributor, Author):

I do not know :) I meant to look more closely; maybe there's some reason at the cython level? Or...?

chrishavlin (Contributor, Author):

Ok, I looked back, and the superficial reason is that the cython functions for the different interpolators have different int declarations in their call signatures: UnilinearlyInterpolate and BilinearlyInterpolate are declared with int32_t, e.g.:

```
def BilinearlyInterpolate(np.ndarray[np.float64_t, ndim=2] table,
                          np.ndarray[np.float64_t, ndim=1] x_vals,
                          np.ndarray[np.float64_t, ndim=1] y_vals,
                          np.ndarray[np.float64_t, ndim=1] x_bins,
                          np.ndarray[np.float64_t, ndim=1] y_bins,
                          np.ndarray[np.int32_t, ndim=1] x_is,
                          np.ndarray[np.int32_t, ndim=1] y_is,
                          np.ndarray[np.float64_t, ndim=1] output):
```

while TrilinearlyInterpolate and QuadrilinearlyInterpolate use int_t:

```
def TrilinearlyInterpolate(np.ndarray[np.float64_t, ndim=3] table,
                           np.ndarray[np.float64_t, ndim=1] x_vals,
                           np.ndarray[np.float64_t, ndim=1] y_vals,
                           np.ndarray[np.float64_t, ndim=1] z_vals,
                           np.ndarray[np.float64_t, ndim=1] x_bins,
                           np.ndarray[np.float64_t, ndim=1] y_bins,
                           np.ndarray[np.float64_t, ndim=1] z_bins,
                           np.ndarray[np.int_t, ndim=1] x_is,
                           np.ndarray[np.int_t, ndim=1] y_is,
                           np.ndarray[np.int_t, ndim=1] z_is,
                           np.ndarray[np.float64_t, ndim=1] output):
```

As to why they differ, I'm really not sure...

I could try switching UnilinearlyInterpolate and BilinearlyInterpolate over to use int_t as well?
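
For context, a quick check of why the two declarations are not interchangeable: cython's np.int_t maps to the platform C long (NumPy's np.int_), while np.int32_t is always 32-bit, so an index array built with the wrong dtype raises a buffer dtype mismatch when passed to the typed signature:

```python
import numpy as np

# np.int32 is 4 bytes on every platform
print(np.dtype(np.int32).itemsize)  # 4

# np.int_ is usually 8 bytes on 64-bit Linux/macOS,
# and 4 bytes on Windows with NumPy < 2
print(np.dtype(np.int_).itemsize)
```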

neutrinoceros (Member):

> I could try switching UnilinearlyInterpolate and BilinearlyInterpolate over to use int_t as well?

Now might be a good time to try indeed, but it's not critical!

chrishavlin (Contributor, Author):

Just switched all the interpolators to use int_t.

chrishavlin (Contributor, Author):

Oh, one more push coming... I'm going to go through all the other tests and make the booleans explicit in the interpolator calls.
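
What making the booleans explicit looks like at a call site, as a sketch using yt's existing 1D interpolator signature:

```python
import numpy as np
from yt.utilities.linear_interpolators import UnilinearFieldInterpolator

table = np.random.random(16)

# before: a bare positional boolean is easy to misread
fi = UnilinearFieldInterpolator(table, (0.0, 1.0), "x", True)

# after: the keyword makes the intent obvious
fi = UnilinearFieldInterpolator(table, (0.0, 1.0), "x", truncate=True)
```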

chrishavlin (Contributor, Author):

Ok, I just pushed up a change that adds keywords to all the other test calls, and I did drop the resolution in the 4D test case to (32, 32, 32, 32) (rather than (64, 64, 64, 64)).

Pretty sure the only remaining question is that of the differing types for the index array inputs to the cython interpolators: #4710 (comment)
