Tenalg einsum backend: cache contraction equation #459
Conversation
Move unfolding_dot_khatri_rao to tenalg, add einsum version
Codecov Report
@@ Coverage Diff @@
## main #459 +/- ##
==========================================
- Coverage 86.84% 86.39% -0.46%
==========================================
Files 118 119 +1
Lines 7313 7357 +44
==========================================
+ Hits 6351 6356 +5
- Misses 962 1001 +39
Thanks for adding this @JeanKossaifi, I will try to look at it asap but that's slightly outside my expertise.
    try:
        equation = cache[key]
    except KeyError:
        equation = fun(*args, **kwargs)
        cache[key] = equation
Is there a reason why you use try/except instead of checking if the key is in the cache beforehand? Is that more efficient?
Yes, fetching the item directly is a single O(1) dictionary lookup, whereas checking for the key first and then fetching it performs two lookups. The overhead of the try/except is minimal and only happens at the first call (the cache miss), so the overall cost is a single O(1) lookup per call.
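To make the trade-off concrete, here is a hypothetical illustration of the EAFP ("easier to ask forgiveness than permission") caching pattern being discussed; the names (`get_or_compute`, `cache`) are made up for the example and are not taken from the PR:

```python
# A minimal sketch of the try/except caching idiom: one dict lookup on a
# hit, and the compute function runs only once per key.
cache = {}

def get_or_compute(key, compute):
    """Return the cached value for key, computing and storing it on a miss."""
    try:
        return cache[key]          # single dict lookup on every hit
    except KeyError:
        value = compute()          # only runs on the first call per key
        cache[key] = value
        return value

calls = []
get_or_compute("ij,jk->ik", lambda: calls.append(1) or "equation")
result = get_or_compute("ij,jk->ik", lambda: calls.append(1) or "equation")
```

The alternative, `if key in cache: return cache[key]`, is also O(1) per lookup but touches the dictionary twice on every hit; the try/except version pays its (small) exception cost only on the miss.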
    @einsum_path_cached
    def inner_path(tensor1, tensor2, n_modes=None):
Will that not set key=tensor1? Will that give the correct behaviour for the cache?
No, the key is only used inside the wrapper to retrieve the cached version; it is not actually passed to the wrapped function. I need to document that clearly.
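One possible reading of this comment, sketched under my own assumptions (this is not TensorLy's actual implementation, and `make_equation` is a hypothetical stand-in for the real equation-building function): the wrapper consumes an explicit key argument for the cache and forwards only the remaining arguments to the wrapped function.

```python
from functools import wraps

def einsum_path_cached(fun):
    """Sketch of a caching decorator: the first positional argument is the
    cache key, which the wrapper consumes and does NOT pass on to fun."""
    cache = {}

    @wraps(fun)
    def wrapper(key, *args, **kwargs):
        try:
            equation = cache[key]
        except KeyError:
            equation = fun(*args, **kwargs)  # key is not forwarded
            cache[key] = equation
        return equation

    return wrapper

@einsum_path_cached
def make_equation(n_modes):
    # Stand-in for the real equation generation: join one letter per mode.
    return ",".join("xyzw"[:n_modes])

eq1 = make_equation("key-3", 3)   # miss: make_equation(3) runs
eq2 = make_equation("key-3", 3)   # hit: returned straight from the cache
```

Under this reading, the reviewer's concern (that `key` silently becomes `tensor1`) only arises if a caller forgets that the decorated function's signature gained an extra leading argument, which is exactly the kind of thing worth documenting.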
I did a quick pass on some of the files (nothing big, didn't download and run the code) :)
Thanks @yngvem, great points, as always! :)
Co-authored-by: Yngve Mardal Moe <yngve.m.moe@gmail.com>
So, update: I don't actually see any speedup from caching the einsum equation, so I'm not sure whether we want to merge this feature anyway. Any thoughts?
So if I understood the PR correctly, what you are doing is storing the contraction path in a global dictionary. While this could be interesting for further improvements, this PR alone will not lead to a significant speed-up: what is costly is not computing the einsum path, but actually computing the contractions along that path. Therefore what we would need to cache is rather the intermediate results of the einsum. But I do not believe we have easy access to these partial results, so maybe this PR is indeed not useful as such (although the code structure could be useful to start another PR where we cache contraction results along a path; then we would also need to store the path, so we would reuse this code).

No particular comment on the code itself, but maybe the docs are not so clear, since it took me some time to understand what was going on :p

edit: In fact this is useful to impose a specific path, as done in #462 with einsum-opt, so it could also lead to speed-ups that way, I guess. I would be curious to see your tests; maybe you do not see a speedup because the naive path is already close to optimal in your experiments?
@cohenjer I was talking about caching the contraction equation here, not the contraction path. The way I wrote the einsum tenalg backend, I first check the validity of the operation (e.g. mttkrp) and then programmatically generate the corresponding contraction equation. The idea of this PR is that it is redundant to perform these checks and regenerate the equation at each call. However, the gain seems to be pretty much non-existent, so this may be over-engineering. Caching the optimal contraction path, on the other hand (#462), always helps.
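To illustrate what "programmatically generating the contraction equation" means here (a sketch under my own assumptions; `inner_equation` is a hypothetical helper, not TensorLy's actual code), consider building the einsum string for an inner product that contracts the last `n_modes` of one tensor with the first `n_modes` of another:

```python
import numpy as np

def inner_equation(ndim1, ndim2, n_modes):
    """Build an einsum equation contracting the last n_modes of the first
    operand against the first n_modes of the second."""
    letters = "abcdefghijklmnopqrstuvwxyz"
    s1 = letters[:ndim1]
    shared = s1[ndim1 - n_modes:]                       # contracted modes
    s2 = shared + letters[ndim1:ndim1 + ndim2 - n_modes]
    out = s1[:ndim1 - n_modes] + s2[n_modes:]           # surviving modes
    return f"{s1},{s2}->{out}"

eq = inner_equation(3, 3, 2)        # "abc,bcd->ad"
t1 = np.random.rand(2, 3, 4)
t2 = np.random.rand(3, 4, 5)
res = np.einsum(eq, t1, t2)
```

Generating such a string is a handful of slices and joins, which is why caching it (as this PR does) saves almost nothing compared to the cost of the contraction itself.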
Adds a wrapper that caches the contraction equation and reuses it on subsequent calls.