Optimal transport R matrix rows don't sum to 1 #72

jaysonjeg · 2023-10-08T21:32:56Z

alignment_methods.OptimalTransportAlignment() results in transformation matrix R whose rows and columns sum to 1/nfeatures. This does not seem ideal. On the other hand, alignment_methods.POTAlignment()'s R matrix has rows and columns that sum to 1, which seems better.

If the columns of R sum to 1/nfeatures, then if you transform a functional brain map, the transformation is "shrinking" each parcel's mean value depending on the size of each parcel.

Minimal reproducing example:

from fmralign import alignment_methods as am
import numpy as np
x = np.random.random((20,50)) #20 nsamples, 50 nfeatures
y = np.random.random((20,50))
ot = am.OptimalTransportAlignment()
pot = am.POTAlignment()
ot.fit(x,y)
pot.fit(x,y)
print(ot.R.sum(axis=0)[0:3])
print(pot.R.sum(axis=0)[0:3])

Solution:

At the end of alignment_methods.OptimalTransportAlignment().fit(), add a line: self.R = self.R * self.R.shape[0]

emdupre · 2023-10-11T21:59:33Z

Hi @jaysonjeg,

Thanks for reporting! Could you confirm which version of fmralign you're using? Reproducing your example from the current main branch, I get:

$ print(ot.R.sum(axis=0)[0:3])
[0.9999997 0.9999997 1.       ]

$ print(pot.R.sum(axis=0)[0:3])
[1. 1. 1.]

Since the .fit() call includes the multiplication you suggested, I think this should be correct. But please let me know if you're still seeing this with the latest code.

We haven't had a release in a long time, but that will happen soon! I'd just like to merge #70 first. Hopefully that will help to clarify these concerns!

jaysonjeg · 2023-10-12T00:27:00Z

Hi Elizabeth, you are right. I was using a previous version of fmralign. You have fixed this issue already.

emdupre · 2023-10-12T16:18:31Z

Thanks for confirming! I'll close this, but please open a new issue if you encounter any problems with the newest code.

alexisthual · 2023-10-24T08:01:17Z

Hi all!
I just came across this, and it feels to me that it is normal that each row / column of an OT plan should sum to 1 / n_samples. Indeed, in our case, a transport plan is a soft mapping between point clouds ; each point is associated with a mass (1 / n_samples in the case of a uniform distribution) and each row / column describes how this mass should be spread across points of the other cloud.
However, in the general case, the total mass of the plan (sum of all coefficients) should be close to 1.
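
As a rough illustration of these marginals, here is a minimal NumPy sketch of entropic OT (plain Sinkhorn iterations) between the features of two random arrays. This is not fmralign's or POT's implementation: the helper name `sinkhorn_plan`, the regularization `eps`, the iteration count and the cost scaling are all assumptions chosen for the example.

```python
import numpy as np


def sinkhorn_plan(cost, eps=0.5, n_iter=200):
    """Entropic OT plan between two uniform distributions (hypothetical helper)."""
    n, m = cost.shape
    a = np.full(n, 1.0 / n)  # uniform mass on each source point
    b = np.full(m, 1.0 / m)  # uniform mass on each target point
    K = np.exp(-cost / eps)  # Gibbs kernel
    u = np.ones(n)
    v = np.ones(m)
    for _ in range(n_iter):
        u = a / (K @ v)    # rescale rows towards marginal a
        v = b / (K.T @ u)  # rescale columns towards marginal b
    return u[:, None] * K * v[None, :]


rng = np.random.default_rng(0)
x = rng.random((20, 50))  # 20 samples, 50 features
y = rng.random((20, 50))

# Squared Euclidean cost between features (columns), scaled so its maximum is 1
cost = ((x.T[:, None, :] - y.T[None, :, :]) ** 2).sum(axis=-1)
cost /= cost.max()

plan = sinkhorn_plan(cost)
print(plan.sum(axis=1)[:3])  # row sums, each close to 1 / 50
print(plan.sum(axis=0)[:3])  # column sums, each close to 1 / 50
print(plan.sum())            # total mass, close to 1
```

Each point carries a mass of 1 / 50 here, so the rows and columns sum to 1 / 50 while the whole plan sums to 1.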

jaysonjeg · 2023-10-24T19:54:50Z

But if the rows/columns sum to 1/nsamples, then the R matrix that OT computes for a large parcel will have smaller coefficients. If the R matrices for each parcel are then applied to transform a brain map (like task fMRI), the mean value in each parcel will be scaled by 1/nsamples. That is, the brain map will become dimmer in large parcels. Do we want this to happen?


bthirion · 2023-10-24T20:44:14Z

In larger parcels, the coefficients are indeed smaller, but this should be compensated by the fact that there are more coefficients. Is this an effect you have observed? (Cancellation effects may indeed be present, but more so in the noise case where the signal sign varies a lot.)

jaysonjeg · 2023-10-24T20:59:05Z

Continuing the reproducing example from the first post in this thread:
z = np.random.random((50))
z_transformed = z @ ot.R
print(z.sum())
print(z_transformed.sum())

The sum (or mean) of z_transformed is 1/50th the sum (or mean) of z. This is with the older version of fmralign. With the latest version, the rows/cols of R sum to 1, so this no longer happens.

alexisthual · 2023-10-25T09:26:11Z

If I understand correctly what you are looking for, one indeed needs to normalize the z_transformed tensor defined in your example (i.e. divide by the plan's sum of rows / columns). You can find an example here: https://github.com/alexisthual/fugw/blob/58cdfee03f39e6ae6f66b701489071af24664391/src/fugw/mappings/dense.py#L335C1-L337C55
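
To make the normalization concrete, here is a small NumPy sketch. It is not the fugw code linked above: the plan is a hypothetical permutation plan with uniform mass 1 / n, chosen only because the result is easy to check by hand.

```python
import numpy as np

rng = np.random.default_rng(0)
n = 50

# Hypothetical plan: a permutation matrix scaled so that
# every row and every column sums to 1 / n
perm = rng.permutation(n)
plan = np.zeros((n, n))
plan[np.arange(n), perm] = 1.0 / n

z = rng.random(n)
z_transformed = z @ plan                         # unnormalized: shrunk by 1 / n
z_normalized = z_transformed / plan.sum(axis=0)  # divide by the column sums

print(z.sum())              # original sum
print(z_transformed.sum())  # original sum divided by n
print(z_normalized.sum())   # back to the original sum
```

With uniform marginals, dividing by the column sums is equivalent to multiplying the plan by n, which is the rescaling suggested in the first post.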

jaysonjeg · 2023-10-25T10:26:13Z

In the above thread, I think you meant to say (1/nfeatures), not (1/nsamples)? I don't fully understand the rationale for rows/columns summing to (1/nfeatures) rather than to 1. But I agree that if the normalization is desired, it can be done retrospectively.

alexisthual · 2023-10-25T16:06:56Z

Sorry if it was not clear: indeed, the transport plan is of size n_source_vertices x n_target_vertices (I was referring to either of these two values as n_samples).

emdupre mentioned this issue Oct 11, 2023

Fix OTT-JAX tests #62

Closed

emdupre closed this as completed Oct 12, 2023

emdupre reopened this Oct 24, 2023
