[FIX] Fix z statistic computation in computefeats2 #644

smoia · 2021-01-13T22:39:46Z

Closes #178.

@tsalo, apologies for the delay! The branch is incredibly behind ME-ICA/tedana/master, ~~but at first sight I didn't see any conflict related to lack of updates~~. Scratch that: this PR does have issues caused by the missing updates. I'll take care of them.

Also, let me know if this should be a [REF] or an [ENH] instead of a [FIX].

Also, it's still a draft because I see it's missing some final tweaks.

Changes proposed in this pull request:

- Main change: removed the option of normalising the output of computecoeffs2, because with the new z-score implementation it no longer makes much sense.
- Main change: added the optional step of computing z-values from the beta scores in get_coeffs when the corresponding function argument is set to True. However, the way the output is returned might not be very Pythonic. What do you think?
- Main change: renamed get_coeffs to get_ls_coeffs to make the least-squares approach more explicit, and updated the related API documentation as well as its calls throughout the code. If that's an issue, though, we can revert to the former name.
- Added the t_to_z function to tedana's stats.py, taken from Vanessa Sochat's TtoZ package, and added its reference for duecredit (but please do check the docstring for copyright). ~~However, this could be avoided by adding the package as dependency and importing it. What do you all think? (Now that I'm not a python newbie, I think I should!)~~ Or shouldn't, given that the package works with nifti files. Maybe we could ask to modify the original script, adding an extra function, and import that? Also, this might have been someone else's work during DCCC2019. If you know who the author was, let me know so that I can add acknowledgements.
- Updated test_get_coeffs to test_get_ls_coeffs, along with the calls to get_coeffs. However, it does not test the new part of get_ls_coeffs, only what was previously tested. If you have any suggestions on how to test the new part of the code, let me know.

Things left to do, which I may or may not do if they are not required to merge the PR (and, by extension, a question for the reviewers on whether they expect them):

- Update the branch to the latest master and fix conflicts!
- ~~Import~~ Kindly propose a PR to TtoZ to be able to import TtoZ as a dependency rather than merely copying code.
- Add an empty commit to acknowledge @CesarCaballeroGaudes as co-author of this PR, since we worked on it together.
- Think about the output of get_ls_coeffs and whether it should always compute z-scores or not.
- Update the documentation beyond the API.
- Test the z-score computation (any ideas for that?).

tsalo · 2021-01-14T14:49:50Z

tedana/stats.py

-    # R-to-Z transform
-    data_Z = np.arctanh(data_R)
+    # get betas and z-values of `data`~`mmix`
+    # mmix is normalized internally


I think we should do it here anyway.

What if we use add_const=True while calling get_ls_coeffs instead, now that there's the option? I think it would make more sense given how the program runs.

In fact, I would set the default of add_const to True: if the data is not demeaned, you need the intercept, and if it is demeaned, the intercept does no harm (it's 0).

@CesarCaballeroGaudes is there any other difference between normalising before computing LS and adding the intercept in the model?

I like including the intercept by default, but isn't variance normalization required to get standardized parameter estimates?
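The two questions in this thread can be checked numerically. The following is a minimal sketch, not tedana code: the simulated `mmix` and `data`, the helper `fit`, and all other names are assumptions made for illustration, and plain `np.linalg.lstsq` stands in for `get_ls_coeffs`. It compares (a) fitting with an intercept column, (b) demeaning both sides first, and (c) z-scoring both sides first.

```python
import numpy as np

rng = np.random.default_rng(0)
n_vols = 100

# Hypothetical mixing matrix: two regressors with different means and scales.
mmix = rng.normal(size=(n_vols, 2)) * np.array([1.0, 5.0]) + np.array([10.0, -3.0])
# Hypothetical single-voxel time series with a large offset and a little noise.
data = mmix @ np.array([2.0, 0.5]) + 100.0 + rng.normal(scale=0.1, size=n_vols)


def fit(X, y):
    """Ordinary least squares via np.linalg.lstsq; returns the coefficients."""
    return np.linalg.lstsq(X, y, rcond=None)[0]


# (a) Raw data, intercept modelled explicitly (the add_const=True idea).
X_const = np.column_stack([mmix, np.ones(n_vols)])
betas_const = fit(X_const, data)[:2]  # drop the intercept term

# (b) Demean both sides, no intercept.
betas_demeaned = fit(mmix - mmix.mean(axis=0), data - data.mean())

# (c) Z-score both sides (demean and variance-normalize), no intercept.
mmix_z = (mmix - mmix.mean(axis=0)) / mmix.std(axis=0)
data_z = (data - data.mean()) / data.std()
betas_standardized = fit(mmix_z, data_z)

print("intercept:   ", betas_const)
print("demeaned:    ", betas_demeaned)
print("standardized:", betas_standardized)
```

Under these assumptions, (a) and (b) return the same slopes, so the intercept replaces demeaning. Only (c) returns standardized estimates, which equal the raw slopes multiplied by the ratio of the regressor and data standard deviations, so variance normalization (or an equivalent rescaling afterwards) is still needed for them.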


tsalo · 2021-02-16T18:56:27Z

tedana/stats.py

    """
    Converts `data` to component space using `mmix`


Suggested change

-    """
-    Converts `data` to component space using `mmix`
+    """Fits mmix to data with OLS and calculates standardized beta values and z-statistics.

tsalo · 2021-02-16T18:58:43Z

tedana/stats.py

+        # sigma = RSS / Degrees_of_Freedom
+        sigma = np.sum(np.power(mdata - np.dot(X, betas.T), 2), axis=0) / df
+        sigma = sigma[:, np.newaxis]
+        # Copmute std of betas:


Suggested change

-        # Copmute std of betas:
+        # compute std of betas:
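The quoted lines compute the residual variance and then the standard deviation of each beta. A self-contained sketch of that chain (OLS fit, residual variance, standard errors, t-values) follows. The function name `ols_t_values`, the array shapes, and the use of `np.linalg.pinv` to invert X'X are assumptions for illustration, not the code under review, and the degrees of freedom assume a full-rank design.

```python
import numpy as np


def ols_t_values(data, design):
    """Fit `design` to `data` with OLS; return betas, t-values and dof.

    data   : (n_samples, n_timepoints) array
    design : (n_timepoints, n_regressors) array, assumed full rank
    """
    y = data.T  # (n_timepoints, n_samples)

    # Least-squares solution, shape (n_regressors, n_samples).
    betas, _, _, _ = np.linalg.lstsq(design, y, rcond=None)

    # Degrees of freedom: timepoints minus fitted parameters.
    dof = design.shape[0] - design.shape[1]

    # Residual variance per sample: RSS / dof.
    residuals = y - design @ betas
    sigma2 = np.sum(residuals**2, axis=0) / dof

    # Standard error of each beta: sqrt(sigma2 * diag((X'X)^-1)).
    xtx_inv_diag = np.diag(np.linalg.pinv(design.T @ design))
    std_betas = np.sqrt(np.outer(xtx_inv_diag, sigma2))

    t_values = betas / std_betas
    return betas.T, t_values.T, dof
```

A simulated sample generated from known betas plus noise gives one way to test this kind of function: the recovered betas should be close to the true ones, and the t-value of a regressor with a true beta of zero should stay small.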

tsalo · 2021-02-16T19:02:40Z

tedana/stats.py

+    """
+    From Vanessa Sochat's TtoZ package.
+    Copyright (c) 2015 Vanessa Sochat
+    MIT Licensed
+    """


Just recommending the docstring I use in NiMARE.

Suggested change

-    """
-    From Vanessa Sochat's TtoZ package.
-    Copyright (c) 2015 Vanessa Sochat
-    MIT Licensed
-    """
+    """Convert t-statistics to z-statistics.
+
+    An implementation of [1]_ from Vanessa Sochat's TtoZ package [2]_.
+
+    Parameters
+    ----------
+    t_values : array_like
+        T-statistics
+    dof : int
+        Degrees of freedom
+
+    Returns
+    -------
+    z_values : array_like
+        Z-statistics
+
+    References
+    ----------
+    .. [1] Hughett, P. (2007). Accurate Computation of the F-to-z and t-to-z
+           Transforms for Large Arguments. Journal of Statistical Software,
+           23(1), 1-5.
+    .. [2] Sochat, V. (2015, October 21). TtoZ Original Release. Zenodo.
+           http://doi.org/10.5281/zenodo.32508
+    """

The actual code from NiMARE might also be good to use since I improved the internal variable names.
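For readers unfamiliar with the transform, here is a minimal sketch of a t-to-z conversion with the same signature as the suggested docstring. It is neither the Hughett (2007) algorithm nor the TtoZ or NiMARE code: it is a simpler survival-function approach built on `scipy.stats`, and the name `t_to_z_sketch` is hypothetical.

```python
import numpy as np
from scipy import stats


def t_to_z_sketch(t_values, dof):
    """Map t-statistics to z-statistics with matching tail probabilities.

    Simplified illustration only; not the Hughett (2007) method.
    """
    t_values = np.asarray(t_values, dtype=float)

    # Upper-tail probability of |t| under a t distribution with `dof`
    # degrees of freedom. Using the survival function of |t| avoids
    # computing 1 - cdf, which loses precision for large statistics.
    p_upper = stats.t.sf(np.abs(t_values), dof)

    # z-value with the same upper-tail probability under a standard normal.
    z_abs = stats.norm.isf(p_upper)

    # Restore the sign of the original statistic.
    return np.sign(t_values) * z_abs
```

For very large |t| the tail probability underflows to zero and this sketch returns infinite values, which is the kind of case the large-argument method in [1]_ is meant to handle.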

CesarCaballeroGaudes and others added 22 commits November 25, 2019 16:54

change stats.py to compute OLS based on Pseudo-Inverse & Compute Z-statistics

904bc01

updating pca.py to same as tedana in upstream

8587b79

Changed default compute_zvalues of get_coeffs to F

1617d2a

LIntered and changed name of function get_coeff into compute_least_squares, changed import in viz.py.

2453c78

Changed orientation of "beta" matrix in RSS

94cc77e

Renmaed compute_least_squares into get_ls_coeffs

b3baa52

Lintering and Removing part of code that is not needed anymore due to normalisation of z_score happening inside get_ls_coeffs

487416d

Improved comments on what's going on in get_ls_coeffs

b96c3b2

Corrected use of "due" in stats.py

7061614

Corrected use of duecredit

4820c19

Merge pull request ME-ICA#1 from smoia/patch-1

1def734

Changed default compute_zvalues of get_coeffs to False

Merge branch 'OLS_stats_modification' into OLS_stats_modification

ba3aa26

Merge pull request ME-ICA#2 from smoia/OLS_stats_modification

2ec3b3c

Modified get_coeffs into get_ls_coeffs, corrected its use in the code.

compute OLS with np.linalg.lstsq instead of np.linalg.pinv

25b5ea3

Removed unused parameter from computefeats2

8b03c2b

Removed unused parameter

88d049d

Removed unused parameter

adc2850

Removed unused parameter

f8dfc8b

Added limits to betas and z_scores to avoid breaking code due to INFs.

c679600

Merge branch 'OLS_stats_modification' of github.com:smoia/tedana into OLS_stats_modification

2e2c63b

Merge remote-tracking branch 'upstream/master' into OLS_stats_modification

a4d2c35

Modified nan_to_num to clip in order to limit arrays, removed limit on betas.

ae582b6

smoia marked this pull request as draft January 13, 2021 22:39

smoia requested a review from tsalo January 13, 2021 22:40

tsalo reviewed Jan 14, 2021


tsalo changed the title ~~[FIX] Fix z statistic computaiton in computefeats2~~ [FIX] Fix z statistic computation in computefeats2 Jan 14, 2021

Stefano Moia added 2 commits January 15, 2021 12:42

Merge branch 'master' into OLS_stats_modification

cd42798

Align indented line and keep the linter happy.

d8bd355

tsalo reviewed Jan 15, 2021


Base automatically changed from master to main February 1, 2021 23:57

Stefano Moia added 2 commits February 15, 2021 18:16

Merge remote-tracking branch 'origin/main' into OLS_stats_modification

4c85723

Rename computefeats2 as get_ls_zvalues

8cfa3ca

tsalo reviewed Feb 16, 2021


tsalo mentioned this pull request Feb 23, 2021

[ENH] BIDS Derivatives-compatible outputs #574

Merged

7 tasks
