
Add a score function for SRM #478

Open
wants to merge 4 commits into master
Conversation

@Kajiyu commented Aug 13, 2020

This adds a function to SRM that calculates the log-likelihood of test subjects' data, for use in model selection.
We followed the style of scikit-learn's GaussianMixture.score: https://scikit-learn.org/stable/modules/generated/sklearn.mixture.GaussianMixture.html#sklearn.mixture.GaussianMixture.score

Co-authored-by: @lcnature.
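For reference, a minimal usage sketch of the proposed API, mirroring the linked scikit-learn convention. The toy data, the constructor arguments, and the assumption that score() returns a single log-likelihood value are illustrative only, not the final brainiak API:

    # Hypothetical usage sketch of the proposed SRM.score() from this PR.
    # Toy data and parameter values are made up for illustration.
    import numpy as np
    from brainiak.funcalign.srm import SRM

    rng = np.random.RandomState(0)
    voxels, samples, subjects, k = 30, 40, 3, 5

    # One (voxels x samples) array per subject; held-out data for the same subjects.
    train = [rng.randn(voxels, samples) for _ in range(subjects)]
    test = [rng.randn(voxels, samples) for _ in range(subjects)]

    model = SRM(n_iter=5, features=k)
    model.fit(train)          # learn per-subject bases W_i and the shared response
    ll = model.score(test)    # proposed: log-likelihood of the held-out data
    print(ll)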

@lcnature (Contributor) commented Aug 14, 2020

This PR partially deals with #69 by adding cross-validated logp for SRM.

@mihaic (Member) commented Aug 24, 2020

@qihongl, could you please review?

@qihongl (Member) commented Feb 23, 2021

Oops, sorry about the delayed reply, @lcnature @Kajiyu, I didn't notice it! I will take a look in 2 weeks.

Btw, is there a test added for this module?

@lcnature (Contributor) commented
Thanks @qihongl! It looks like we did not add one. I may work on it this weekend or next week.

@qihongl (Member) commented Mar 2, 2021

@lcnature Ah, do you mean there are some new updates that haven't been added to this PR?

@mihaic (Member) commented Mar 8, 2021

@lcnature, please don't forget about this pull request. It seems close to completion.

-------

Review comments on the diff

    ll_score : Log-likelihood of test-subject's data
    """
Review comment (Member):
Might it be better to have input validation here to make sure all d in data have the same number of samples?

I noticed that later, on line 616, you are skipping any d in data that is None. I was wondering if this can be done earlier, such as removing None entries at the very beginning, or warning the user that there is None in the data list. Feel free to decide what's more user-friendly.

Review comment (Member):

Ah, and this function probably wants to assert len(data) >= 2.
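A minimal sketch combining the validation suggested in the two comments above; the helper name _check_score_input, its placement, and the exact messages are hypothetical:

    import warnings

    def _check_score_input(data):
        """Hypothetical validation sketch based on the review comments above:
        require at least 2 subjects, warn about None entries, and require the
        same number of samples for every non-None subject."""
        if len(data) < 2:
            raise ValueError("score() requires data from at least 2 subjects.")
        if any(d is None for d in data):
            warnings.warn("`data` contains None entries; they will be skipped.")
        n_samples = {d.shape[1] for d in data if d is not None}
        if len(n_samples) > 1:
            raise ValueError("All subjects must have the same number of samples.")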

Comment on lines +584 to +587
    w = []
    for subject in range(subjects):
        _w = self._update_transform_subject(data[subject], self.s_)
        w.append(_w)
Review comment (Member):
Not necessary, but I think the following is slightly faster:

    w = [self._update_transform_subject(data[subject], self.s_) for subject in range(subjects)]

        w.append(_w)

    x, mu, rho2, trace_xtx = self._init_structures(data, subjects)
    sigma_s = self.sigma_s_
Review comment (Member):
Not necessary, but how about just using self.sigma_s_ directly on line 597?


    # Invert Sigma_s using Cholesky factorization
    (chol_sigma_s, lower_sigma_s) = scipy.linalg.cho_factor(
        sigma_s, check_finite=False)
Review comment (Member):
Referred to in a comment above.

    wt_invpsi_x = np.zeros((self.features, samples))
    trace_xt_invsigma2_x = 0.0
    for subject in range(subjects):
        if data[subject] is not None:
Review comment (Member):

Referred to in a comment above.

@qihongl left a comment

Hi, I'm not familiar with MPI, but I think the code makes sense (in terms of matching the formulas in the paper)!
I think some input validation for data at the beginning of this function would be useful. I also have two other very minor comments. And a test would be nice, perhaps one that computes the score for some toy data?

Thanks!
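For reference, a sketch of the kind of toy-data test suggested above. The test name, the toy-data shapes, and the assumption that score() returns a finite scalar are illustrative, not part of the PR:

    # Hypothetical test sketch for the proposed SRM.score().
    import numpy as np
    from brainiak.funcalign.srm import SRM

    def test_srm_score_on_toy_data():
        rng = np.random.RandomState(0)
        voxels, samples, subjects, k = 20, 30, 3, 4
        train = [rng.randn(voxels, samples) for _ in range(subjects)]
        test = [rng.randn(voxels, samples) for _ in range(subjects)]

        model = SRM(n_iter=5, features=k)
        model.fit(train)

        ll = model.score(test)
        assert np.ndim(ll) == 0      # a single log-likelihood value
        assert np.isfinite(ll)       # no NaN/inf on well-formed input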
