
ENH: Enable models for sparsely sampled fMRI series #414

Open
wants to merge 2 commits into base: master

Conversation

effigies (Collaborator)

Initial commit is all but the Convolve portions of #376 squashed onto master @ 4315865. There may be some parts that are no longer appropriate.

Supersedes and closes #376 (leaving open for now for reference).
Closes #252.

codecov bot commented Mar 11, 2019

Codecov Report

Merging #414 into master will decrease coverage by 0.29%.
The diff coverage is 44.23%.


@@            Coverage Diff            @@
##           master     #414     +/-   ##
=========================================
- Coverage   62.37%   62.08%   -0.3%     
=========================================
  Files          27       27             
  Lines        4564     4602     +38     
  Branches     1174     1185     +11     
=========================================
+ Hits         2847     2857     +10     
- Misses       1433     1455     +22     
- Partials      284      290      +6
Flag         Coverage Δ
#unittests   62.08% <44.23%> (-0.3%) ⬇️

Impacted Files                   Coverage Δ
bids/variables/kollekshuns.py    83.57% <100%> (ø) ⬆️
bids/variables/entities.py       87.77% <100%> (+0.13%) ⬆️
bids/variables/variables.py      83.54% <36.36%> (-4.85%) ⬇️
bids/variables/io.py             72.24% <37.5%> (-3.01%) ⬇️
bids/analysis/analysis.py        86.91% <50%> (-1.87%) ⬇️

Legend: Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Last update 4315865...0729028.

tyarkoni (Collaborator) left a comment


I left some comments. I think it might be worth scheduling a meeting to chat about the more general issue of how to handle dense variables internally in pybids (notably, whether we should switch to using pandas timeseries everywhere).

slicetimes = sorted(img_md['SliceTiming'])
# Sorted onsets a, b, ..., z: z is the final slice onset and
# b - a is the slice duration, so TA = z + (b - a)
ta = np.round(slicetimes[-1] + slicetimes[1] - slicetimes[0], 3)
Collaborator

Is it worth testing for non-uniform slice onsets (and raising a not-supported error)? I don't know if that ever happens in practice, but if it does, we should probably fail...

Collaborator (Author)

That feels like a validation problem. While we can put those checks in ad hoc, I think it would make sense either to fail on load for validation problems or to be able to insert a little boilerplate check like: validate(slicetimes, 'SliceTiming')
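
A minimal sketch of what such a boilerplate check could look like, assuming uniformly spaced slice onsets are the only supported case (the helper name, tolerance, and error type are illustrative, not part of this PR):

import numpy as np

def validate_slice_timing(slicetimes, atol=1e-3):
    # Hypothetical stand-in for validate(slicetimes, 'SliceTiming'):
    # reject non-uniform slice onsets rather than computing a bogus TA.
    diffs = np.diff(sorted(slicetimes))
    if len(diffs) and not np.allclose(diffs, diffs[0], atol=atol):
        raise NotImplementedError("Non-uniform slice onsets are not supported")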

    else:
        ta = tr
elif 'VolumeTiming' in img_md:
    return NotImplemented
Collaborator

Should this be NotImplementedError? I think this will fail as written because of the tuple unpacking when _get_timing_info is called, but the user won't have any idea what went wrong.
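
As a sketch, the suggested fix is to raise the exception rather than return the NotImplemented singleton, so the failure carries a message instead of surfacing as an opaque unpacking error (message text is illustrative):

elif 'VolumeTiming' in img_md:
    raise NotImplementedError(
        "Sparse acquisitions specified via VolumeTiming are not yet supported")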

@@ -453,18 +453,36 @@ def resample(self, sampling_rate, inplace=False, kind='linear'):

self.index = self._build_entity_index(self.run_info, sampling_rate)

x = np.arange(n)
num = len(self.index)
Collaborator

This is on me, I think, but we could probably use less confusing names than n and num for the old and new lengths.

Collaborator (Author)

Yeah, I can try to rewrite; I was mostly aiming for a minimal diff.
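
For example, a purely cosmetic sketch of the rename (no behavior change; n_old and n_new are just suggested names):

n_old = n                 # number of samples before resampling
n_new = len(self.index)   # number of samples after resampling (was: num)
x = np.arange(n_old)
x_new = np.linspace(0, n_old - 1, num=n_new)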

f = interp1d(x, self.values.values.ravel(), kind=kind)
x_new = np.linspace(0, n - 1, num=num)
self.values = pd.DataFrame(f(x_new))
if integration_window is not None:
Collaborator

Rather than determine which approach to use based on whether integration_window is passed, I think we should probably always use interpolation for upsampling and decimation for downsampling (we should definitely avoid downsampling via interpolation—that was a pretty bad decision on my part). Actually, I'm not sure integration/averaging is a great solution, as it's sort of a half-assed way to do temporal filtering. If we're going to go down this road, maybe we should just temporally filter out frequencies above half the target sampling rate and then decimate.

In general, I wonder if it makes sense to implement the resampling operations ourselves, or if we should just do it via pandas (or something like traces, which wraps pandas). I would expect that resampling in pandas is probably going to be more efficient than rolling our own solution, while relieving us of the burden of extensive testing.
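
A sketch of the filter-then-decimate idea for integer downsampling factors (the wrapper is illustrative; scipy.signal.decimate applies an anti-aliasing low-pass below half the target rate before discarding samples):

import numpy as np
from scipy.signal import decimate

def downsample(values, old_rate, new_rate):
    # Low-pass filter below new_rate / 2, then keep every q-th sample.
    q = int(round(old_rate / new_rate))
    return decimate(np.asarray(values, dtype=float), q,
                    ftype='fir', zero_phase=True)

The pandas route would instead put the values in a Series with a TimedeltaIndex and use Series.resample, which downsamples by aggregation (e.g. .resample('2s').mean()) rather than by filtering.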

integrator = lil_matrix((num, n), dtype=np.uint8)
count = None
for i, new_time in enumerate(new_times):
    cols = (old_times >= new_time) & (old_times < new_time + integration_window)
Collaborator

Should the integration window be centered on new_time, rather than using it as the lower boundary? Otherwise the value at each downsampled point is basically reading the future, which is probably not what we want.
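
A centered variant could look like this (same variable names as the diff above; purely a sketch of the suggestion):

# Center the window on new_time instead of using it as the lower bound,
# so each downsampled point averages samples symmetrically around it.
half = integration_window / 2
cols = (old_times >= new_time - half) & (old_times < new_time + half)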

effigies (Collaborator, Author)

Ah, I saw you said we should set up a call. Looking at this again, I agree. Hacking sparse sampling into (or next to) the current approach is probably not worth it, and we should work out a strategy for resampling more broadly, with discontinuous integration windows in mind. It might also be worth thinking through the clustered-acquisition approach at the same time, to avoid settling on a solution that will need to be refactored again when we get around to that.

@satra Do you think you or anyone in your group should be on the call?

effigies (Collaborator, Author) commented Aug 7, 2019

@satra Bump.

tyarkoni (Collaborator)

Any status update on this, post-resampling changes? (Fine if not; this is low priority from my perspective.)

satra (Contributor) commented Jan 10, 2020

@tyarkoni - I haven't had a look at this in a while. Can you point me to where the design matrix is being built, and I can backtrack from there?

Successfully merging this pull request may close these issues:
Support sparse acquisition