Population module memory-related crash #1650

MDijsselhof · 2024-03-15T16:04:31Z

Description

The population module loads images for all subjects but crashes if there are too many subjects and/or not enough memory available.

It crashes at the following point, using 2 CPUs with 2 Gb each (I've used very little memory just to speed up the crashing):

It crashes at 51%, in xASL_wrp_CreatePopulationTemplates starting from line 418.

Points Henk

Matlab is notoriously bad in memory management, so we have to explicitly clear variables, even if we implement the changes below (afaik).
I agree with your points below. I would indeed revamp xASL_wrp_CreatePopulationTemplates, but keep a copy of this old version in our active ExploreASL, commented out, which has the benefit that we will update it with our general updates (nomenclature/variables/BIDS etc).
This old version has several more options than only choosing the calculation type, e.g., smoothing, mirroring left-right; not sure if it is easy to keep all these options but I would like it.
Other calculations that I use this function for are minimal intensity projection (MIP) and maximal intensity projection (MIP) across the population, voxel-wise. This can be much more useful than mean or even sd for seeing things that everyone has but at different locations, e.g. vascular artifacts.
The previous also serves as an example that you probably haven't thought about, as an indication that Function2Use is a useful feature in the future (I'm sure I will like it to test another idea, and I actually find this group-wise processing a huge pre of our pipeline so for me a good reason to keep a copy of the old function. Or would it even be possible to keep Function2Use still as a different sub-function inside this function? So that we have two different sub-functions for loading data (the default one sums, the other loads all volumes to memory) and similarly for the calculation subfunction? (or perhaps not even subfunctions but code-sections, using boolean to switch between Function2Use and hardcoded mean+sd+mip+map).

Explanation for smoothing:
you have lesion maps per subject where lesions are often small spots (e.g. microbleeds or microinfarcts). Or you have

The smoothing can be useful for creating lesion heat maps, and the mirroring for things that always happen unilaterally (e.g. I used this for creating patient-group average vascular territories, depending on the amount of stenosis, which is always unilateral but sometimes left sometimes right).

Discussion Henk+Jan:

We will revamp this to avoid loading all NIfTIs in RAM.

We will load NIfTIs 1 by 1 and add them to the corresponding templates (SNR, SD, mean). Only the templates will stay in the memory.
Memory mapping and IM2Column will not be used as it only accelerates when data is used multiple times, which in most cases we do not do.
We will add a second-pass option (OFF on default) that will remove outliers.
We will only do SD, mean, etc, but not median as this is not possible with a single-pass option (HENK: we only used this for outlier removal so this won't change much)
We will keep all the computation tricks as they are, only the order of reading will change and the memory management.
We will also create per-session templates and per-group templates.

Tasks proposed by Jan

Changes done only inside xASL_wrp_CreatePopulationTemplates

having a (sub)function that defines which subjectsessions to calculate mean+sd for (which defaults to all)
loading & summing

How to test

Optional: insert description about how to test the code changes here

Release notes

Required: summarize the changes for the release notes here

The text was updated successfully, but these errors were encountered:

jan-petr · 2024-03-24T15:11:22Z

Discussion Henk+Jan:

We need to revamp this to avoid loading the whole thing to the memory.

We will load NIfTIs 1 by 1 and add them to the corresponding templates (SNR, SD, mean). Only the templates will stay in the memory.
Memory mapping and IM2Column will not be used as it only accelerates when data is used multiple times, which in most cases we do not do.
We will add a second-pass option (OFF on default) that will remove outliers.
We will only do SD, mean, etc, but not median as this is not possible with a single-pass option (HENK: we only used this for outlier removal so this won't change much)
We will keep all the computation tricks as they are, only the order of reading will change and the memory management.
We will also create per-session templates and per-group templates.

MDijsselhof added the bug Something isn't working label Mar 15, 2024

MDijsselhof added this to the Release 1.12.0 milestone Mar 15, 2024

MDijsselhof assigned jan-petr and HenkMutsaerts and unassigned jan-petr Mar 15, 2024

jan-petr mentioned this issue Apr 8, 2024

Master issue Population revamp #1675

Open

10 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Population module memory-related crash #1650

Population module memory-related crash #1650

MDijsselhof commented Mar 15, 2024 •

edited by HenkMutsaerts

jan-petr commented Mar 24, 2024 •

edited by HenkMutsaerts

Population module memory-related crash #1650

Population module memory-related crash #1650

Comments

MDijsselhof commented Mar 15, 2024 • edited by HenkMutsaerts

Description

Points Henk

Discussion Henk+Jan:

Tasks proposed by Jan

How to test

Release notes

jan-petr commented Mar 24, 2024 • edited by HenkMutsaerts

MDijsselhof commented Mar 15, 2024 •

edited by HenkMutsaerts

jan-petr commented Mar 24, 2024 •

edited by HenkMutsaerts