
export_to_phy() not respecting n_jobs #2817

Open
cheydrick opened this issue May 7, 2024 · 7 comments
Labels: concurrency (Related to parallel processing), exporters (Related to exporters module)

Comments

@cheydrick

The "Fitting PCA" step in the export to Phy process isn't respecting the n_jobs argument anymore.

I have set the global n_jobs parameter to 4 with:

```python
si.set_global_job_kwargs(n_jobs=4)
```

When using SpikeInterface from April 17 (commit hash 33d478a) I can see that only four cores are being used while export_to_phy() is running.
[screenshot: fittingpca_Apr17]

However, when using SpikeInterface from May 6 (commit hash 42e1f02) the "Fitting PCA" step is saturating all cores:
[screenshot: fittingpca_may6]

Thanks,
Chris
chris@plexon.com

zm711 (Collaborator) commented May 7, 2024

Maybe related to #2696...

Thanks for the report. I'll see if I can track this down.

EDIT: @alejoe91 @samuelgarcia, did PCA ever respect the number of cores set by n_jobs?

EDIT2: Looks like it does use n_jobs, though there might be a separate bug here:

```python
if isinstance(n_jobs, float):
    n_jobs = int(n_jobs * os.cpu_count())
elif n_jobs < 0:
    n_jobs = os.cpu_count() + 1 + n_jobs
job_kwargs["n_jobs"] = max(n_jobs, 1)
```

If n_jobs=4.0 is given, it would evaluate

```python
n_jobs = int(4.0 * os.cpu_count())
```

which could max out all cores.
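A minimal, self-contained sketch of that normalization logic (a stand-alone re-creation of the snippet above, not the library function itself) shows how differently an int 4 and a float 4.0 are treated:

```python
import os

def normalize_n_jobs(n_jobs):
    # Re-creation of the normalization above: a float is treated as a
    # fraction/multiple of the CPU count, a negative int counts back
    # from the total core count, and the result is floored at 1.
    if isinstance(n_jobs, float):
        n_jobs = int(n_jobs * os.cpu_count())
    elif n_jobs < 0:
        n_jobs = os.cpu_count() + 1 + n_jobs
    return max(n_jobs, 1)

print(normalize_n_jobs(4))    # int: stays 4
print(normalize_n_jobs(4.0))  # float: 4.0 * cpu_count, far more than 4 on a multi-core machine
print(normalize_n_jobs(-1))   # negative: cpu_count
```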

@cheydrick did you enter 4 as a float or as an int for your script?

zm711 added the exporters (Related to exporters module) and concurrency (Related to parallel processing) labels on May 7, 2024
cheydrick (Author) commented

@zm711 it's an int.

```python
si.set_global_job_kwargs(n_jobs=4)
```

zm711 (Collaborator) commented May 7, 2024

Could you do me a favor and run with verbose=True inside export_to_phy? There has been some recent work on verbose, so it might not behave, but it would be nice to see the maximum amount of information, specifically how many jobs are being passed to the ChunkRecordingExecutor.

zm711 (Collaborator) commented May 7, 2024

The other thing we may need to look at is this line, where we call sorting.to_multiprocessing(). It seems like a NumpySorting would be converted to a SharedMemorySorting here. Is that desired?

```python
func = _all_pc_extractor_chunk
init_func = _init_work_all_pc_extractor
init_args = (
    recording,
    sorting.to_multiprocessing(job_kwargs["n_jobs"]),
    all_pcs_args,
    waveforms_ext.nbefore,
    waveforms_ext.nafter,
    unit_channels,
    pca_model,
)
```

```python
def to_multiprocessing(self, n_jobs):
    """
    When necessary, turn the sorting object into:
      * NumpySorting when n_jobs=1
      * SharedMemorySorting when n_jobs>1

    If the sorting is already a NumpySorting, SharedMemorySorting or
    NumpyFolderSorting, the sorting itself is returned and no
    transformation is applied.

    Parameters
    ----------
    n_jobs : int
        The number of jobs.

    Returns
    -------
    sharable_sorting :
        A sorting that can be used for multiprocessing.
    """
    from .numpyextractors import NumpySorting, SharedMemorySorting
    from .sortingfolder import NumpyFolderSorting

    if n_jobs == 1:
        if isinstance(self, (NumpySorting, SharedMemorySorting, NumpyFolderSorting)):
            return self
        else:
            return NumpySorting.from_sorting(self)
    else:
        if isinstance(self, (SharedMemorySorting, NumpyFolderSorting)):
            return self
        else:
            return SharedMemorySorting.from_sorting(self)
```
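The branch that matters here can be sketched with hypothetical stand-in classes (minimal fakes, only to illustrate the dispatch, not the real extractors): with n_jobs=1 an existing NumpySorting passes through unchanged, while n_jobs>1 forces a conversion to shared memory.

```python
# Hypothetical minimal stand-ins for the real extractor classes,
# only to illustrate the n_jobs-based dispatch above.
class NumpySorting:
    @classmethod
    def from_sorting(cls, sorting):
        return cls()

class SharedMemorySorting:
    @classmethod
    def from_sorting(cls, sorting):
        return cls()

class NumpyFolderSorting:
    pass

def to_multiprocessing(sorting, n_jobs):
    # n_jobs == 1: any of the three "sharable" types passes through unchanged
    if n_jobs == 1:
        if isinstance(sorting, (NumpySorting, SharedMemorySorting, NumpyFolderSorting)):
            return sorting
        return NumpySorting.from_sorting(sorting)
    # n_jobs > 1: NumpySorting is NOT passed through; it gets converted
    if isinstance(sorting, (SharedMemorySorting, NumpyFolderSorting)):
        return sorting
    return SharedMemorySorting.from_sorting(sorting)

s = NumpySorting()
print(type(to_multiprocessing(s, 1)).__name__)  # NumpySorting (unchanged)
print(type(to_multiprocessing(s, 4)).__name__)  # SharedMemorySorting (converted)
```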

cheydrick (Author) commented

@zm711 Looks like verbose is True by default.

I explicitly set it to True anyway, but the output was the same as in my screenshots above.

zm711 (Collaborator) commented May 8, 2024

Unfortunately (and this is why verbose needs to be fixed), that verbose is just for the function itself and is not passed down to the other functions that accept verbose, so it doesn't do what I wanted. I'll try to dig a bit more by running the PCA by itself and see if I can tell what is going on.

zm711 (Collaborator) commented May 8, 2024

I think we have to wait for @samuelgarcia to comment on this one. But based on my reading, the n_jobs being fed into the ProcessPoolExecutor is just a maximum number of worker processes, so the OS can schedule those processes however it wants (for example, we could set n_jobs greater than the number of cores and the OS would just have to schedule the extra processes onto the available processors). This doesn't explain why the behavior changed with the switch to NumpySorting from SharedMemorySorting, but n_jobs cannot be absolutely guaranteed to equal the number of cores in use, since the OS has a say in this.
