Optimize cell detection (#398) #407

alessandrofelder · 2024-04-25T10:34:07Z

Description

What is this PR

Bug fix
Addition of a new feature
Other

Why is this PR needed?
All credit to #398 and @matham, just needed these changes to take a little detour via this branch so I could test them more in-depth.

What does this PR do?
Merges in the changes from #398 after some more testing.

References

How has this PR been tested?

For some practical triple-checking, I ran cell detection (no registration or classification) via brainmapper on our local HPC cluster with 16 cores and a GPU, with and without the changes proposed in #398, on a CFOS stained whole brain.

Overall cell candidate detection took ~5 h 30 mins with the changes, compared to ~8 h 30 mins before
Both runs found ~3.5 millions cells, with 98% matching coordinates
I compared coordinates on the 38'000 cell candidates found in 20 slices in the middle of the sample brain.
~800 cells didn't match
~200 out of the 800 cells were more than 2 um away from their closest neighbour
visual inspection in napari looked very reasonable

I'd say this is close enough.

Is this a breaking change?

No

Does this PR require an update to the documentation?

Yes, we should update https://brainglobe.info/community/developers/repositories/cellfinder-core/index.html#id1

Checklist:

The code has been tested locally
[n/a] Tests have been added to cover all new functionality (unit & integration)
The documentation has been updated to reflect any changes. To be addressed by [Feature] Update cellfinder developer docs brainglobe.github.io#174
The code has been formatted with pre-commit

* Replace coord map values with numba list/tuple for optim. * Switch to fortran layout for faster update of last dim. * Cache kernel. * jit ball filter. * Put z as first axis to speed z rolling (row-major memory). * Unroll recursion (no perf impact either way). * Parallelize cell cluster splitting. * Parallelize walking for full images. * Cleanup docs and pep8 etc. * Add pre-commit fixes. * Fix parallel always being selected and numba function 1st class warning. * Run hook. * Older python needs Union instead of |. * Accept review suggestion. Co-authored-by: Alessandro Felder <alessandrofelder@users.noreply.github.com> * Address review changes. * num_threads must be an int. --------- Co-authored-by: Alessandro Felder <alessandrofelder@users.noreply.github.com>

adamltyson · 2024-05-03T13:30:42Z

@alessandrofelder just to be clear, I'm basically just reviewing that you've double checked this on large data? You've already reviewed (and approved) the code changes themselves?

alessandrofelder · 2024-05-03T13:31:54Z

Yes, just a sanity-check that more than one of us are happy with the results of the comparison of cell detection candidates before and after.

* replace tensorflow Tensor with keras tensor * add case for TF prep in prep_model_weights * add different backends to pyproject.toml * add backend configuration to cellfinder init file. tests passing with jax locally * define extra dependencies for cellfinder with different backends. run tox with TF backend * run tox using TF and JAX backend * install TF in brainmapper environment before running tests in CI * add backends check to cellfinder init file * clean up comments * fix tf-nightly import check * specify TF backend in include guard check * clarify comment * remove 'backend' from dependencies specifications * Apply suggestions from code review Co-authored-by: Igor Tatarnikov <61896994+IgorTatarnikov@users.noreply.github.com> * PyTorch runs utilizing multiple cores * PyTorch fix with default models * Tests run on every push for now * Run test on torch backend only * Fixed guard test to set torch as KERAS_BACKEND * KERAS_BACKEND env variable set directly in test_include_guard.yaml * Run test on python 3.11 * Remove tf-nightly from __init__ version check * Added 3.11 to legacy tox config * Changed legacy tox config for real this time * Don't set the wrong max_processing value * Torch is now set as the default backend * Tests only run with torch, updated comments * Unpinned torch version * Add codecov token (#403) * add codecov token * generate xml coverage report * add timeout to testing jobs * Allow turning off classification or detection in GUI (#402) * Allow turning off classification or detection in GUI. * Fix test. * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Refactor to fix code analysis errors. * Ensure array is always 2d. * Apply suggestions from code review Co-authored-by: Igor Tatarnikov <61896994+IgorTatarnikov@users.noreply.github.com> --------- Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Igor Tatarnikov <61896994+IgorTatarnikov@users.noreply.github.com> * Support single z-stack tif file for input (#397) * Support single z-stack tif file for input. * Fix commit hook. * Apply review suggestions. * Remove modular asv benchmarks (#406) * remove modular asv benchmarks * recover old structure * remove asv-specific lines from gitignore and manifest * prune benchmarks * Adapt CI so it covers both new and old Macs, and installs required additional dependencies on M1 (#408) * naive attempt at adapting to silicon mac CI * run include guard test on Silicon CI * double-check hdf5 is needed * Optimize cell detection (#398) (#407) * Replace coord map values with numba list/tuple for optim. * Switch to fortran layout for faster update of last dim. * Cache kernel. * jit ball filter. * Put z as first axis to speed z rolling (row-major memory). * Unroll recursion (no perf impact either way). * Parallelize cell cluster splitting. * Parallelize walking for full images. * Cleanup docs and pep8 etc. * Add pre-commit fixes. * Fix parallel always being selected and numba function 1st class warning. * Run hook. * Older python needs Union instead of |. * Accept review suggestion. * Address review changes. * num_threads must be an int. --------- Co-authored-by: Matt Einhorn <matt@einhorn.dev> * [pre-commit.ci] pre-commit autoupdate (#412) updates: - [github.com/pre-commit/pre-commit-hooks: v4.5.0 → v4.6.0](pre-commit/pre-commit-hooks@v4.5.0...v4.6.0) - [github.com/astral-sh/ruff-pre-commit: v0.3.5 → v0.4.3](astral-sh/ruff-pre-commit@v0.3.5...v0.4.3) - [github.com/psf/black: 24.3.0 → 24.4.2](psf/black@24.3.0...24.4.2) Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * Apply suggestions from code review Co-authored-by: sfmig <33267254+sfmig@users.noreply.github.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Simplify model download (#414) * Simplify model download * Update model cache * Remove jax and tf tests * Standardise the data types for inputs to all be float32 * Force torch to use CPU on arm based macOS during tests * Added PYTORCH_MPS_HIGH_WATERMARK_RATION env variable * Set env variables in test setup * Try to set the default device to cpu in the test itself * Add device call to Conv3D to force cpu * Revert changes, request one cpu left free * Revers the numb cores, don't use arm based mac runner * Merged main, removed torch flags on cellfinder install for guards and brainmapper * Lowercase Torch * Change cache directory --------- Co-authored-by: sfmig <33267254+sfmig@users.noreply.github.com> Co-authored-by: Kimberly Meechan <24316371+K-Meech@users.noreply.github.com> Co-authored-by: Matt Einhorn <matt@einhorn.dev> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Alessandro Felder <alessandrofelder@users.noreply.github.com> Co-authored-by: Adam Tyson <code@adamltyson.com>

alessandrofelder marked this pull request as ready for review May 3, 2024 13:27

alessandrofelder self-assigned this May 3, 2024

alessandrofelder requested a review from adamltyson May 3, 2024 13:27

adamltyson approved these changes May 3, 2024

View reviewed changes

adamltyson merged commit a283056 into main May 3, 2024
14 of 16 checks passed

adamltyson deleted the matham/optimise-cell-detection branch May 3, 2024 13:34

adamltyson mentioned this pull request May 9, 2024

cellfinder v1.3.0 blog post brainglobe/brainglobe.github.io#177

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Optimize cell detection (#398) #407

Optimize cell detection (#398) #407

alessandrofelder commented Apr 25, 2024 •

edited

adamltyson commented May 3, 2024

alessandrofelder commented May 3, 2024 •

edited

Optimize cell detection (#398) #407

Optimize cell detection (#398) #407

Conversation

alessandrofelder commented Apr 25, 2024 • edited

Description

References

How has this PR been tested?

Is this a breaking change?

Does this PR require an update to the documentation?

Checklist:

adamltyson commented May 3, 2024

alessandrofelder commented May 3, 2024 • edited

alessandrofelder commented Apr 25, 2024 •

edited

alessandrofelder commented May 3, 2024 •

edited