
asv benchmarks for imports and tools modules #184

Merged: 36 commits merged into main from smg/basic-asv-benchmark on Jul 24, 2023

Conversation

@sfmig (Contributor) commented Jun 30, 2023

This PR adds a suite of asv benchmarks for timing the main imports of the package and several functions from the tools module. Benchmarks are defined as methods of a class, with common setup and teardown functions.
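As a minimal sketch of this pattern, using asv's documented naming conventions (the class, method, and data names below are illustrative, not the actual benchmarks in this PR):

```python
import numpy as np


class TimeToolsSuite:
    """Hypothetical asv benchmark class: asv times any method whose name
    starts with `time_`, calling `setup`/`teardown` around each one."""

    def setup(self):
        # Common setup, run before each benchmark: build shared test data.
        self.data = np.random.rand(64, 64, 64)

    def teardown(self):
        # Common teardown, run after each benchmark: release the test data.
        del self.data

    def time_mean_projection(self):
        # The timed body: asv reports how long this call takes.
        self.data.mean(axis=0)
```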

Basic usage

  • First, install asv in the desired environment: pip install asv
  • To run all benchmarks locally, navigate to the directory containing the asv config file (cellfinder-core/benchmarks/asv.conf.json) and execute: asv run
  • To run a subset of benchmarks (e.g. the tensorflow prep benchmarks): asv run --bench tools.prep.PrepTF
  • The results are saved as JSON files in a newly created benchmarks/results directory. They can be visualised as a static website by running asv publish followed by asv preview.

For further details on usage and useful commands, see the benchmarks/README.md file.

Structure

The structure follows the approach of the astropy-benchmarks repo, in which the benchmark modules roughly mimic the package modules. The numpy benchmarks follow a slightly different structure, but they are also a useful reference.

mypy fixes

To avoid mypy errors:

  • I added one of the benchmark files to the mypy exclude section in the pre-commit config, and
  • added cellfinder_core.tools.prep.* to the list of modules for which import errors are ignored.
    Not sure if this is best practice, but I couldn't find a better way around it. Happy to get feedback on it!

Existing benchmarks

I moved a set of existing memory benchmarks written by @dstansby to a mem_benchmarks subdirectory within benchmarks.
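For reference, asv can also express memory benchmarks natively via its documented mem_/peakmem_ method prefixes. A purely illustrative sketch (not the moved benchmarks themselves):

```python
import numpy as np


class MemToolsSuite:
    """Hypothetical asv memory benchmarks, distinct from the scripts
    moved into mem_benchmarks."""

    def mem_workspace(self):
        # asv reports the size of the object returned by a `mem_*` method.
        return np.zeros((256, 256))

    def peakmem_projection(self):
        # asv reports the peak memory used while a `peakmem_*` method runs.
        np.ones((512, 512, 8)).sum(axis=-1)
```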

@alessandrofelder (Member) left a comment:


This is a great start! 🎉
I've managed to run the benchmarks locally with ease, thanks to the instructions 🙏

I've tried to give sensible answers to the great questions 😅 I'll feed back on the mypy part separately after some more investigations 🤔

I think the key thing missing is a benchmark for the main function (as discussed)?

IIUC these benchmarks are not part of the CI yet - is it worth opening an issue for that (or is there one already) to discuss the details of what we should compare benchmarks to? (And that seems like a good next step after this?)

Minor point: I wonder whether instead of putting questions for reviewers into the code itself and risking that we inadvertently merge them, should they in future be added as comments on the PR? (the disadvantage would be that it's more annoying to keep track of the questions for the person opening the PR... don't know).

@adamltyson (Member) commented:

Thanks @sfmig!

> Minor point: I wonder whether instead of putting questions for reviewers into the code itself and risking that we inadvertently merge them, should they in future be added as comments on the PR?

Yes, please add all comments to the PR itself rather than in the code, and create issues rather than TODO comments. It's a bit of a pain for the developer, but much easier for everyone else to keep track.

@alessandrofelder (Member) commented:

On mypy I've done some thinking and discussing with @willGraham01 (thanks!). We understood that the import errors are caused by the benchmarks code being outside the cellfinder_core folder, so mypy treats cellfinder_core like a third-party library.

I would suggest ignoring any import errors happening in benchmarks/ by excluding them in pyproject.toml (until we fix brainglobe/cellfinder#278, which is not high-priority).

@sfmig (Contributor, Author) left a comment:


Thanks for the feedback @alessandrofelder! Sorry this PR ended up a bit longer than expected.

> I think the key thing missing is a benchmark for the main function (as discussed)?

Yes, I added an issue for it #206

> IIUC these benchmarks are not part of the CI yet - is it worth opening an issue for that (or is there one already) to discuss the details of what we should compare benchmarks to? (And that seems like a good next step after this?)

Agreed! This is now issue #208 (I added these as cellfinder-core issues for now, with a view to expanding this work to other brainglobe tools later)

> Minor point: I wonder whether instead of putting questions for reviewers into the code itself and risking that we inadvertently merge them, should they in future be added as comments on the PR?

Yes, that sounds like a better approach. Maybe opening a draft PR as soon as possible and adding my questions to the PR's main description as they come up is the least cumbersome option? Will have a go. I'll also aim to open issues rather than leaving TODOs, thanks @adamltyson.

@alessandrofelder (Member) left a comment:


Awesome, @sfmig - thank you!

@alessandrofelder alessandrofelder merged commit 90c6c25 into main Jul 24, 2023
16 of 17 checks passed
@alessandrofelder alessandrofelder deleted the smg/basic-asv-benchmark branch July 24, 2023 09:02