perf: update datavzrd to wrapper `v3.10.2` datavzrd.smk #92

dlaehnemann · 2024-05-13T11:21:11Z

this then uses datavzrd=2.36.10

this then uses `datavzrd=2.36.10`

commit 1/3

commit 2/4

commit 3/4

commit 4/4

fxwiegand · 2024-05-27T08:07:27Z

.test/config/config.yaml

@@ -93,6 +93,9 @@ enrichment:
    # the species specified by resources -> ref -> species above
    pathway_database: "panther"

+report:
+  offer_excel: true


Suggested change

offer_excel: true

offer_excel: false

Depending on the size of the input data this might cause extremely long runtimes as the excel writer in rust requires to read all of the input data into memory before one can write it out:

https://github.com/datavzrd/datavzrd/blob/556feb7b6a39745b42900d27fbcad26ee050e158/src/render/portable/mod.rs#L1944-L1981

Many thanks for checking the template and for the clarification!

Did you already check for better options for xlsx file writing? Have you for example seen this one:
https://docs.rs/simple-xlsx-writer/latest/simple_xlsx_writer/

At least it doesn't read everything into memory. But not sure, if it will be any quicker and if the interface works for how you parse stuff in datavzrd...

This is literally the one we are using right now 😄

Hmmm. Maybe switch to something else, then... 🤔

But yeah, I think this switch of the default here is good enough for now. I'll add a comment warning of the performance hit when activating this...

Hmmm. Maybe switch to something else, then... 🤔

I am not sure there is something that would allow something like flushing the writer every thousand records for example. But it'd great to have something like that for sure. Might even have to do with excel itself maybe?

But yeah, I think this switch of the default here is good enough for now. I'll add a comment warning of the performance hit when activating this...

Yeah a warning when manually activated sounds good!

based on this very approximate statement, but started with a lower estimate here: pachterlab/sleuth#139 (comment) so we should increase this if we see this failing with out of memory on any real datasets

dlaehnemann added 9 commits May 13, 2024 13:20

perf: update datavzrd to wrapper v3.10.2 datavzrd.smk

994ec02

this then uses `datavzrd=2.36.10`

perf: switch diffexp_datavzrd to new wrapper structure

5770b83

remove comments that make snakefmt choke

a26cbff

turn yte template rendering on in template

cf1256e

perf: make offer-excel in diffexp-template.yaml configurable

50671a6

commit 1/3

perf: make offer-excel in diffexp-template.yaml configurable

0eb2a0e

commit 2/4

perf: make offer-excel in diffexp-template.yaml configurable

888706a

commit 3/4

perf: make offer-excel in diffexp-template.yaml configurable

accaaa7

commit 4/4

fix: chaining in config.get() statements

7065042

fxwiegand reviewed May 27, 2024

View reviewed changes

dlaehnemann added 4 commits May 27, 2024 13:20

fix: datavzrd template python code

cab060b

perf: dynamic threads-dependent mem_mb for sleuth_init

d5e7efa

based on this very approximate statement, but started with a lower estimate here: pachterlab/sleuth#139 (comment) so we should increase this if we see this failing with out of memory on any real datasets

fix: put back accidental deletion

082b03f

snakefmt

a75ce40

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

perf: update datavzrd to wrapper `v3.10.2` datavzrd.smk #92

perf: update datavzrd to wrapper `v3.10.2` datavzrd.smk #92

dlaehnemann commented May 13, 2024

fxwiegand May 27, 2024

dlaehnemann May 27, 2024

fxwiegand May 27, 2024

dlaehnemann May 28, 2024

fxwiegand May 28, 2024

perf: update datavzrd to wrapper v3.10.2 datavzrd.smk #92

Are you sure you want to change the base?

perf: update datavzrd to wrapper v3.10.2 datavzrd.smk #92

Conversation

dlaehnemann commented May 13, 2024

fxwiegand May 27, 2024

Choose a reason for hiding this comment

dlaehnemann May 27, 2024

Choose a reason for hiding this comment

fxwiegand May 27, 2024

Choose a reason for hiding this comment

dlaehnemann May 28, 2024

Choose a reason for hiding this comment

fxwiegand May 28, 2024

Choose a reason for hiding this comment

perf: update datavzrd to wrapper `v3.10.2` datavzrd.smk #92

perf: update datavzrd to wrapper `v3.10.2` datavzrd.smk #92