Interactive use + Pydantic models for plots #2442

vladsavelyev · 2024-03-18T17:04:29Z

Use Pydantic

Plots, tables, and plot data (Plot, DataTable, Dataset, and derived classes). This helps validate the data dumped for JavaScript internally.
Configuration in interactive functions and on the command line (ClConfig). This validates run parameters.
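
As a rough illustration of what this kind of validation buys, here is a minimal sketch. The SketchDataset and SketchPlot classes and their fields are hypothetical stand-ins, not the actual Plot and Dataset models from this PR; only BaseModel and ValidationError are real Pydantic names.

```python
from typing import Dict, List

from pydantic import BaseModel, ValidationError


class SketchDataset(BaseModel):
    # Hypothetical fields, for illustration only.
    label: str
    # sample name -> category -> numeric value
    data: Dict[str, Dict[str, float]]


class SketchPlot(BaseModel):
    # Hypothetical fields, for illustration only.
    id: str
    title: str
    datasets: List[SketchDataset]


def load_plot(dump: dict) -> SketchPlot:
    """Build a plot model from a plain dict, validating types on the way in."""
    return SketchPlot(**dump)


good = {
    "id": "fastqc_counts",
    "title": "Sequence counts",
    "datasets": [
        {"label": "Reads", "data": {"sample_1": {"unique": 1200, "duplicate": 300}}}
    ],
}
plot = load_plot(good)

# A non-numeric value in the data is caught before it reaches the report.
bad = dict(good, datasets=[{"label": "Reads", "data": {"sample_1": {"unique": "n/a"}}}])
try:
    load_plot(bad)
    bad_rejected = False
except ValidationError:
    bad_rejected = True
```

The point of the sketch is that a malformed dump fails loudly at construction time rather than surfacing later as a broken plot in the browser.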

Future improvements would include validating custom content with Pydantic.

Partially addresses #1790

Support interactive usage

Support interactive usage of MultiQC, e.g. in notebooks. Partly addresses #1051

Split multiqc.py into submodules with better-separated stages: parsing logs, rendering plots and tables, and writing data to disk. This made it possible to add functions such as multiqc.parse_logs(), which parses additional logs and appends them to the same report without rendering HTML or writing anything to disk; multiqc.list_plots() and multiqc.show_plot(), which list and display individual plots without rendering all of them; and multiqc.write_report(), which only renders and writes the results to disk.
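
The staged workflow described above might look roughly like this in a notebook. This is a sketch under assumptions: the function names come from this description, but the exact arguments (a directory path for parse_logs, a module name and section name for show_plot) and the shape of the value returned by list_plots are assumed. The module is passed in as the mqc parameter so the helpers can be exercised with a stand-in object.

```python
def parse_incrementally(mqc, log_dirs):
    """Parse each directory in turn; results accumulate in the same report.

    `mqc` is expected to be the imported multiqc module (or a stand-in).
    """
    for log_dir in log_dirs:
        # No HTML is rendered and nothing is written to disk at this stage.
        mqc.parse_logs(log_dir)
    # Assumed to describe which plots are available for each parsed module.
    return mqc.list_plots()


def show_one(mqc, module_name, section_name):
    """Display a single plot without rendering the rest of the report."""
    return mqc.show_plot(module_name, section_name)


# Notebook usage (requires MultiQC to be installed; paths are placeholders):
#   import multiqc
#   plots = parse_incrementally(multiqc, ["run1/", "run2/"])
#   show_one(multiqc, "fastqc", "Sequence Counts")
```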

Config parameters can be passed individually to any interactive method (like verbose, modules_order, etc).
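
One way to picture per-call parameters being validated and applied to a shared config is the sketch below. RunParams and update_config are hypothetical names used only for illustration (this is not the ClConfig implementation), and the two parameter names are taken from the sentence above.

```python
from typing import List, Optional

from pydantic import BaseModel, ValidationError


class RunParams(BaseModel):
    # Hypothetical subset of run parameters; None means "not provided".
    verbose: Optional[bool] = None
    modules_order: Optional[List[str]] = None


def update_config(config: dict, **kwargs) -> dict:
    """Validate keyword arguments and copy only the ones passed into config."""
    unknown = set(kwargs) - set(RunParams.__annotations__)
    if unknown:
        raise TypeError(f"Unknown parameters: {sorted(unknown)}")
    params = RunParams(**kwargs)  # raises ValidationError on bad types
    for name in kwargs:
        config[name] = getattr(params, name)
    return config


config = update_config(
    {"verbose": False}, verbose=True, modules_order=["fastqc", "salmon"]
)

# A value of the wrong type is rejected before it touches the config.
try:
    update_config({}, modules_order=42)
except ValidationError:
    print("rejected invalid modules_order")
```

Only the parameters that were actually passed are overwritten, so a call that sets verbose alone leaves every other setting untouched.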

Can also add custom sections into the report with multiqc.add_custom_content_section().

The write_report() function triggers HTML rendering and module ordering, adds the special-case software versions and run performance modules, and writes the data and the report. All of these steps are also separated internally and can be exposed more granularly if needed.
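
Adding a custom section and then writing the report could be sketched as follows. The keyword arguments used here (name, anchor, description, content, output_dir, force) are assumptions about the signatures rather than something this description confirms, and mqc again stands for the imported multiqc module.

```python
def add_note_and_write(mqc, output_dir, note_html):
    """Append a custom section to the report, then render and write it."""
    # Keyword names below are assumed, not confirmed by the description.
    mqc.add_custom_content_section(
        name="Analyst notes",
        anchor="analyst_notes",
        description="Free-text notes added from a notebook.",
        content=note_html,
    )
    # Only at this point is HTML rendered and anything written to disk.
    mqc.write_report(output_dir=output_dir, force=True)


# Notebook usage (requires MultiQC to be installed; the path is a placeholder):
#   import multiqc
#   multiqc.parse_logs("run1/")
#   add_note_and_write(multiqc, "report/", "<p>Sample 3 was re-sequenced.</p>")
```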

Example notebook

Check this notebook for example usage:

https://github.com/MultiQC/example-notebook/blob/master/multiqc_example.ipynb

Performance benchmark

Tested the branch against main (with all recent performance improvements merged into both). The refactoring alone gives a consistent speedup:

main
Run took 103.14 seconds

5.14s: Searching files
54.13s: Running modules
20.40s: Compressing report data
3628933760 peak memory footprint

interactive-use-2
Run took 77.98 seconds

5.15s: Searching files
29.62s: Running modules
18.11s: Compressing report data
3638501952 peak memory footprint

Before performance PRs (196c9738)
Run took 115.36 seconds

11.50s: Searching files
55.16s: Running modules
35.53s: Compressing report data
5196197376 peak memory footprint
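
To put the main and interactive-use-2 numbers side by side, the relative reduction per stage can be computed directly from the timings listed above. This is only arithmetic on the reported figures; the dictionary layout and the reduction helper are illustrative.

```python
# Timings in seconds, copied from the benchmark output above.
timings = {
    "main": {"total": 103.14, "modules": 54.13, "compress": 20.40},
    "interactive-use-2": {"total": 77.98, "modules": 29.62, "compress": 18.11},
}


def reduction(before: float, after: float) -> float:
    """Fraction of time saved relative to the baseline."""
    return (before - after) / before


for stage in ("total", "modules", "compress"):
    saved = reduction(timings["main"][stage], timings["interactive-use-2"][stage])
    print(f"{stage}: {saved:.1%} less time than main")
```

This works out to roughly 24% less total run time, 45% less time running modules, and 11% less time compressing report data, while the peak memory footprint is essentially unchanged (about 0.3% higher).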

vladsavelyev force-pushed the interactive-use-2 branch 2 times, most recently from 467c43b to 0a13d36 Compare March 19, 2024 16:38

vladsavelyev changed the base branch from main to refactor-module-3 March 19, 2024 16:49

vladsavelyev force-pushed the interactive-use-2 branch from 0a13d36 to 852319a Compare March 19, 2024 17:11

vladsavelyev changed the base branch from refactor-module-3 to split-up-main March 19, 2024 17:11

vladsavelyev force-pushed the split-up-main branch from 7cd5838 to 00e29bf Compare March 19, 2024 18:23

vladsavelyev force-pushed the interactive-use-2 branch from 852319a to 7ec4962 Compare March 19, 2024 18:24

vladsavelyev force-pushed the split-up-main branch from 00e29bf to adfcf4a Compare March 19, 2024 19:04

vladsavelyev force-pushed the interactive-use-2 branch from 7ec4962 to cc3242c Compare March 19, 2024 19:06

vladsavelyev added 2 commits March 19, 2024 22:17

Add types to report module

98d5590

Break up the main multiqc run function

732aac3

vladsavelyev force-pushed the split-up-main branch from adfcf4a to 732aac3 Compare March 19, 2024 21:18

vladsavelyev added 2 commits March 19, 2024 22:18

Initial setup for interactive use. WIP

7319f10

Allow multiple DOI

7f31d3c

vladsavelyev force-pushed the interactive-use-2 branch from cc3242c to 7f31d3c Compare March 19, 2024 21:18

vladsavelyev added 5 commits March 20, 2024 14:36

Use Pydantic for plots and datasets

2cd380d

Complete Pydantic for all plot types, implement show_plot, support add…

dc7a241

…ing to a run

Add script to print module plotting stats

b0430f5

Support table. Add dt to plot model

0706450

Salmon: remove non-standard fields from intermediate data

dd1a551

vladsavelyev added the core: refactoring Code refactoring label Mar 20, 2024

vladsavelyev changed the title ~~Functions for interactive use~~ Pydantic + interactive use Mar 20, 2024

vladsavelyev added 4 commits March 20, 2024 23:53

Merge branch 'main' into report-types

4b7e54d

Merge branch 'report-types' into split-up-main

d39f78c

Merge branch 'split-up-main' into interactive-use-2

0d11e6a

No flat plot width for show function

651c267

vladsavelyev added this to the MultiQC v1.22: Pydantic milestone Mar 20, 2024

vladsavelyev added 3 commits March 21, 2024 00:14

Add pydantic dep

1aedfd8

Fix heatmap and scatter

9e57d7b

Fix table_object.py when keys are different

17667b9

vladsavelyev added 6 commits May 2, 2024 01:50

Support sections with name=None (e.g. custom content)

fd595ab

Merge branch 'main' into interactive-use-2

e2b7fc1

Merge branch 'main' into interactive-use-2

cd681ad

Move files around

275e32f

Make ClConfig a Pydantic model

efb8560

Fixes and expose ClConfig

20aeb63

vladsavelyev marked this pull request as ready for review May 2, 2024 19:44

vladsavelyev added 3 commits May 2, 2024 21:51

Move search patterns

c0825e6

Point to dev branch of website in CI for moving search patterns

908592e

Interactive functions to be configured with params rather than ClConfig

bc4f246

This was referenced May 3, 2024

Add API (function) docs #2530

Open

Unit testing for core library #2531

Open

vladsavelyev added 13 commits May 3, 2024 12:18

Merge branch 'main' into interactive-use-2

29076a8

Fix type hint

892e5b3

Apply ignore sample in add_data_source

e335183

Add unit test

708aca5

Make table empty message a debug message

a98ac5f

Support flat flag in show_plot

27fa8cf

Fix import rich console

43dd3f5

Fix loading user configs

c91caa5

Pass config.kwargs

9463c50

Make config.analysis_dir absolute

42209d0

Merge branch 'main' into interactive-use-2

4b4709a

Remove violin debug messages

1a46b3b

Rename pydantic models

a15effe

vladsavelyev force-pushed the interactive-use-2 branch from a63ae73 to a15effe Compare May 4, 2024 11:53

vladsavelyev changed the title ~~Pydantic + interactive use~~ Interactive use + Pydantic models for plots May 4, 2024

vladsavelyev merged commit ccbb5c5 into main May 4, 2024
7 checks passed

vladsavelyev deleted the interactive-use-2 branch May 4, 2024 11:54

vladsavelyev mentioned this pull request May 6, 2024

Refactor core code to use Pydantic objects #1790

Open
