new module: sincei #1946

vivekbhr · 2023-07-04T17:07:56Z

Added a sincei module to summarise read-level and count-level single-cell QC metrics from sincei.

multiqc/modules/sincei/scFilterStats.py

vladsavelyev

Thanks a lot for the contribution!

First, a minor comment: I'd refactor the parsing so it doesn't assume the column order (i.e. indexing with numbers like cols[0], cols[1]). Since the header is available, I'd rather use csv to parse the file, e.g. reader = csv.DictReader(open(filename), delimiter=","), and index by column name (e.g. cols['Cell_ID'], row["# Reads"], etc.), to make it more robust to future changes to the columns and their order. It would probably also be good to use the same column names in the MultiQC table whenever reasonable.
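A minimal sketch of the header-based parsing suggested above, using only the standard library. The column names Total_sampled and Filtered and the example rows are assumptions for illustration, not the actual sincei output; the delimiter is left as a parameter because the thread mentions both a comma delimiter and TSV files.

```python
import csv
import io

# Hypothetical excerpt of a per-cell table. Only "Cell_ID" is named in the
# thread; the other column names and all values are made up for illustration.
EXAMPLE = "Cell_ID\tTotal_sampled\tFiltered\ncellA\t100\t10\ncellB\t200\t50\n"


def parse_by_header(handle, delimiter="\t"):
    """Return {cell_id: {column_name: float}} using header names, not positions."""
    reader = csv.DictReader(handle, delimiter=delimiter)
    parsed = {}
    for row in reader:
        # Look the ID up by name, so reordering columns does not break parsing.
        cell_id = row.pop("Cell_ID")
        parsed[cell_id] = {name: float(value) for name, value in row.items()}
    return parsed


cells = parse_by_header(io.StringIO(EXAMPLE))
print(cells["cellB"]["Filtered"])  # 50.0
```

Because every value is looked up by its header name, a file whose columns appear in a different order parses to the same dictionary.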

A bigger concern is that using cells as samples is not what MultiQC was designed for. The generated beeswarm plot already hangs my browser even on the test data (I know the beeswarm is really inefficient in the first place, and the problem will probably go away with a future new plotting library; but not until we add more data). MultiQC is designed to work with summarised per-sample metrics, and I don't think cells can be counted as samples: there are far too many of them, and the data feels too raw to be fed into MultiQC.

It would be good if more summarised per-sample stats could be produced here, e.g. with a different tool downstream.

ewels

Thanks for the PR @vivekbhr! Like @vladsavelyev, I think that we need to rethink what data goes into the report here. The docs have a section called "Don't add everything", which I think applies. MultiQC should only summarise, but here we're really just reporting on raw data.

The TSV files have a column called sample (e.g. sortChIC-BM-SL1-k4me1-1), so maybe we could summarise stats for all cells under a given sample identifier?

vivekbhr · 2023-08-29T13:41:52Z

Thanks a lot for the review and comments, @vladsavelyev and @ewels. I understand your point: I experienced the slow response of the output report myself when using bigger data, so summarizing per-sample stats, such as a sample-wise range and median value for each field, could be a better way.

Do you think it's OK to do these calculations from the output .tsv file in the multiqc module, or are you saying that this output should already be reported by sincei? I think calculating the sample-wise range and median should not be a big overhead for the multiqc module.
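As a rough illustration of the sample-wise median and range proposed above, the following sketch groups per-cell rows by the sample column and summarises one metric. The sample column is mentioned in the thread; the Cell_ID and Total_sampled names, the sample labels, and all values are hypothetical, and this is not the code that ended up in the module.

```python
import csv
import io
from collections import defaultdict
from statistics import median

# Hypothetical per-cell table. The "sample" column is mentioned in the thread;
# the other column names and all values are assumptions for illustration.
EXAMPLE = (
    "Cell_ID\tsample\tTotal_sampled\n"
    "c1\tS1\t100\n"
    "c2\tS1\t300\n"
    "c3\tS1\t200\n"
    "c4\tS2\t50\n"
    "c5\tS2\t150\n"
)


def summarise_per_sample(handle, metric, sample_col="sample"):
    """Collapse per-cell rows into a median and range for each sample."""
    values = defaultdict(list)
    for row in csv.DictReader(handle, delimiter="\t"):
        values[row[sample_col]].append(float(row[metric]))
    return {
        sample: {"median": median(vals), "min": min(vals), "max": max(vals)}
        for sample, vals in values.items()
    }


summary = summarise_per_sample(io.StringIO(EXAMPLE), "Total_sampled")
print(summary["S1"])  # {'median': 200.0, 'min': 100.0, 'max': 300.0}
```

The result has one entry per sample rather than one per cell, so the number of report rows no longer grows with the number of cells.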

vladsavelyev · 2023-09-01T10:05:24Z

> Do you think it's OK to do these calculations from the output .tsv file in the multiqc module, or are you saying that this output should already be reported by sincei? I think calculating the sample-wise range and median should not be a big overhead for the multiqc module.

I think it should be totally alright to summarise it in MultiQC!

vivekbhr · 2023-12-10T07:35:02Z

@vladsavelyev @ewels I've finally found time to complete the requested changes. The multiqc module now reports per-sample summaries (median values) instead of per-cell values. Also, the column headers are now used instead of integer positions. The other minor comments are also resolved. I hope the changes are satisfactory now.

Best,
Vivek

vivekbhr · 2023-12-19T05:29:05Z

@vladsavelyev @ewels I have made all the requested changes, but merging is still blocked by "changes requested". Please let me know if there are still unresolved issues. Thanks!

ewels · 2023-12-19T11:38:43Z

Thanks @vivekbhr! We'll get back to this to review the updated code as soon as we can.

vivekbhr · 2024-05-06T13:21:42Z

@ewels @vladsavelyev Did you find time in the end to review the updates?

vladsavelyev reviewed Aug 25, 2023

multiqc/modules/sincei/scFilterStats.py (four review threads, all resolved; three marked outdated)

vladsavelyev self-requested a review August 25, 2023 16:32

vladsavelyev requested changes Aug 25, 2023

vladsavelyev added the "waiting: response" label (Waiting for more information from user) Aug 25, 2023

ewels requested changes Aug 28, 2023

vladsavelyev added the "waiting: changes" label (Issue / PR is on hold, waiting for requested changes) and removed the "waiting: response" label (Waiting for more information from user) Sep 16, 2023

vivekbhr added 10 commits December 10, 2023 00:13

new module: sincei

517fdd7

added scCountQC

b80301e

sincei intro

0010a01

black reformatted

79aacdd

sort import

44abb26

resolve merge conflict with master

cbe0a68

reformat

1f777e5

switched to beeswarm, color as recommended

bde7016

imports

670f0dc

updated sicnei module to use sample names and take median values per sample

31a8c72

vivekbhr force-pushed the master branch from 7c8c72d to 31a8c72 Compare December 9, 2023 18:47

vivekbhr added 5 commits December 10, 2023 00:31

lint

f2ebaf8

software version

76d96f9

ruff format

49e81ec

ruff format

8e952ad

ruff format

09536b1

ewels removed the "waiting: changes" label (Issue / PR is on hold, waiting for requested changes) Dec 10, 2023

vladsavelyev added this to the MultiQC v1.19 milestone Dec 11, 2023

Merge branch 'master' into master

b62d54b

vladsavelyev removed this from the MultiQC v1.19 milestone Dec 13, 2023

vivekbhr mentioned this pull request May 9, 2024

MultiQC integration bhardwaj-lab/sincei#13

Closed
