Example stats #96

Phlya · 2018-04-25T10:42:26Z

Hi guys,

It would be great to add some example stats for good and bad experiments, with explanations of which step in the protocol might have failed and how to recognize this. I am currently troubleshooting why my recent Hi-C experiments have not been working well, and with the coded pair type annotation this part is a little more complicated than I expected.

Also, more of a pairsamtools issue, but related: W pair type is still called C there in the docs (it's the same thing, right? I seem to remember it mentioned at some point).

Phlya · 2018-05-13T03:47:55Z

Also... As we briefly discussed with Sergey and Johan, having fragment-level stats like in hiclib and other pipelines (e.g. dangling ends, self-circles etc) would be very helpful for troubleshooting failed experiments.

sergpolly · 2018-05-15T16:44:29Z

Let's collect all of the stats update requests in one place.
So far we have this:
https://github.com/mirnylab/pairsamtools/issues/59
https://github.com/mirnylab/pairsamtools/issues/56
https://github.com/mirnylab/pairsamtools/issues/54
https://github.com/mirnylab/pairsamtools/issues/5
#94
#90

Please, @golobor, @Phlya, @nvictus, review the list and prioritize, and let's go from there.

gfudenberg · 2018-05-15T17:13:34Z

I couldn't find these in the referenced posts, but it would also be nice to have:
a) P(s) for each read orientation separately as well; this is useful for finding where the curves converge and reads can be interpreted as "just measuring contact frequency"
b) the number of reads involving mitochondria is a nice stat (mito_vs_anyReads, mito_vs_mito, etc.)
c) number of single-sided and double-sided read pairs
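
For illustration, points (a) and (b) could be prototyped directly on a .pairs file along these lines. This is only a minimal sketch, not pairsamtools code: the function names, the `bins_per_decade` parameter and the `chrM` mitochondrial contig name are assumptions, and it relies only on the standard .pairs column order (readID, chrom1, pos1, chrom2, pos2, strand1, strand2).

```python
import math
from collections import Counter, defaultdict


def iter_pairs(lines):
    """Yield (chrom1, pos1, chrom2, pos2, strand1, strand2) from .pairs text lines."""
    for line in lines:
        if line.startswith("#") or not line.strip():
            continue  # skip header and blank lines
        f = line.rstrip("\n").split("\t")
        yield f[1], int(f[2]), f[3], int(f[4]), f[5], f[6]


def orientation_distance_counts(lines, bins_per_decade=4):
    """Count cis pairs per orientation ('++', '+-', '-+', '--') and log10-distance bin."""
    counts = defaultdict(Counter)
    for chrom1, pos1, chrom2, pos2, strand1, strand2 in iter_pairs(lines):
        if chrom1 != chrom2:
            continue  # trans pairs have no genomic separation
        dist = abs(pos2 - pos1)
        if dist == 0:
            continue  # log10(0) is undefined
        log_bin = math.floor(math.log10(dist) * bins_per_decade)
        counts[strand1 + strand2][log_bin] += 1
    return counts


def mito_counts(lines, mito_name="chrM"):
    """Tally pairs by how many sides map to the (assumed) mitochondrial contig."""
    labels = ("no_mito", "mito_vs_other", "mito_vs_mito")
    tally = Counter()
    for chrom1, _, chrom2, _, _, _ in iter_pairs(lines):
        n_mito = (chrom1 == mito_name) + (chrom2 == mito_name)
        tally[labels[n_mito]] += 1
    return tally
```

Dividing each bin's count by its width and plotting the four orientation curves together would show the separation beyond which they overlap.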

Phlya · 2018-05-15T18:01:48Z

I personally think that adding new kinds of stats in principle is more important, and the different saving/printing options can be implemented later. Also, I don't think having optical dups is important (can we really do anything about them when preparing libraries? I doubt it...), but maybe I misunderstand something.

Phlya · 2018-05-15T20:32:38Z

Also note that the fragment-level stats would require matching pairs with fragments... But perhaps with both inputs sorted and indexed it won't be very expensive?
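
To make the cost concrete, here is a rough sketch of the matching step, assuming the restriction fragment start coordinates are available as a sorted list per chromosome. The names `fragment_index` and `classify_pair` and the category labels are hypothetical, and the dangling-end / self-circle rule follows the usual convention (same fragment, reads pointing toward each other vs. away from each other), not necessarily what hiclib does in detail.

```python
from bisect import bisect_right


def fragment_index(cut_sites, pos):
    """Index of the fragment containing pos.

    cut_sites is a sorted list of fragment start coordinates, beginning with 0.
    """
    return bisect_right(cut_sites, pos) - 1


def classify_pair(cut_sites_by_chrom, chrom1, pos1, strand1, chrom2, pos2, strand2):
    """Assign a pair to a fragment-level category (labels are illustrative)."""
    if chrom1 != chrom2:
        return "trans"
    sites = cut_sites_by_chrom[chrom1]
    if fragment_index(sites, pos1) != fragment_index(sites, pos2):
        return "different_fragments"
    # Order the two sides by position so the strand test is unambiguous.
    if pos1 > pos2:
        strand1, strand2 = strand2, strand1
    if strand1 == "+" and strand2 == "-":
        return "dangling_end"  # reads point toward each other
    if strand1 == "-" and strand2 == "+":
        return "self_circle"  # reads point away from each other
    return "same_strand"
```

Each lookup is a binary search, so the cost is O(log n) per side; with both inputs sorted, a single merge-style pass over pairs and fragments would avoid even that.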

Phlya · 2018-05-20T15:41:06Z

I guess it should be possible to address @gfudenberg's point (a) quite easily, since these counts are all already present in the output of stats - https://github.com/mirnylab/pairsamtools/issues/68. Although perhaps the bins could be optimized a bit to produce smoother curves?

But are plots in the output planned? As an html/pdf report with different things, or just a folder with individual pngs/pdfs? Should their generation be part of stats, or a separate job that just takes the output of stats?

golobor · 2018-05-20T15:43:32Z

I know that DCIC was working on QC for pairs for a while; the results are here: https://github.com/4dn-dcic/pairsqc


Phlya · 2018-05-20T16:33:27Z

Yeah, I've seen that and even tried to install it once, without success. But considering that pairsamtools stats already calculates so many things, I don't think there is any point in using another QC tool?

Phlya mentioned this issue May 18, 2018

Fragment-level analysis open2c/pairtools#68

Open

sergpolly mentioned this issue Jan 29, 2020

Make a module for multi-qc open2c/pairtools#78

Closed
