Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

What causes differences in meth_qual distributions between samples? #141

Open
billytcl opened this issue Nov 27, 2023 · 1 comment
Open

Comments

@billytcl
Copy link

Looking at a few cfDNA samples across different runs, we've noticed instances where the meth_qual distribution can vary widely quite a bit. Eg. some samples are strongly piled up near 0 or 255, and others less so.

Since it's all cfDNA and all using the sample prep, I was wondering if there are nuances in the way remora calls methylation that we should consider or whether there is a way to bioinformatically batch correct/normalize for these differences?

@marcus1487
Copy link
Collaborator

There are a number of factors which can effect the probabilities output by the Remora model. These include the overall modified base context (including modified bases in close proximity; within 10 bases of one another). Additionally there may be some run to run variability contributing to the output probabilities. I would suggest that normalizing these may not be advisable. The model is fundamentally outputting a lower confidence at the calls which is likely meaningful. There may be settings where normalization of these output probabilities can be beneficial, but I would try to avoid this for most generic analyses.

We are certainly aiming to have these probabilities constrained to a more consistent distribution both with modeling and increased consistency on the platform. I hope this helps, but please post more details if you have particular downstream analyses which require that these probabilities be normalized.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants