Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Sparse representation of genotype posterior #182

Open
timothymillar opened this issue Apr 9, 2024 · 1 comment
Open

Sparse representation of genotype posterior #182

timothymillar opened this issue Apr 9, 2024 · 1 comment
Assignees
Labels
enhancement New feature or request

Comments

@timothymillar
Copy link
Collaborator

Investigate a sparse encoding of genotype posteriors. E.g. an equivalent to the PP field (phred-scaled probabilities) in which zeoro values are omitted. This can be represented as a map of genotype index to non-zero phred scaled probabilities. This effectively removes genotypes with probabilities <= 0.1. An example may look like "0=10,2=3,7=1" and have the String type in VCF.

@timothymillar timothymillar added the enhancement New feature or request label Apr 9, 2024
@timothymillar timothymillar self-assigned this Apr 9, 2024
@timothymillar
Copy link
Collaborator Author

We could also use a sparse equivalent of GP if we specify a minimum posterior probability to report. E.g., >= 0.01 would work well with MCMC approximations. Alternatively, we could report a phred score of 0 for non-zero values, but this is confusing.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

1 participant