How to reuse variants converted from vcf to json for population genotyping? #58

Open
malemay opened this issue Nov 30, 2020 · 0 comments


The repository README recommends running multigrmpy.py independently for several samples when running an analysis at population scale.

My understanding is that multigrmpy.py first converts the input vcf to a set of .json files written to a temporary directory. These .json files are then used by the grmpy program to carry out genotyping.

If my understanding is right, the conversion step runs once per launch of multigrmpy.py, whereas it only needs to happen once for a given vcf. This wastes computing time and creates many more temporary files than needed, which causes problems on my system because the large number of temporary files becomes hard to manage.

Therefore, I would like to know whether there is a way to use the tools provided by Paragraph to first convert the vcf file to a set of .json files, and then use those files as input for genotyping. I believe this would not be too complicated, but I can't figure out how to do it from the information provided.
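
To make the workflow I have in mind concrete, here is a rough sketch of the "convert once, genotype many times" pattern. It is only an illustration: `convert` and `genotype` are hypothetical placeholders that the caller would supply, not functions or command-line options from Paragraph, and the `variants_json` directory name is made up.

```python
from pathlib import Path
from typing import Callable, Dict, List


def genotype_population(
    vcf: Path,
    samples: List[str],
    convert: Callable[[Path, Path], List[Path]],
    genotype: Callable[[str, List[Path]], str],
    workdir: Path,
) -> Dict[str, str]:
    """Convert the VCF once, then reuse the JSON files for every sample.

    `convert` and `genotype` are hypothetical callables standing in for
    whatever conversion and genotyping steps are actually available.
    """
    # Hypothetical directory that holds the shared JSON files.
    json_dir = workdir / "variants_json"
    json_dir.mkdir(parents=True, exist_ok=True)

    # Single conversion step, shared by all samples.
    json_files = convert(vcf, json_dir)

    # One genotyping call per sample, all reading the same JSON files.
    return {sample: genotype(sample, json_files) for sample in samples}
```

With this structure the conversion cost and the number of intermediate files stay constant no matter how many samples are genotyped, which is the behaviour I am hoping to reproduce with the real tools.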
