calculate Tajima'D on haploid #28

Open
AlexWanghaoming opened this issue Jul 29, 2020 · 2 comments

Comments

@AlexWanghaoming

Dear developers,
My fungal strains are haploid and I would like to calculate Tajima's D with vk tajima. The documentation says the VCF file must contain diploid sites, and the result I get is puzzling. I called SNPs with bcftools:
bcftools call --ploidy 1
Could you please help me solve this?
Thanks,
Alex

@danielecook
Contributor

Hi @AlexWanghaoming - I would recommend simply replacing your genotype calls from 0 or 1 to 0/0 and 1/1 in the VCF. Then you should be able to run VCF kit with a ploidy state of 1. I'm not sure whether this will affect your calculation, but I do not think it will.
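The genotype replacement suggested above could be sketched roughly as follows. This is a minimal, standard-library-only illustration and not part of VCF kit; the function name `diploidize_vcf_line` is hypothetical, and it assumes a tab-delimited, uncompressed VCF where haploid calls appear as a single allele (`0`, `1`, or `.`) in the GT field.

```python
def diploidize_vcf_line(line: str) -> str:
    """Rewrite haploid GT calls (0, 1, .) as homozygous diploid calls (0/0, 1/1, ./.)."""
    if line.startswith("#"):
        return line  # header lines pass through unchanged
    fields = line.rstrip("\n").split("\t")
    if len(fields) < 10:
        return line  # no FORMAT/sample columns to rewrite
    fmt = fields[8].split(":")
    if "GT" not in fmt:
        return line
    gt_idx = fmt.index("GT")
    for i in range(9, len(fields)):
        parts = fields[i].split(":")
        if gt_idx >= len(parts):
            continue  # sample entry is missing the GT subfield
        gt = parts[gt_idx]
        # Only touch calls that are not already diploid or phased.
        if "/" not in gt and "|" not in gt:
            parts[gt_idx] = f"{gt}/{gt}"
            fields[i] = ":".join(parts)
    newline = "\n" if line.endswith("\n") else ""
    return "\t".join(fields) + newline


if __name__ == "__main__":
    example = "chr1\t100\t.\tA\tG\t50\tPASS\t.\tGT:DP\t0:12\t1:8\t.:0\n"
    print(diploidize_vcf_line(example), end="")
```

Applied line by line to the output of `bcftools call --ploidy 1`, this leaves headers and already-diploid calls untouched.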

@Rohit-Satyam

Hi @danielecook, my VCF file contains genotype calls in 0/0 format. I got it from MalariaGen, and I am trying to calculate Tajima's D (TD) for the two most polymorphic genes in Plasmodium (which is haploid), yet I get negative TD values for them. I am not sure whether the subcommand vk tajima is doing the right thing, because highly polymorphic genes should have high TD values.

Here are the subsetted VCF files for the same genes:
File1
File2

PF3D7_0930300 Pf3D7_09_v3:1201305-1207576
PF3D7_1133400 Pf3D7_11_v3:1292966-1296696
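
As a cross-check on values like these, Tajima's D can be computed by hand from per-site allele counts. The sketch below uses the standard formula from Tajima (1989) for n haploid samples; it is not VCF kit's implementation, and the function name `tajimas_d` and its inputs are hypothetical. Note that D compares two diversity estimators (pairwise diversity versus the number of segregating sites), so its sign depends on the allele frequency spectrum rather than on the number of variant sites alone.

```python
import math


def tajimas_d(alt_counts, n):
    """Tajima's D from per-site alternate-allele counts among n haploid samples.

    alt_counts: one count per biallelic site (0 < count < n for segregating sites).
    """
    seg = [k for k in alt_counts if 0 < k < n]  # keep segregating sites only
    s = len(seg)
    if n < 2 or s == 0:
        return float("nan")  # D is undefined without segregating sites
    # Mean number of pairwise differences (pi), summed over sites.
    pi = sum(2.0 * k * (n - k) / (n * (n - 1)) for k in seg)
    a1 = sum(1.0 / i for i in range(1, n))
    a2 = sum(1.0 / i**2 for i in range(1, n))
    b1 = (n + 1) / (3.0 * (n - 1))
    b2 = 2.0 * (n * n + n + 3) / (9.0 * n * (n - 1))
    c1 = b1 - 1.0 / a1
    c2 = b2 - (n + 2) / (a1 * n) + a2 / a1**2
    e1 = c1 / a1
    e2 = c2 / (a1**2 + a2)
    return (pi - s / a1) / math.sqrt(e1 * s + e2 * s * (s - 1))


if __name__ == "__main__":
    # Four singleton sites vs. four intermediate-frequency sites, 10 samples each.
    print(tajimas_d([1, 1, 1, 1], 10))  # excess of rare alleles: negative D
    print(tajimas_d([5, 5, 5, 5], 10))  # intermediate frequencies: positive D
```

With the same number of segregating sites, the two toy inputs give opposite signs, so a region with many low-frequency variants can yield a negative D.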
