Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Create GDS file from imputed data using dosage || No variable 'annotation/format/DS' in the FORMAT field. #71

Open
complexgenome opened this issue Aug 4, 2021 · 2 comments

Comments

@complexgenome
Copy link

complexgenome commented Aug 4, 2021

Hi @zhengxw-ab

I am interested to create GDS file using VCF from imputed data. I would like to keep dosage information intact in this process.
I use command as:

seqVCF2GDS("CHR22.recode.vcf.gz","check.gds",verbose=TRUE,genotype.var.name="annotation/format/DS",scenario=c("imputation"))

I get messages as:

 verbose=TRUE,genotype.var.name="annotation/format/DS",scenario=c("imputation"))
Wed Aug  4 13:14:47 2021
Variant Call Format (VCF) Import:
    file(s):
        CHR22_CHGWAS_rsq80_MAC10.recode.vcf.gz (1.8G)
    file format: VCFv4.1
    the number of sets of chromosomes (ploidy): 2
    the number of samples: 12,508
    genotype storage: bit2
    compression method: LZMA_RA
    # of samples: 12508
    scenario: imputation
        annotation/format/DS: packedreal16
        annotation/format/GP: packedreal16
No variable 'annotation/format/DS' in the FORMAT field.
Output:
    check.gds
Parsing 'CHR22_CHGWAS_rsq80_MAC10.recode.vcf.gz':

It says 'annotation/format/DS' in the FORMAT field.

How do I ensure to guide seqVCF2GDS function to pick dosage value?

Thanks,

@complexgenome complexgenome changed the title Create GDS file || No variable 'annotation/format/DS' in the FORMAT field. Create GDS file from imputed data using dosage || No variable 'annotation/format/DS' in the FORMAT field. Aug 4, 2021
@zhengxwen
Copy link
Owner

zhengxwen commented Aug 4, 2021

You misused "genotype.var.name", dosages are always stored in 'annotation/format/DS'.
Remove ,genotype.var.name="annotation/format/DS"

@complexgenome
Copy link
Author

@zhengxwen thank you for your reply.

Would following command be fine to tell seqVCF2GDS to use dosage values?
SeqArray::seqVCF2GDS(vcf.fn,output_file, verbose=TRUE,scenario=c("imputation"))

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants