I'm trying to run the hybrid score test with -doAsso 5 and received the cited error "Problem computing cdf for chiseq, input is NaN, will exit". Notably, the score test (-doAsso 2) and latent genotype (-doAsso 4) models both run on the same data. I would guess the issue lies with my data, but I'm not sure how to filter to avoid it. I have tried with both the conda and github versions of ANGSD. Command line is below. Any thoughts? TIA
Without having access to the data it is difficult to know what causes it.
I am afraid it will take some trial and error to figure out which position is causing it, and why.
The program reports the chromosome and position of the last region of the genome for which it successfully completed an analysis. I would relaunch the program from the last known good position and print progress for every chunk with -howoften 1.
I would continue until I know the exact position that causes the issue, and then look more closely at the sequencing data for that position. My guess is that a not so widely used combination of filters leaves too little data at that site, which in effect turns into NaN in the analysis. But of course I don't know.
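As a rough sketch of the relaunch step described above: the snippet below only builds the extra arguments and prints them, it does not run ANGSD. The chromosome name and position are hypothetical placeholders for whatever the log last reported, and the helper name resume_region is made up for illustration. It assumes the region is passed with -r in the chr:start- form (from start to the end of that chromosome) in place of the -rf regions file used in the original command; adjust if you would rather trim the regions file instead.

```shell
#!/usr/bin/env bash
# Sketch only: build an ANGSD region string that resumes from the last
# position the program reported as completed. All values are hypothetical.

# resume_region CHR POS -> "CHR:POS-" (from POS to the end of CHR)
resume_region() {
    local chr="$1" pos="$2"
    printf '%s:%s-\n' "$chr" "$pos"
}

last_chr="chr01"     # hypothetical: last chromosome printed in the log
last_pos=1234567     # hypothetical: last position printed in the log

region="$(resume_region "$last_chr" "$last_pos")"

# Print the arguments to add to the original command (assumed to replace
# the -rf option), with progress reported for every chunk.
echo "-r ${region} -howoften 1"
```

Each time the run fails again, the last reported position becomes the new start, which narrows the window until the failing site is isolated.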
Sorry for not being more helpful and for the late reply.
Thanks, I will give that a try. It just seems odd that the constituent models (-doAsso 2 and 4) would run fine but the hybrid would trip. I suppose if the hybrid is just a time saver over the latent genotype model, then there's no major concern other than time...
/home/swillis/bin/angsd/angsd -b ocean-summer-NOR-male-bam.txt -out OUT_doAssoc2/ocean-summer-NOR-male-pheno-fork-assoc -ref /data/genomes/Ots/Otsh_v2.0/Ots2.0_LGnamed.fasta -sites OUTp/Ots_AAM_angsd.ALL.majorminor.txt -rf OUTp/Ots_AAM_angsd.ALL.regions.txt -uniqueOnly 1 -remove_bads 1 -only_proper_pairs 1 -trim 0 -C 50 -baq 1 -minMapQ 20 -minQ 20 -doCounts 1 -minInd 10 -setMinDepthInd 3 -GL 1 -nThreads 8 -doMajorMinor 3 -SNP_pval 0.000001 -doSnpStat 1 -sb_pval 0.001 -doHWE 1 -maxHetFreq 0.95 -doMaf 1 -minMaf 0.05 -doPost 1 -skipTriallelic 0.05 -doAsso 5 -minHigh 1 -minCount 10 -yQuant ocean-summer-NOR-male-pheno-fork.txt -cov ocean-summer-NOR-male-cov.txt -Pvalue 1 |& tee log_doAssoc5_ocean-summer-NOR-male-pheno-fork.txt