Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Removing Mitochondrial hits is not consistent between replicate and pseudo replicates #116

Open
amirshams84 opened this issue May 17, 2018 · 1 comment

Comments

@amirshams84
Copy link

---Removing Mit Hit for replicate

zcat AB2969-L1_R1.trim.PE2SE.nodup.tagAlign.gz | grep -P -v 'chrM' | gzip -nc > AB2969-L1_R1.trim.PE2SE.nodup.rm_chr.tmp.gz; mv AB2969-L1_R1.trim.PE2SE.nodup.rm_chr.tmp.gz AB2969-L1_R1.trim.PE2SE.nodup.tagAlign.gz

---Not Removing Mit Hit for Pseudo Replicates

zcat AB2969-L1_R1.trim.PE2SE.nodup.bedpe.gz | shuf --random-source=AB2969-L1_R1.trim.PE2SE.nodup.bedpe.gz | split -d -l $((nlines)) - AB2969-L1_R1.trim.PE2SE.nodup.

awk 'BEGIN{OFS="\t"}{printf "%s\t%s\t%s\tN\t1000\t%s\n%s\t%s\t%s\tN\t1000\t%s\n",$1,$2,$3,$9,$4,$5,$6,$10}' "AB2969-L1_R1.trim.PE2SE.nodup.00" | gzip -nc > AB2969-L1_R1.trim.PE2SE.nodup.pr1.tagAlign.gz

rm -f AB2969-L1_R1.trim.PE2SE.nodup.00

awk 'BEGIN{OFS="\t"}{printf "%s\t%s\t%s\tN\t1000\t%s\n%s\t%s\t%s\tN\t1000\t%s\n",$1,$2,$3,$9,$4,$5,$6,$10}' "AB2969-L1_R1.trim.PE2SE.nodup.01" | gzip -nc > AB2969-L1_R1.trim.PE2SE.nodup.pr2.tagAlign.gz

rm -f AB2969-L1_R1.trim.PE2SE.nodup.01

@leepc12
Copy link
Collaborator

leepc12 commented May 17, 2018

It looks like you are looking at old pipeline?

Pipeline no longer uses BEDPE to make pseudo replicates for paired end data sets. Please take a look at spr_tag_PE() in modules/postalign_bed.bds. It uses TAG-ALIGN to make pseudos.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants