Skip to content

A pipeline for variant calling of Exome using GATK best practice workflow

Notifications You must be signed in to change notification settings

drveera/exomepipe

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

25 Commits
 
 
 
 
 
 

Repository files navigation

THIS PIPELINE IS DEPRECATED

#Exomepipe This is a pipeline for variant calling of exomes using GATK's best practices workflow. In a nutshell, you input fastq files (or aligned bam files) and in just 3 steps, you get the vcf files.

Note: This pipeline is made to work inside the iPSYCH cluster environment where the required softwares and the resource files (GATK bundles) are readily available to the users. So the pipeline is fully functional only inside the iPSYCH cluster. If you would like to use the pipeline on your own system outside the iPSYCH cluster, please let me know.


##Main features of this pipeline

  • The workflow is scripted using GNU make and so there will be no unwanted repetition when you resubmit the jobs. Only those files that are required will be created and other files will be reused. This saves time and cluster resource.
  • The pipeline will submit the slurm jobs for you with appropriate settings and will make sure effective utilization of cluster nodes.

##Installation

####Clone the repository git clone https://github.com/veera-dr/exomepipe.git ####run the install script (it just adds an alias inside bashprofile) sh install.sh


##Quick reference

####Step1 - generate g.vcf files from fastq files exomepipe --fqlist <value> --outdir <value> --bed <value>

####Step2 - combine the g.vcf files exomepipe --gvcflist <value> --outdir <value>

####Step3 - genotype g.vcf files exomepipe --gzlist <value> --outdir <value>


##Step 1

####Usage exomepipe --fqlist <value> --outdir <value> --bed <value>

or

exomepipe --bamlist <value> --outdir <value> --bed <value>

Argument Value
--fqlist a file with the list of fastq files
--bamlist a file with the list of bam files
--outdir folder where you need the output
--bed exome capture intervals

always use full path when specifying the files or folders

the fq files should have extension '.fq1'

##Step 2

####Usage exomepipe --gvcflist <value> --outdir <value>

Argument Value
--gvcflist a file with the list of g.vcf files generated from step 1
--outdir folder where you need the output

##Step 3

####Usage exomepipe --gzlist <value> --outdir <value>

Argument Value
--gzlist a file with the list of g.vcf files generated from step 1
--outdir folder where you need the output

About

A pipeline for variant calling of Exome using GATK best practice workflow

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published