LOGAN 🔬

LOGAN-whoLe genOme-sequencinG Analysis pipeliNe. Call germline and somatic variants, CNVs, and SVs and annotate variants!

Overview

Welcome to LOGAN! Before getting started, we highly recommend reading through LOGAN's documentation.

LOGAN is a comprehensive whole genome-sequencing pipeline following the Broad's set of best practices. It relies on technologies like Singularity¹ to maintain the highest-level of reproducibility. The pipeline consists of a series of data processing and quality-control steps orchestrated by Nextflow², a flexible and scalable workflow management system, to submit jobs to a cluster or cloud provider.

Before getting started, we highly recommend reading through the usage section of each available sub command.

For more information about issues or trouble-shooting a problem, please checkout our FAQ prior to opening an issue on Github.

Original pipelining and code forked from the CCBR Exome-seek Pipeline Exome-seek and OpenOmics

Dependencies

Requires: singularity>=3.5 nextflow>=22.10.2

singularity must be installed on the target system. Snakemake orchestrates the execution of each step in the pipeline. To guarantee the highest level of reproducibility, each step relies on versioned images from DockerHub. Nextflow uses singularity to pull these images onto the local filesystem prior to job execution, and as so, nextflow and singularity are the only two dependencies.

Setup

LOGAN is installed on the Biowulf in the ccbrpipeliner module. Please clone this repository to your local filesystem using the following command:

# start an interactive node
sinteractive --mem=2g --cpus-per-task=2 --gres=lscratch:200
# load the ccbrpipeliener module
module load ccbrpipeliner

Usage

LOGAN supports either

Input Files

LOGAN supports inputs of either

paired end fastq files

--fastq_input- A glob can be used to include all FASTQ files. Like --fastq_input "*R{1,2}.fastq.gz". Globbing requires quotes.

Pre aligned BAM files with BAI indices

--bam_input- A glob can be used to include all FASTQ files. Like --bam_input "*.bam". Globbing requires quotes.

A sheet that indicates the sample name and either FASTQs or BAM file locations

--fastq_file_input- A headerless tab delimited sheet that has the sample name, R1, and R2 file locations

--bam_file_input - A headerless tab delimited sheet that has the sample name, bam, and bam index (bai) file locations

Operating Modes

1. Paired Tumor/Normal Mode

Required for Paired Tumor/Normal Mode

--sample_sheet In Paired mode a sample sheet must be provided with the basename of the Tumor and Normal samples. This sheet must be Tab separated with a header for Tumor and Normal.

2. Tumor only mode

No flags are required

Calling Mode

Adding flags determines SNV (germline and/or somatic), SV, and/or CNV calling modes

--vc- Enables somatic SNV calling using mutect2, vardict, varscan, octopus, sage, MUSE (TN only), and lofreq (TN only)

--germline- Enables germline using DV

--sv- Enables somatic SV calling using Manta and SVABA

--cnv- Enables somatic CNV calling using FREEC, Sequenza, and Purple (hg38 only)

Optional Arguments

--indelrealign - Enables indel realignment when running alignment steps. May be helpful for certain callers (VarScan, VarDict)

--callers- Comma separated argument for callers, the default is to use all available.
Example: --callers mutect2,octopus

--cnvcallers- - Comma separated argument for cnvcallers. Adding flag allows only certain callers to run.
Example: --cnvcallers purple

Running LOGAN

Example of Tumor_Normal calling mode

# copy the logan config files to your current directory
logan init
# preview the logan jobs that will run 
logan run --mode local -profile ci_stub --genome hg38 --sample_sheet samplesheet.tsv --outdir out --fastq_input "*R{1,2}.fastq.gz" -preview --vc --sv --cnv
# run a stub/dryrun of the logan jobs 
logan run --mode local -profile ci_stub --genome hg38 --sample_sheet samplesheet.tsv --outdir out --fastq_input "*R{1,2}.fastq.gz" -stub --vc --sv --cnv
# launch a logan run on slurm with the test dataset
logan run --mode slurm -profile biowulf,slurm --genome hg38 --sample_sheet samplesheet.tsv --outdir out --fastq_input "*R{1,2}.fastq.gz" --vc --sv --cnv

Example of Tumor only calling mode

# copy the logan config files to your current directory
logan init
# preview the logan jobs that will run 
logan run --mode local -profile ci_stub --genome hg38 --outdir out --fastq_input "*R{1,2}.fastq.gz" --callers octopus,mutect2 -preview --vc --sv --cnv
# run a stub/dryrun of the logan jobs 
logan run --mode local -profile ci_stub --genome hg38 --outdir out --fastq_input "*R{1,2}.fastq.gz" --callers octopus,mutect2 -stub --vc --sv --cnv
# launch a logan run on slurm with the test dataset
logan run --mode slurm -profile biowulf,slurm --genome hg38 --outdir out --fastq_input "*R{1,2}.fastq.gz" --callers octopus,mutect2 --vc --sv --cnv

We currently support the hg38, hg19 (in progress), and mm10 genomes.

Contribute

This site is a living document, created for and by members like you. LOGAN is maintained by the members of CCBR and is improved by continous feedback! We encourage you to contribute new content and make improvements to existing content via pull request to our repository.

References

This repo was originally generated from the CCBR Nextflow Template.

^{1. Kurtzer GM, Sochat V, Bauer MW (2017). Singularity: Scientific containers for mobility of compute. PLoS ONE 12(5): e0177459.}

Name		Name	Last commit message	Last commit date
Latest commit History 296 Commits
.github		.github
.tests		.tests
assets		assets
bin		bin
conf		conf
docker		docker
docs		docs
lib		lib
modules/local		modules/local
src		src
subworkflows/local		subworkflows/local
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
.prettierignore		.prettierignore
.prettierrc		.prettierrc
CHANGELOG.md		CHANGELOG.md
CITATION.cff		CITATION.cff
LICENSE		LICENSE
README.md		README.md
VERSION		VERSION
main.nf		main.nf
mkdocs.yml		mkdocs.yml
nextflow.config		nextflow.config
pyproject.toml		pyproject.toml
setup.py		setup.py

License

CCBR/LOGAN

Folders and files

Latest commit

History

Repository files navigation

LOGAN 🔬

Overview

Dependencies

Setup

Usage

Input Files

Operating Modes

1. Paired Tumor/Normal Mode

2. Tumor only mode

Calling Mode

Optional Arguments

Running LOGAN

Contribute

References

About

Topics

Resources

License

Stars

Watchers

Forks

Languages