Alignment and mapping statistics

Workflow

A Snakemake workflow has been developed for aligning the raw data to the reference genome and retrieving mapping statistics. The pipeline is made up of three files:

Snakefile - Python-based file with the core instructions, structured as a chain of modules/rules
Config file - YAML file that contains the technical details of the workflow (e.g. input/output files, wildcards, software paths and versions)
Cluster file - JSON file including the cluster details for each rule (e.g. memory and cores requested, log location)
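As a rough illustration of how a sample list and wildcards from the config file turn into concrete file names, the sketch below mimics the combinatorial behaviour of Snakemake's `expand` helper in plain Python. The sample names, the `label` wildcard and the `{sample}_{label}.bam` pattern are hypothetical placeholders, not the values used in this repository.

```python
from itertools import product

# Hypothetical stand-in for values that would be read from config.yaml
config = {
    "samples": ["sample_01", "sample_02"],  # placeholder sample names
    "labels": ["UCD", "Angus"],             # wildcard values mentioned in the Notes
}


def expand_pattern(pattern, **wildcards):
    """Fill `pattern` with every combination of the given wildcard values."""
    names = list(wildcards)
    combos = product(*(wildcards[name] for name in names))
    return [pattern.format(**dict(zip(names, combo))) for combo in combos]


# Hypothetical file-name pattern; the real one is defined in the Snakefile
targets = expand_pattern(
    "{sample}_{label}.bam",
    sample=config["samples"],
    label=config["labels"],
)
```

Here `targets` holds one file name per sample/label combination, which is the kind of list a final "all" rule would typically request.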

Additionally, a bash script has been written for retrieving the mapping statistics. This script is triggered as part of the Snakemake workflow.

Submission

The workflow is submitted with the following bash script:

#!/bin/bash

module load python_gpu/3.7.4

snakemake --jobs 500 -rp --latency-wait 40 --keep-going --rerun-incomplete --cluster-config cluster.json --cluster "bsub -J {cluster.jobname} -n {cluster.ncore} -W {cluster.jobtime} -oo {cluster.logi} -R \"rusage[mem={cluster.memo}]\""
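To make the `{cluster.*}` placeholders in the command above more concrete, here is a minimal Python sketch of how they could be filled from the cluster file for a given rule. It assumes the cluster file uses Snakemake's `__default__` entry with per-rule overrides; the rule name `bwa_mapping` and all values are hypothetical, and Snakemake performs this substitution internally rather than through code like this.

```python
import json
from types import SimpleNamespace

# Hypothetical cluster.json content; the real values live in the repository's cluster file
CLUSTER_JSON = """
{
  "__default__": {"jobname": "snake_job", "ncore": 1, "jobtime": "04:00",
                  "logi": "log_folder/default.log", "memo": 2000},
  "bwa_mapping": {"jobname": "bwa_mapping", "ncore": 8, "jobtime": "24:00",
                  "logi": "log_folder/bwa_mapping/job.log", "memo": 4000}
}
"""

# Same template as passed to --cluster in the submission script
SUBMIT_TEMPLATE = (
    "bsub -J {cluster.jobname} -n {cluster.ncore} -W {cluster.jobtime} "
    '-oo {cluster.logi} -R "rusage[mem={cluster.memo}]"'
)


def resolve_cluster_settings(cluster_config, rule_name):
    """Start from the __default__ entry and overlay the rule-specific entry."""
    settings = dict(cluster_config.get("__default__", {}))
    settings.update(cluster_config.get(rule_name, {}))
    return settings


def render_submit_command(cluster_config, rule_name):
    """Fill the bsub template with the settings resolved for one rule."""
    settings = resolve_cluster_settings(cluster_config, rule_name)
    return SUBMIT_TEMPLATE.format(cluster=SimpleNamespace(**settings))


cluster_config = json.loads(CLUSTER_JSON)
mapping_cmd = render_submit_command(cluster_config, "bwa_mapping")
fallback_cmd = render_submit_command(cluster_config, "some_other_rule")
```

A rule without its own entry, such as the made-up `some_other_rule`, falls back to the `__default__` values.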

Notes

Input files need to be named following the wildcard patterns (UCD/Angus in our case)
Sample names are provided as a Python list in the config.yaml file. The actual sample names are not included for privacy reasons, but an example is provided
Log files are written to the locations specified in cluster.json only if the corresponding folders already exist inside log_folder
A PDF of the workflow graph (DAG) can be created as follows:

#!/bin/bash
# Load the Python and Graphviz modules
module load python_gpu/3.6.4
module load gcc/4.8.5 graphviz/2.40.1
name="workflow_name"
# Emit the full DAG in dot format and render it as a PDF
snakemake --forceall --dag | dot -Tpdf > "${name}_dag.pdf"
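Regarding the note on log files: since logs are only written when their folders already exist, one option is to create the folders before submitting. The sketch below is a hypothetical helper, not part of the repository. It assumes each entry in cluster.json stores its log path under the `logi` key, as the submission command suggests.

```python
import os


def log_directories(cluster_config, log_key="logi"):
    """Collect the parent directory of every log path in the cluster config."""
    directories = set()
    for settings in cluster_config.values():
        log_path = settings.get(log_key)
        if not log_path:
            continue  # entry has no log path
        parent = os.path.dirname(log_path)
        if parent:
            directories.add(parent)
    return sorted(directories)


def create_log_directories(cluster_config, base_dir="."):
    """Create each log directory under base_dir; existing ones are left untouched."""
    created = []
    for directory in log_directories(cluster_config):
        full_path = os.path.join(base_dir, directory)
        os.makedirs(full_path, exist_ok=True)
        created.append(full_path)
    return created
```

A typical call would load the file with `json.load(open("cluster.json"))` and pass the result to `create_log_directories` from the workflow directory, before running the submission script.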
