Analysis pipeline for ONT long read data

This is a Snakemake pipeline for comprehensive analysis of longread sequencing data. It is mainly intented for research and method establishment at the Institute of medicinal genetics and applied genomics (IMGAG) of UKT Tübingen but can be adapted easily to fit your requirements.

Overview

Detailed information for the transcriptome analysis can be found in the document at doc/transcriptome_analysis.md.

Analyses

Installation

Dependencies:

Snakemake (pip install snakemake)
Conda
Docker/Singularity for Variant Calling with Pepper-Margin-Deepvariant

For cDNA (Transcriptome) analysis you need to manually install the following tool and adjust the application paths in the config file:

SQANTI3

All other required tools will be installed into generated Conda environments.

Configuration

All configuration takes place in a config file (config.yml) located in the work directory. Look into the default settings (config/config_defaults) for a list of options. Most important is to to change the path to the genome reference and select required analysis steps.

The pipeline should be run in an analysis folder that will contain final and intermediate results. The location of the raw data is defined by the sample_run_table.tsv placed in the working dir. In the simplest form this is a two-column tab separated textfile containing sample names and data folder:

sample1
sample1
sample2

For more config options look into the input configuration.

Run the pipeline

Assuming you are in the working directory which contains a config.yml and a sample_run_table.tsv run the pipeline with:

snakemake --use-conda -s /path_to_repo/megLR/workflow/Snakefile

Name		Name	Last commit message	Last commit date
Latest commit History 251 Commits
config		config
doc		doc
lr_scripts		lr_scripts
resources		resources
tools		tools
workflow		workflow
.gitignore		.gitignore
README.md		README.md
TODO		TODO

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

config

config

doc

doc

lr_scripts

lr_scripts

resources

resources

tools

tools

workflow

workflow

.gitignore

.gitignore

README.md

README.md

TODO

TODO

Repository files navigation

Analysis pipeline for ONT long read data

Overview

Analyses

Installation

Configuration

Run the pipeline

About

Releases 1

Packages

Contributors 2

Languages

imgag/megLR

Folders and files

Latest commit

History

Repository files navigation

Analysis pipeline for ONT long read data

Overview

Analyses

Installation

Configuration

Run the pipeline

About

Resources

Stars

Watchers

Forks

Languages