snakemake_transcriptome_workflow

Workflow for de novo transcriptome assembly from paired-end reads, protein prediction and annotation, and quality checks. It includes:

Read quality check (FastQC)
De novo transcriptome assembly (Trinity)
Assembly quality checks (with bowtie2 and support scripts from Trinity)
Prediction of open reading frames (TransDecoder)
Functional annotation (with blastp and HMMER)

This workflow needs the following software

Snakemake: https://snakemake.readthedocs.io/en/stable/index.html

To run every step successfully, you need to pre-install FastQC, Trinity, TransDecoder, BLAST, and HMMER.

Installation can be done with the conda package manager, using the Bioconda channel.
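
Before launching the workflow, it can help to confirm that the tools are actually visible on your PATH. The snippet below is a minimal sketch, not part of the workflow itself. The executable names (for example TransDecoder.LongOrfs and hmmscan) are assumptions about which commands the rules call, so adjust the list to match your installation. If the rules provide their own conda environments through --use-conda, some of these tools may not need to be on PATH at all.

```shell
# Print the name of every tool from the argument list that is not on PATH.
missing_tools() {
    for tool in "$@"; do
        command -v "$tool" >/dev/null 2>&1 || echo "$tool"
    done
}

# Assumed executable names; edit to match what your installation provides.
missing=$(missing_tools snakemake fastqc Trinity TransDecoder.LongOrfs blastp hmmscan bowtie2)

if [ -n "$missing" ]; then
    # $missing is left unquoted on purpose so the names print on one line.
    echo "Missing tools:" $missing
else
    echo "All required tools found"
fi
```

Any tool reported as missing can then be installed with conda from the Bioconda channel before running the workflow.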

Usage

From the workflow directory run the example command lines:

Dry-run

snakemake -np --use-conda

Run

snakemake --cores <max_n_cores> -p --use-conda

Generate pipeline diagram (requires Graphviz for the dot command)

snakemake --dag --use-conda | dot -Tsvg > dag.svg

Run new input

Move new FASTQ files to input/fastq_files
Change sample names in config/sample.tsv
Update resources/databases with those needed to run blast and hmmer
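
When adding new reads, a quick check that every sample has both mates can save a failed run. The following is a rough sketch that lists the sample names found in input/fastq_files. The naming pattern <sample>_R1.fastq.gz / <sample>_R2.fastq.gz is an assumption for illustration, not something the workflow is known to require, and the layout of config/sample.tsv is not covered here.

```shell
# List sample names that have both mate files in the given directory.
# Assumes the hypothetical pattern <sample>_R1.fastq.gz / <sample>_R2.fastq.gz.
list_samples() {
    for f in "$1"/*_R1.fastq.gz; do
        # Skip the unexpanded glob when no file matches.
        [ -e "$f" ] || continue
        name=$(basename "$f" _R1.fastq.gz)
        if [ -e "$1/${name}_R2.fastq.gz" ]; then
            echo "$name"
        else
            echo "warning: no R2 file for $name" >&2
        fi
    done
}

list_samples input/fastq_files
```

The printed names can be compared against the entries in config/sample.tsv before starting a run.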
