JAMIRA WORKFLOW

– a reproducible and scalable workflow for prokaryote genomic data analysis designed for the genera Enterococcus spp.

JAMIRA WORKFLOW

JAMIRA is a Bioinformatics Workflow for Integrative Exploration of Genomic Features of bacterias including:

Virulence factors (ABRICATE);
Resistome profile (RGI);
Plasmid prediction (ABRICATE);
Prophage prediction (IslandPath);
Genomic Islands prediction (Phispy);

Prerequisites

Getting Started

To run JAMIRA you need to install Conda (prerequisites). JAMIRA Workflow is intended to be executed in a Conda environment to ensure data reproducibillity and modularization among different genomic tools used in this pipeline. Thus, for each tool an isolated Conda environment was created, in which it encapsulates all the software dependencies necessary for execution.

Python version 3.7 is recommended.

Note: this tutorial was done using the Linux operating system. We believe that the same steps can be reproduced on macOS.

Configuration

After complete Conda installation you need to add the necessary files present in this github in your conda folder. Follow the steps:

Add the bioconda channel with the following commands:

conda config --add channels defaults
conda config --add channels bioconda
conda config --add channels conda-forge

Create a conda environment for JAMIRA with the following command:

conda env create -f envs/config.yaml -n jamira

Activate your JAMIRA environment:

conda activate jamira

Install SnakeMake:

mamba install -c conda-forge -c bioconda snakemake

Install rename, if you don't have yet:

sudo apt install rename

Congratulations! The JAMIRA workflow is ready to be used!

How to run JAMIRA workflow:

Complete JAMIRA workflow can be executed with a single concise command line call.

snakemake --use-conda

Addionally the user can run the complete workflow specyfing the number of cores (e.g 4 cores):

snakemake -j 4 --use-conda

Additional features

Generate a summary of jamira workflow in HTML format:

After completing the workflow execution, the pipeline provides an option to generate a summary web report in HTML format. The interactive report can be generated with the following command call:

snakemake -n --report myresults.html

Generate a visual representation of JAMIRA workflow:

You can create a Direct Acyclic Graph (DAG) representation to visualize all steps executed with jamira workflow

Display your DAG representation:

snakemake -j 4 --use-conda -n --dag | dot -Tsvg | display

Save your DAG representation in SVG format:

snakemake -j 4 --use-conda -n --dag | dot -Tsvg > dag.svg

JAMIRA WORKFLOW MODULES

JAMIRA incorporate a collection of modules for specific data analysis tasks commonly applied in comparative genomic studies, such as: (i) virulence genes identification; (ii) antimicrobial resistance genes identification; (iii) plasmid sequences prediction; (iv) genomic islands prediction and (v) prophage prediction.

Genomic Islands Prediction

The genomic islands prediction module searches large segments of exogenous DNA inserted into bacterial genomes, well known as genomic islands (GIs), frequently associated with particular adaptations of microbes that are of medical, agricultural, or environmental importance.

In this workflow we use IslandPath-DIMOB:

IslandPath-DIMOB is a standalone software to predict genomic islands in bacterial and archaeal genomes based on the presence of dinucleotide biases and mobility genes.

Please cite

Bertelli and Brinkman, 2018
Hsiao et al., 2005

Prophage Prediction

The prophage prediction module searches for mobile elements, responsible for carrying and disseminate virulence factors and antimicrobial resistance genes between bacteria.

In this workflow we use PhiSpy to identify the most likely prophage regions in Bacterial genomes.

Please cite

Akhter et al., 2012

Antimicrobial Resistance Identification

The antimicrobial resistance identification module enables the prediction of complete resistome profiles from genomic data.

In this workflow we use RGI to predict resistomes based on homology and SNP models.

Please cite

Jia et al., 2017

Plasmid Prediction

The plasmid prediction module searches for well-known replicon sequences to detect related plasmids that are often associated with antimicrobial resistance in clinically relevant bacterial pathogens.

In this workflow we use ABRICATE to perform a BLAST against a curated database of plasmid sequences, PlasmidFinder database (Carattoli et al., 2014).

Please cite

Seemann, 2018 Carattoli et al., 2014

Virulence factors identification

The plasmid prediction module searches for well-known replicon sequences to detect related plasmids that are often associated with antimicrobial resistance in clinically relevant bacterial pathogens.

In this workflow we use ABRICATE to perform a BLAST against a curated database of virulence factors related to bacterial pathogens, VFDB database (Chen et al., 2016).

Please cite

Seemann, 2018 Chen et al., 2016

Authors

Ícaro Castro - Workflow development - Github
Rafaella Bueno - Web server development - Github
Robson Ruiz - Web server development - Github

Enteromar Group

Learn more about our projects: Enteromar Group

Name		Name	Last commit message	Last commit date
Latest commit History 43 Commits
backup_fastas		backup_fastas
data/samples		data/samples
databases		databases
envs		envs
report_description		report_description
scripts		scripts
src		src
README.md		README.md
Snakefile		Snakefile
_config.yml		_config.yml
run_pipeline.sh		run_pipeline.sh
teste_docker.sh		teste_docker.sh

enteromar/JAMIRA

Folders and files

Latest commit

History

Repository files navigation

– a reproducible and scalable workflow for prokaryote genomic data analysis designed for the genera Enterococcus spp.

JAMIRA WORKFLOW

Prerequisites

Getting Started

Configuration

How to run JAMIRA workflow:

Additional features

Generate a summary of jamira workflow in HTML format:

Generate a visual representation of JAMIRA workflow:

JAMIRA WORKFLOW MODULES

Genomic Islands Prediction

Please cite

Prophage Prediction

Please cite

Antimicrobial Resistance Identification

Please cite

Plasmid Prediction

Please cite

Virulence factors identification

Please cite

Authors

Enteromar Group

About

Topics

Resources

Stars

Watchers

Forks

Languages