Releases: bactopia/bactopia
Releases · bactopia/bactopia
v3.0.1
v3.0.1 bactopia/bactopia "That's My Girl" - 2024/03/25
"That's My Girl" is a phrase my youngest says quite often when playing with her friends
Added
- Updated
bactopia-assembler
to include updated Medaka models - profile for
arcc_hawk
- TB Profiler results are now merged with
collate
command - sample name to plasmidfinder results
- support config files from nf-core/configs
- updated AMRFinder+ database for v3.12.8
- bump program versions in modules
abritamr
: 1.0.14 -> 1.0.17bactopia-assembler
: 1.0.3 -> 1.0.4bactopia-teton
: 1.0.1 -> 1.0.2bactopia-variants
: 1.0.1 -> 1.0.2bakta
: 1.8.2 -> 1.9.3blast
: 2.14.1 -> 2.15.0busco
: 5.5.0 -> 5.7.0gubbins
: 3.3.0 -> 3.3.4multiqc
: 1.15 -> 1.21mashtree
: 1.4.5 -> 1.4.6mykrobe
: 0.12.2 -> 0.13.0ncbi-amrfinderplus
: 3.11.18 -> 3.12.8panaroo
: 1.3.3 -> 1.4.2rgi
: 6.0.2 -> 6.0.3sistr
: 1.1.1 -> 1.1.2stecfinder
: 1.1.0 -> 1.1.1tbprofiler
: 5.0.0 -> 6.1.0
Fixed
- variable name in sketcher module
--para-off
not correctly implemented in PIRATE module- extra space in RGI container image name
- sourmash version STDERR not parsed out
- FASTQC writing to /tmp dir on HPC when non-readable
- abricate and ariba not using output subdirectory
- nextflow tower typos @iferres
- phispy not working with Bakta genbank files
- missed check of
--download_bakta
param in main workflow - quast not working when estimated genome size is 0
abricate
now gets database subdirectories
Enhancements to OSS
- fix amrfinderplus pinning in abritamr bioconda/bioconda-recipes#46714
- adjust python pinning in gubbins bioconda/bioconda-recipes#46713
- fix issue with sistr container bioconda/bioconda-recipes#46712
- update rgi pinning for pyrodigal bioconda/bioconda-recipes#46669
- pin tabix version in snippy bioconda/bioconda-recipes#46458
v3.0.0
bactopia-v3-announcement.mp4
v3.0.0 bactopia/bactopia "Black Cat and Brown Dog" - 2023/09/11
"Black Cat and Brown Dog" are in memory of Stinky ("Black Cat") and Twotee ("Brown Dog")
Added
- Bactopia Tools (
bactopia --wf <NAME>
)abritamr
- A NATA accredited tool for AMR detectionblastn
- Search against nucleotide BLAST databases using nucleotide queriesblastp
- Search against protein BLAST databases using protein queriesblastx
- Search against protein BLAST databases using translated nucleotide queriesbracken
- Taxonomic classification and species abundance estimation of sequence readsbtyper3
- Taxonomic classification of Bacillus cereus group isolatesmidas
- Estimate bacterial species abundances from FASTQ filesquast
- Assess the quality of assembled contigsphispy
- Predict prophages in bacterial genomespneumocat
- Assign capsular type to Streptococcus pneumoniae from sequence readssra-human-scrubber
- Scrub human reads from FASTQ filesstecfinder
- Serotyping Shigella toxin producing Escherichia coli genomestblastn
- Search against translated nucleotide BLAST databases using protein queriestblastx
- Search against translated nucleotide BLAST databases using translated nucleotide queries
- per-release databases for
amrfinderplus
andmlst
- new directory structure for outputs
- Renamed
--R1
,--R2
, and--SE
to--r1
,--r2
, and--se
--ont
now accepts a FASTQ file- GitHub Action to build environments
fastp
is default read cleaner, can use previous methods with--use_bbmap
- ability to use a BED file to mask regions in
snippy-core
--save_sketches
to save Mash sketches created duringmashtree
run5- Porechop is now optional (
--use_porechop
) - unified
publishDir
across modules - download datasets using
storeDir
BACTOPIA:DATASETS
ariba
sra-human-scrubber
- bump program versions in modules
bakta
: 1.6.0 -> 1.8.2blast
: 2.11.0 -> 2.14.1busco
: 5.4.3 -> 5.5.0csktk
: 0.25.0 -> 0.27.2eggnog-mapper
: 2.1.9 -> 2.1.12genotyphi
: 1.9.1 -> 2.0gtdbtk
: 2.1.1 -> 2.3.2gubbins
: 3.2.1 -> 3.3iqtree
: 2.2.0.3 -> 2.2.2.7fastani
: 1.33 -> 1.34mashtree
: 1.2.0 -> 1.4.5mob_suite
: 3.1.0 -> 3.1.7multiqc
: 1.11 -> 1.15mykrobe
: 0.12.0 -> 0.12.2ncbi-amrfinderplus
: 3.10.45 -> 3.11.18ncbi-genome-download
: 0.3.1 -> 0.3.3panaroo
: 1.3.0 -> 1.3.3pasty
: 1.0.0 -> 1.0.3phyloflash
: 3.4 -> 3.4.2rgi
: 6.0.1 -> 6.0.2shigatyper
: 2.0.3 -> 2.0.5shigeifinder
: 1.3.2 -> 1.3.5tbprofiler
: 4.4.0 -> 5.0.0
Fixed
- All modules correctly initiate
params.options
- OpenJDK java cpuset warning message (@idolawoye)
- Broken badges in README.md
- Pinned GSL to v2.6
- symlink in amrfinder+ update (
bactopia datasets
) - hardcoded
--plus
inamrfinderplus_run
- tests for new directory structure
--nanohq
not being properly passed todragonflye
Removed
- Bactopia Steps
ANNOTATE_GENOME
- now handled byprokka
orbakta
ANTIMICROBIAL_RESISTANCE
- now handled byamrfinderplus
CALL_VARIANTS
- now handled bysnippy
SEQUENCE_TYPE
- now handled bymlst
bactopia datasets
is now incorporated intobactopia
- Conda/Containers for all bactopia-main steps
- custom process labels, for generic
nf-core
process labels
Enhancements to OSS
- add module for midas nf-core/modules#2696
- add modules for sra-human-scrubber nf-core/modules#2694
- add module for stecfinder nf-core/modules#2702
- update midas pinnings to match docs bioconda/bioconda-recipes#38566
- add missing scripts to MLST bioconda/bioconda-recipes#38826
- fix GTDB-Tk container bioconda/bioconda-recipes#40333
- patch ncbi-genome-download bioconda-recipes#41640
- add module for pneumocat nf-core/modules#3592
- add module for abritamr/run nf-core/modules#3725
- fix hostrange() missing 1 required positional argument: 'database_directory' phac-nml/mob-suite#149
- add module for btyper3 nf-core/modules#3817
v2.2.0
v2.2.0 bactopia/bactopia "Tornado Tempo" - 2022/11/28
Added
--dragonflye_opts
to pass additional params to dragonflye--use_fastp
to use fastp for QCing readsbactopia datasets
no longer depends onariba
--skip_spell_check
inbactopia datasets
- updated organisms available from PubMLST
- custom profile for
wsvl
- updated citations
- use
mambaforge
for docker builds - Support tarball inputs for large databases
- Bactopia Tools (
bactopia --wf <NAME>
)pasty
- in silico serogrouping of Pseudomonas aeruginosa isolatespbptyper
- In silico Penicillin Binding Protein (PBP) typer for Streptococcus pneumoniae assembliesshigeifinder
- Serotyping Shigella and EIEC assemblies
- bump program versions in modules
bakta
-> 1.5.1bbmap
-> 39.01checkm-genome
-> 1.2.2csvtk
-> 0.25.0dragonflye
-> 1.0.13eggnog-mapper
-> 2.1.9fastani
-> 1.33fastq-scan
-> 1.0.1gamma
-> 2.2gtdbtk
-> 2.1.1iqtree
-> 2.2.0.3kleborate
-> 2.2.0mlst
-> 2.23.0mob_suite
-> 3.1.0mykrobe
-> 0.12.0nanoplot
-> 1.40.2ncbi-amrfinderplus
-> 3.10.45panaroo
-> 1.3.0pirate
-> 1.0.5rasusa
-> 0.7.0rgi
-> 6.0.1shigatyper
-> 2.0.3snpeff
-> 5.1sourmash
-> 4.5.0sra-tools
-> 3.0.0tb-profiler
-> 4.4.0
Fixed
- respect short polish hybrid assembly requests
- missing variable in
bakta_download
module - restructure to match
nf-core/modules
- compatibility with Nextflow >=22.10
- separate versions for Linux and OSX in
assemble_genomes
bactopia datasets
error messages switched to warnings
Enhancements to OSS
- bump Bakta to 1.5.0 nf-core/modules#2000
- add recipe for pasty bioconda/bioconda-recipes#35930
- add recipe for pbptyper bioconda/bioconda-recipes#36222
- add module for pasty nf-core/modules#2003
- add module for pbptyper nf-core/modules#2005
- add module for shigeifinder nf-core/modules#2523
v2.1.1
v2.1.1 bactopia/bactopia "Dragon Twister" - 2022/07/09
Added
- Add Panaroo to the
pangenome
Bactopia Tools - Add
genotyphi
andseroba
to Merlin - bump
fastq-dl
to v1.1.1 - Bactopia Tools (
bactopia --wf <NAME>
)busco
- Assembly completeness based on evolutionarily informed expectationsgenotyphi
- Salmonella Typhi genotyping with Mykrobe outputsgubbins
- Rapid phylogenetic analysis of recombinant bacterial sequencesmcroni
- Sequence variation in mobilized colistin resistance (mcr-1) genesmykrobe
- Antimicrobial resistance detection for specific speciespanaroo
- Pipeline for pangenome investigationsplasmidfinder
- Plasmid identification from assembliesseroba
- Serotyping of Streptococcus pneumoniae from sequence readssnippy
- Rapid variant calling from Illumina sequence reads with optional core-SNP phylogeny
Fixed
- improve GTDB db handling
- Properly capture assemblies with 0
- user provided adapters and phix fastas
--available_datasets
in thebactopia datasets
- name collisions in the Ariba Bactopia Tool
- bare double-quotes in Ariba reports
- updated tests for new version
Enhancements to OSS
- Add database to Seroba recipe bioconda/bioconda-recipes#35378
- Update ariba dependencies bioconda/bioconda-recipes#35383
- patch pymummer recipe bioconda/bioconda-recipes#35379
- Update PlasmidFinder for better container support bioconda/bioconda-recipes#35314
- add module for ariba nf-core/modules#1731
- add module for mcroni nf-core/modules#1750
- add module for Mykrobe and GenoTyphi nf-core/modules#1731
- add module for plasmidfinder nf-core/modules#1773
- add module for Seroba nf-core/modules#1731
- add module for snippy/core nf-core/modules#1855
- let tempfile determine temp dir location sanger-pathogens/seroba#68
- allow the user to specify temp dir or use the system default sanger-pathogens/pymummer#36
v2.1.0
v2.1.0 bactopia/bactopia "Silver Mist" - 2022/06/08
Added
--use_bakta
to replace Prokka with Bakta in main workflow- Split Bakta into two processes
download
andrun
- added
shigatyper
to Merlin - custom profile for
arcc
bactopia prepare
error message when nothing found, and--examples
for example use cases- renamed
--fastqs
to--samples
to better reflect its usage (fastqs and assemblies) - add
--check_samples
to validate user generated FOFNs --short_polish
for short read polishing for long-read assemblies- Moved Ariba to Bactopia Tools
- bump versions in Bactopia envs
- bump Bactopia Tool versions
amrfinder+
version to 3.10.30gtdb
version to 2.1.0mlst
version to 2.22.0
- Bactopia Tools (
bactopia --wf <NAME>
)ariba
- Gene identification by local assembliesgamma
- Identification, classification, and annotation of translated gene matchesshigatyper
: Shigella serotype from Illumina or Oxford Nanopore reads
Fixed
- @nickjhathaway Fixed assembly_level in
bactopia datasets
and autocall_variants
bactopia search
now supports ONT readsvcf-annotator
unable to write to/dev/stdout
- typo in
bactopia download
when using singularity bactopia download
will not rebuild envs its already built in a run--include_genus
withbactopia datasets
now works with multiple samples- Parameters to copy optional species-specific files in
bactopia datasets
- Don't use
AUTO
for IQ-TREE cpus gtdb
download working again- empty FASTQs after adapter and phix removal
- low read count error for ONT samples
- Removed
sequence_type
environment
v2.0.3
v2.0.3 bactopia/bactopia "King Cobra" - 2022/03/23
Added
- logo and new metro style workflow
- added
test
profile for easy testing mashdist
now outputs a merged TSV- print missing required parameters in Bactopia Tools
- bump bakta to v1.4.0
Fixed
- typo in
bakta
subworkflow ismapper
not available inbactopia-tools.nf
kraken2
not available inbactopia-tools.nf
- Traits file being required for
pangenome
mashtree
module not collecting all FASTA- tests for ISMapper
Full Changelog: v2.0.2...v2.0.3
v2.0.2
v2.0.2 bactopia/bactopia "Black Mamba" - 2022/02/22
Added
bactopia download
which merges thebactopia build
andbactopia pull
commands- modules can now use prebuilt envs
--available_species
to print the species in a bactopia dataset--bbduk_opts
to fine tune bbduk- set
quay
as default docker registry - Let Nextflow handle stdout and stderr
- added
hpsuissero
,legsta
,sistr
,ssuissero
to Merlin - Bactopia Tools (
bactopia --wf <NAME>
)amrfinderplus
: Identify antimicrobial resistance in genes or proteinsabricate
: Mass screening of contigs for antimicrobial and virulence genescheckm
: Assess the assembly quality of your sampleshpsuissero
: Serotype prediction of Haemophilus parasuis assemblieskraken2
: Taxonomic classification of sequence readslegsta
: Typing of Legionella pneumophila assembliesmlst
: Scan contig files against PubMLST typing schemesmobsuite
: Reconstruct and annotate plasmids in bacterial assembliesrgi
: Predict antibiotic resistance from assembliessistr
: Serovar prediction of Salmonella assembliesssuissero
: Serotype prediction of Streptococcus suis assemblies
Fixed
toInteger
onnull
genome size- Index out of bounds in mlst-blast.py (caused by non allele columns)
- FOFN will fail if sample name is
null
- improve available workflow logic
- PIRATE not exporting gene_presence_absence.csv correctly
- channel imports for
scoary
workflow argument type mismatch
when using--use_roary
- missing channel when
agrvate
not ran asstaphtyper
- float required options to the top when missing
- broken tests due to changes
bactopia datasets
including hidden directories- Bioperl pinnings for prokka and pirate
v2.0.1
v2.0.1 bactopia/bactopia "Red Roc" - 2022/01/06
Added
- Merlin for auto selection of species-specific tools based
--ask_merlin
to execute the Merlin subworkflow in Bactopia- Schema compatible with NF-Tower
--skip_qc_plots
to skip FastQC and Nanoplot- Disable CheckM by default, must use
--run_checkm
--cluster_opts
for passing additional options to SLURM- inclusion of local files (assembly, gff) in some subworkflows
- workflow for dev conda builds
- all subworkflows can be imported on other workflows
- Bactopia Tools (
bactopia --wf <NAME>
)gtdb
: Identify marker genes and assign taxonomic classificationsmashdist
: Calculate Mash distances between sequencesmerlin
: MinmER assisted species-specific bactopia tool seLectIoN
Fixed
- Resource over allocation for java related programs (-Xmx)
- ordering of skipped steps when
--datasets
not used - logic in container selection for nf-core modules
- channel manipulation in
ncbigenomedownload
- missing logic for
fastani
- version captures in subworkflows
- spell check for species with
[
or]
in their names - version outputs for Staphtyper subworkflow
- use process names in version.yml
CSVTK_CONCAT
empty channelsoptions.suffix
now used asprefix
in modules
v2.0.0
v2.0.0 bactopia/bactopia "Red Hawk" - 2021/12/05
With Bactopia v2 comes a lot of changes! I would like to extend a huge thanks to Davi Marcon and Abhinav Sharma for their work initially converting Bactopia to DSL2. Your efforts were the momentum I needed to get the ball rolling on Bactopia v2. Thank you very much for taking your time to make such a siginificant contribution!
Added
- support for Nanopore reads
staphopia
as a named pipeline (alias forbactopia --wf staphopia
) for S. aureus genomesbactopia/bactopia-tests
repo with test data- walkthrough for testing
bactopia-datasets/staphylococcus_aureus
repo with curatated S. aureus datasets- per-module testing via
pytest
(100+ tests and 7000+ outputs tested) - per-module
meta.yml
andparams.json
for auto-building docs site - framefork for adding new Bactopia Tools
- 19 total Bactopia Tools (
bactopia --wf <NAME>
)- Subworkflows (3)
eggnog
: Functional annotation of proteins using orthologous groups and phylogeniespangenome
: Pangenome analysis with optional core-genome phylogenystaphtyper
: Determine the agr, spa and SCCmec types for Staphylococcus aureus genomes
- Modules (16):
agrvate
: Rapid identification of Staphylococcus aureus agr locus type and agr operon variants.bakta
: Rapid annotation of bacterial genomes and plasmidsectyper
: In-silico prediction of Escherichia coli serotypeemmtyper
: emm-typing of Streptococcus pyogenes assembliesfastani
: fast alignment-free computation of whole-genome Average Nucleotide Identity (ANI)hicap
: Identify cap locus serotype and structure in your Haemophilus influenzae assembliesismapper
: Identify insertion sites positions in bacterial genomeskleborate
: Screening Klebsiella genome assemblies for MLST, sub-species, and other related genes of interestlissero
: Serogroup typing prediction for Listeria monocytogenesmashtree
: Quickly create a tree using Mash distancesmeningotype
: Serotyping of Neisseria meningitidisngmaster
: Multi-antigen sequence typing for Neisseria gonorrhoeaeseqsero2
: Salmonella serotype prediction from reads or assembliesspatyper
: Computational method for finding spa types in Staphylococcus aureusstaphopiasccmec
: Primer based SCCmec typing of Staphylococcus aureus genomestbprofiler
: Detect resistance and lineages of Mycobacterium tuberculosis genomes
- Subworkflows (3)
- Use
mamba
instead of conda for env building - Reduced total Conda envs/Docker containers down to 7 (previously 12 not including bactopia tools)
- default to compressed outputs (
--skip_compression
to output uncompressed outputs) - Tutorial outputs made available
- update github actions
Fixed
- Cache issue causing
-resume
to fail - amrfinder+ database not compatible error
- incorrectly parsed system memory
Adapted from nf-core
- nf-core pytest setup
- nf-core/modules for bactopia tools
- Bactopia v2 release contributed 20+ modules to nf-core/modules
- nf-core/tools arg parser
- adapted to import params and usage based on config file
Process Consolidation
makeblastdb
->assemble_genome
call_variants
,download_reference
->call_variants
fastq_status
,estiamte_genome_size
->gather_samples
count_31mers
->minmer_sketch
Removed
bactopia tools
-> Handled by Nextflow now (bactopia --wf <NAME>
)bactopia versions
-> Program versions are output ever run now.