Skip to content

Commit

Permalink
feat: BUSCO auto lineage mode, tempdir, updated version (#526)
Browse files Browse the repository at this point in the history
* Added tempdir to store all outputs that are copied as requested, options to specify dataset_dir, and new example

* Fixed linting

* Added full_table and missing_busco to output

* Fixed typo

* Added lineage checks

* Update bio/busco/environment.yaml

Co-authored-by: Johannes Köster <johannes.koester@uni-due.de>
  • Loading branch information
fgvieira and johanneskoester committed Aug 16, 2022
1 parent 754dfc1 commit 67cf8e1
Show file tree
Hide file tree
Showing 35 changed files with 175 additions and 13,300 deletions.
3 changes: 1 addition & 2 deletions bio/busco/environment.yaml
Expand Up @@ -3,5 +3,4 @@ channels:
- bioconda
- nodefaults
dependencies:
- busco==5.1.2

- busco =5.4
6 changes: 6 additions & 0 deletions bio/busco/meta.yaml
@@ -1,9 +1,15 @@
name: busco
description: |
Assess assembly and annotation completeness with BUSCO
url: https://busco.ezlab.org/
authors:
- Tessa Pierce
- Filipe G. Vieira
input:
- assembly fasta
output:
- annotation quality files
notes: |
* The `mode` parameter sets the assessment mode (mandatory), and only allows for arguments: "genome", "transcriptome", and "proteins".
* The `lineage` parameter sets the lineage dataset ot use (optional). In `auto-lineage` mode, `output` or `dataset` folders need to be retrieved as a whole, since it is not possible to infer the output file names (they depend on the best lineage match).
* The `extra` param allows for additional program arguments.
51 changes: 44 additions & 7 deletions bio/busco/test/Snakefile
@@ -1,16 +1,53 @@
rule run_busco:
input:
"transcripts.fasta"
"protein.fasta",
output:
directory("txome_busco")
short_json="txome_busco/short_summary.json",
short_txt="txome_busco/short_summary.txt",
full_table="txome_busco/full_table.tsv",
miss_list="txome_busco/busco_missing.tsv",
dataset_dir=directory("resources/busco_downloads"),
log:
"logs/quality/transcriptome_busco.log"
threads: 8
"logs/proteins_busco.log",
params:
mode="transcriptome",
mode="proteins",
lineage="stramenopiles_odb10",
downloads_path="resources/busco_downloads",
# optional parameters
extra=""
extra="",
threads: 8
wrapper:
"master/bio/busco"


rule run_busco_euk:
input:
"protein.fasta",
output:
out_dir=directory("txome_busco/euk"),
dataset_dir=directory("resources/busco_downloads"),
log:
"logs/proteins_busco_euk.log",
params:
mode="proteins",
# optional parameters
extra="--auto-lineage-euk",
threads: 8
wrapper:
"master/bio/busco"


rule run_busco_prok:
input:
"protein.fasta",
output:
out_dir=directory("txome_busco/prok"),
dataset_dir=directory("resources/busco_downloads"),
log:
"logs/proteins_busco_prok.log",
params:
mode="proteins",
# optional parameters
extra="--auto-lineage-prok",
threads: 8
wrapper:
"master/bio/busco"
64 changes: 64 additions & 0 deletions bio/busco/test/protein.fasta

Large diffs are not rendered by default.

77 changes: 0 additions & 77 deletions bio/busco/test/sample_data/example/ancestral

This file was deleted.

77 changes: 0 additions & 77 deletions bio/busco/test/sample_data/example/ancestral_variants

This file was deleted.

6 changes: 0 additions & 6 deletions bio/busco/test/sample_data/example/dataset.cfg

This file was deleted.

5 changes: 0 additions & 5 deletions bio/busco/test/sample_data/example/hmms/.directory

This file was deleted.

0 comments on commit 67cf8e1

Please sign in to comment.