Skip to content

schellt/busco2snap

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 
 
 

Repository files navigation

busco2snap.pl v0.2

Description

Creation of a SNAP model from BUSCO output.

Using MAKER's cegma2zff as archetype, busco2snap.pl creates a SNAP model from complete, single-copy BUSCOs by predicting the gene structure with the retrained Augustus model and converting to zff format. The tools augustus and blastp need to be in your $PATH. If fathom, forge and hmm-assembler.pl are in your $PATH the SNAP model will be created automatically.
If Augustus predicts more than one gene at a BUSCO locus, the predicted proteins are blasted against the ancestral variants of the corresponding BUSCO lineage. The gene(s) where the best hit (defined by highest bit score) is on the searched BUSCO are used only.

Tested with BUSCO versions 3.0.2, 4.1.4 and 5.2.2

Dependencies

busco2snap.pl needs the following Perl modules and will search for executables in your $PATH:

Mandatory:

Optional:

  • SNAP: fathom, forge, hmm-assembler.pl

Usage

busco2snap.pl -b <busco_dir>

Mandatory:
	-b STR		BUSCO output directory (run_*)
			(augustus_output/retraining_parameters/)

Options: [default]
	-dup		Include duplicated BUSCOs [off]
	-frag		Include fragmented BUSCOs [off]
	-o STR		Output directory [.]
	-pre STR	Prefix for file name of the SNAP model file [species model name]
	-t INT		Number of parallel processed BUSCOs [1]
	-kt		Keep temporary files [off]
	-v		Print executed commands to STDERR [off]
	-dry-run	Only print commands to STDERR instead of executing [off]

	-h or -help	Print this help and exit
	-version	Print version number and exit

Citation

If you use this tool please cite the dependencies: