Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

adjust parameters to improve assemblies. #269

Open
dgs108 opened this issue Apr 22, 2024 · 0 comments
Open

adjust parameters to improve assemblies. #269

dgs108 opened this issue Apr 22, 2024 · 0 comments

Comments

@dgs108
Copy link

dgs108 commented Apr 22, 2024

I have ~21X PacBio CCS reads and have produced multiple assemblies using hifiasm, flye, and wtdbg2. The number of contigs in each assembly range from 4,965 (hifiasm) to 20,315 (flye), but wtdbg2 produces the best overall assembly (5,004 contigs; N50: 2,788,165; largest contig: 14,599,089). However, I would like to improve this assembly and would appreciate advice on parameters.

The species is a hammerhead shark with genome size ~2.7 Gbp; sharks have very repetitive genomes. I have used wtdbg2 presets 1, 3, and 4, and preset 4 produced the assembly with the fewest contigs. Adding -L 5000 to -x ccs marginally reduced the number of contigs. My code is below.

wtdbg2 -x ccs -g 2.7g -i hifi_wo_adapters_mtdna.fastq.gz -t 20 -o wtdbg_02.21.24 -f -L5000

wtpoa-cns -i wtdbg_02.21.24.ctg.lay.gz -t 20 -o wtdbg_02.21.24.ctg.fa -f

Please advise on parameters to tweak and if i should polish between the steps above.

Thanks!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant