Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Question regarding -L (min variant length) parameter #65

Open
cjain7 opened this issue May 6, 2022 · 3 comments
Open

Question regarding -L (min variant length) parameter #65

cjain7 opened this issue May 6, 2022 · 3 comments
Labels
bug Something isn't working

Comments

@cjain7
Copy link

cjain7 commented May 6, 2022

Ideally, -L 1 should include maximum number of variants in genome graph, but this does not happen. Please see the runs below:

$ minigraph -cxggs -L 2 hap1.chr18.fa hap2.chr18.fa > output.gfa
[M::main::0.286*0.55] loaded the graph from "hap1.chr18.fa"
[M::mg_index::2.271*1.22] indexed the graph
[M::mg_opt_update::2.422*1.20] occ_max1=66; lc_max_occ=2
[M::ggen_map::2.900*1.07] loaded file "hap2.chr18.fa"
[M::ggen_map::24.119*1.54] mapped 2 sequence(s) to the graph
[M::mg_ggsimple_cigar::24.243*1.54] inserted 5736 events, including 67 inversions
[M::main] Version: 0.17-r529-dirty
[M::main] CMD: minigraph -cxggs -L 2 hap1.chr18.fa hap2.chr18.fa
[M::main] Real time: 24.353 sec; CPU: 37.454 sec; Peak RSS: 1.531 GB
$ minigraph -cxggs -L 1 hap1.chr18.fa hap2.chr18.fa > output.gfa
[M::main::0.286*0.55] loaded the graph from "hap1.chr18.fa"
[M::mg_index::2.271*1.22] indexed the graph
[M::mg_opt_update::2.422*1.20] occ_max1=66; lc_max_occ=2
[M::ggen_map::2.900*1.07] loaded file "hap2.chr18.fa"
[M::ggen_map::24.119*1.54] mapped 2 sequence(s) to the graph
[M::mg_ggsimple_cigar::24.243*1.54] inserted 0 events, including 0 inversions
[M::main] Version: 0.17-r529-dirty
[M::main] CMD: minigraph -cxggs -L 2 hap1.chr18.fa hap2.chr18.fa
[M::main] Real time: 24.353 sec; CPU: 37.454 sec; Peak RSS: 1.531 GB

I understand that -L 1 is far below the intended default value (50), but the above behaviour appears to be weird. Minigraph when run with -L 1 recognises 0 variants in the sequences. I was curious to visualise the graph that includes all het variants. This appears to be a bug.

Thanks!

@ASLeonard
Copy link

Did you test other values between 2 and 50 @cjain7 ? Curious if you observe a general increase in variants with decreasing L or if L<<50 just breaks down.

@cjain7
Copy link
Author

cjain7 commented May 6, 2022

I did observe a general increase while setting values 50, 10 and 2.

@lh3 lh3 added the bug Something isn't working label May 9, 2022
@lh3
Copy link
Owner

lh3 commented May 9, 2022

I never thought to support single base changes. Probably some part of the code makes incorrect assumptions and fails given -L 1. Need to check.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

No branches or pull requests

3 participants