Mappings not full-length, alignment extension problem? #86

Open
tobiasrausch opened this issue Jan 25, 2023 · 0 comments

Hi,

In minigraph mappings, the read bases before the reported query start and after the reported query end sometimes match the segment (S) sequence perfectly, so the alignment looks as if it should have been extended. Here is an example where I converted the GAF to BAM and treated the bases after the query end as soft-clipped to visualize them in IGV:

[Image: softclips]
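
For reference, the number of unaligned bases at each read end can be derived directly from the GAF columns (query length, query start, query end). Below is a minimal Python sketch, assuming the standard GAF column order with a 0-based, half-open query interval. The function name `unaligned_flanks` and the example record are hypothetical and are not actual minigraph output for the read in this issue:

```python
from typing import NamedTuple


class Flanks(NamedTuple):
    name: str
    head: int  # read bases before the reported query start
    tail: int  # read bases after the reported query end


def unaligned_flanks(gaf_line: str) -> Flanks:
    """Return the unaligned bases at both read ends, in read orientation."""
    fields = gaf_line.rstrip("\n").split("\t")
    # GAF columns 1-4: query name, query length, query start, query end
    name = fields[0]
    qlen, qstart, qend = int(fields[1]), int(fields[2]), int(fields[3])
    return Flanks(name, qstart, qlen - qend)


# Hypothetical GAF record, for illustration only
example = "read1\t1000\t10\t900\t+\t>s1>s2\t5000\t100\t990\t880\t890\t60"
flanks = unaligned_flanks(example)
print(flanks.name, flanks.head, flanks.tail)
```

These two lengths are what would become the leading and trailing soft clips (`S`) in a converted BAM record, before any strand handling.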

Below are the minigraph command and the read from the IGV image. The read aligns full-length with minimap2 against GRCh38 but gets "clipped" with minigraph:

minigraph --vc -k 15 -w 10 -cx lr GRCh38-90c.r518.gfa.gz read.fa

>79c62919-2910-4b7d-92d3-78016cd6679b
TGTACTTCGTTCAGTTACGTATTATAGGGGTGACCAGGGCCGGTTGAGCACTCACGTGTGGCTATCTCTGTGCTCTGTGGCAGGTGACGGATGGTGGCACCACTCAGCAAAGATATCTTCACCTTCGACACCATGTTCTCCACCAACTACTCACACAGAGGAGAACTACCGCAAGCGAGGGACCTGGTGTACCAGTCCACTGTGAGGTGAGTGCCTGGGGGTGGCGGGGTGACAGCGGGGGAAGGGCGGAGGGATGGGGAGTGGGGCAGAGAGCAGTTCTCCAGCCCTTTCACATCAATCCCTCGGTGCTCTTGGGCTTGGAAGGCAGAGACTGGGGGCTCCCCTTGTAAGAGGTGGGACCCATTCCTGGAGGACATCCCCTGGAGGGAAGAGGCAGGAGAGGGCCAGGCCACCGCTCTTCTGACTGGCCTCCTCTGCAGCTCTCCAGCAGTCCTGGAAGGTGGGTATTCCAAACTCCATTTATTTCCAAGTGAGATACTGAGGCCCAGAGAGGGAGCAATGGGCCTAAAGTCACACAGGCAACAATGACACAGCTGGGCGGTGAAGCAGTTCTGCCCGGCTTGGAAGCTCAAACACCACATTCTACTGTTTTTGTGTGCTCCAAGGCCATAAGGCCCCGAGCTTAGAGAATGACCCCAAGCAACTGGACTCCCAGGTCAGAAGCAGCAGGGTGGGAAGGAGCAGTGCTCAAGTCGGAGATGCTTGATTTTCCTACCCCATCCCTGTTTTCCAGGAGGCAGGTGGCAGAATCAAGCTGACTCTGATCTTCCAGAGCCTGCTGTTCTCTATGTGTGTGGCTCTATCTCCTTCCCACAAACTCCCCTACAAGATCCTAGTCCTGTTTTATCCCTTTGACCCTTAACACACACAGGTCCCTCTTCCTCAAGGGCCTTTCTCCCTTCGGTCATCTGGTAAACTCCTCATCAGCCTTCAGAACCCCACTCCAGCAATTCCACCCATGGGAGCCCTCACCTTAGCTGAACTGATCCCCCAAAGCTGGTCGGCTGCAACTGCCCCAGTACCCACCCCTCCTCAGAGGGAATCTCACCTGCATTCATGAAGAGTGCACAGCTTTGCACACACACTCTTCCACGCCACACCGCCAAATGCTTTGCCAGGTGCCCAGCCTGAAAGTAGTGGTCTGCTCCAAAGCTGGAGGGGGACCGTGCTGTGGGGTACCGTGAGGGGGACTGCAGCCTCGGCAAGCCCACATGATGTGCACTGTGGCACCATGCTATGTGGATGTCACAGCTAGTGCCTGGTACTGTGTTGTCTTTAGACTGAGCCTGGAGGAGGGGGAAGCCGTACTCTCCAGCACGGGCTGTGGGTCTCTGTGACCGTGAGCTCCAGGAGGACAAGGCACATGCTCCTCCTTCACCTTGCATCCCCAGCTCTCAGCCCAGGGCCTGGCACACAGTGGGTACATAATGAAATTTGCAAAATGAGTGGACAACTCAGGCTACACTCCTCAGCCCTTCAGGCCTGTTTTCCTATTGGGTGAAATTAGAAAGCGGGACAAGAAGATTGTAAAGGGTAAGTCCAACCCTGCATCAAGAATCTGACCCAGGCATCCTCCTCTGCCTGCTTTTGTAAAAATGGGAAGGTGAGTGTGCTCCCCAGGCGGCCCGGCTGGGTCCTCGTCTCTCTGGGCTCTGTGCCAGGACCATGCTGAGAATGTGGACGTGAAGGTGTGTGAAAGAAGGTGTGTGAACAGCCGCACAAACTCCTTCATCCCCCTAGTTATGATTTCTCCTGAACCTGCTTCCAAAAAGAAATTATGGTGCCTTAAAATATTAAAGCACACCTAAAACAAGCCAGTTACAATCAAGATGAAAAAGGAGATAAAAATTCATTTTGGAGAAAATAGCTATTCTGGAAACCACAGATGGTTTGTAGTAAGTGAACTCCCAAATAATCTCCACGACTCCCAGAGGAAGCCCTAATGCCACCCAGCCCACCCCTGCAGGTCACTGTGTCTC
AGAGAATTCTTTCCTCACCAAGCCACCCGCTGCCACCAGCAAGGCCTCTAGGACTCCTTGGTTCTGATTTCTAGCTAAGAAAGAAACCCAGGGACCATCTAGAACATTCCTGAGCACCACCCTCTCCTGCTTCCTGGAGCCTCAAGGGCAAGCCCTGCTACAGAAGACCCAGATGAAAATCCCAGCTCAGTCTCTGCCTACTGTATGACCTGGGATGAGAGCCTTAACCTCTCTCTCTCTCTGCTTCAGGTCCTTCGCCTGTAGAGCAGACTTGACATGACAGGCTTTGTACAATTGTGGAATTTCAGTGAAATTTATGTCCAACAGCCTGAGTTCAAATCCTGCCTCTTCCACATCCTGGCATTAGTGAAATCTGGGGCTGGTAACCCCACAGTTGAGTTTCCTGACCTCCAAAGTAACATGACAGCACTAAGAGGCTAGCTGCAAGTTGTGAGGAGCTGATGGCGTGCATTCAAGTGAAGTGCTGGTTTCCGGCCTCTAAGTGGTACATGTTTGCTGCCATTGCCTAGAACAACCTCTCCTTCTCCCACCATAAAACAAAACAAAAAAAATTCACCTTTTTACAAATCTCATAAAGCCTCCTTTTATGCGCTTTCTGTTTTGATCTGCACAAACAACCCCATGAATGCAGGCAGGTATTATTGTTATTAATCATTTTCTCTAGCCCTGTTCTTACTACAGAAGAGGTCAAAGTTCAGCAGGCCCAACAGTCTCCCTCTGGAGATGACATAGACTTGACCAGACCTTTTCAGTTGTGGAAGCCAAGGCCCAGAGAGGGAATGAGGCTTGCCTGAGGTCACACAGCGGCCCTGACTTAAGCCTGATATCCAGGGAGCCAGTGCTCTTTTCTTCCCCACATGGGGGCCAATCTCCCTGAGCCTCAGTAACAGGGGCGGTCTCCCAAGCAAAGAACACCCCAGGGAGCTGAGCCGTCCTGTTTGGTGACCACCAAGGGAGAGGGAAGGAAGGAGCAGACCCTCTGGAGCCTAACGGGATCACGGAGGATTGTCAGCCCTCCCAACCAGCCATGCTGGGGGCGACTGCACCTCTTCAGGCTGCAGAGCCCCATCAGGGCTGGCGACGGGCGCGGGACTCCGCATTTACATATAAATGAAGATACACCCCAAATTAATGCATCTCAACAGAGAAGGAGTCCCAGCTCCACAGCTGTCTGAGCCGTTTCCTCTTCAGCTGGGGAAAGTAGGGAGAGAATGAGACTGAGAAGATTACCAAGAATTGGCTGAGAGCAATTTCGCGGGTAGCCTAGAGGAGCCCCAAGGACTCCTGGAGCTTTTTGTCCCACTGATGGGGCCCTGGAGGGATGGAAGTCGGGTGGATCATCAGGACTTCTGCCCCCAGGGGGTGCAGGTGTGTATCCCTGGTTGTATGACCTTGGACAGATCACTGCCTCTCTGGATAGAGCGTCCCAGAGGCACATCTATCAAATGCTTTTCTAGGTCTTAGCCCCAGAGCCCAGAACTCCCAGGTTGGAATTGACAAGAATGTATTCCCTCCAAGGCTATTTAGTTGAGCACCTACTAGGACCTTGGCTTGGTGGAAGGTGCCCAGGGATACAGGGGGAATGAGAAAAGCACGGCTTTGCAATCTAGTCGGAGAGACATGGCAAAATGTAGATGAATACATCATTCCACATTCTGCAAAGTGATTCTAAACCATGGGCGTGTAATGAAGGGGCAAGTGGGGTGCAGTACCGAGGAAGGAATGAGATGCTGGGTGCCATTTAGATGGGTGGTCAGGCAGGGCCATCGAGGAAGGTGGCACTTGGGTGGAGAACTGAGGGCAGGAAGGAGCCCTCTGTGAAGGGAGGAGGCAGATGTGGCCGACAGTTATCCCACCCTGCCCACTTCCTCTTACCCCTCGCTCTTCCTGGCAGTGCCAGCATCCTTGGTGCTTCCACAGGGCTTTCTCAGGGCACTGGAGACCACTCAGCCTGCACATGAGAAAGGTGAGAAGTGCCG
GGGAGTTGACGCCACCCCGGGAGCAGCCTTGACCAATGACTGTTTATTGGGTGCATACCCAGCCCCCTCGTCCCTCCAGGGGGACAGTTCAGATGCATGCTCAGGAGGACAAAGTACAAGTTCCTCCTTCACCTCGTATGCCCAGCTCTCAGCCCAGGGCCTGGCACACAGTGGGTGCATAATGAAATGTGCAAAATGAGCGGGCAGTTCTGGGTATGCTCCTTGGCCCCTTGGGGGTGTTTCCTAATAAATTAAAATTAGAAAATGGGATAAGAAGGGGGACATTGAGAAACTATGGGATGAATGCTGTCAACTCTCTCCCGGAGTTTCTCAGTGACCCCCCAGAATAACCTTGAGAAGGCACCCATGTTGGCTTCCTTCCCTTCTAGCTCATTCCATCAGCTTCCTGGGACCCCCTTCTCCCACATAAACCCTTTCCTCCTAAATCCTTGTTTCAAGGTCTACTCTTCATTCCACCCTTCTAGGTTCTTGGCTGGGTCCCCAGTAGCAAAAGACGAATTCACAAGACAAAAGGATACAAGTCTAGTTTGTATACGTTTTAGGTGATGCAGGAGATTTTATAAGGAAATGAAGACCCACAGAAGTGGTTCTAGCTGAGTGTTTTGCTGGGTTGGATGAAGGGTGGGGAGTCATGGGAAAGTGTGAAGGATAGAAGGACCTGAGCTAAGGGCAGTAAACTGGGGACACTCAGCAGGGTGGCTTGTTCAGATCCCTTGGAAGTGAAGATGCTGCCTTCCTCCAGGTACAGAGAGGGAACCTCACGTGAAGATCTTCATGACCTGCTTCAGGGAAAGGTCAGAGAGTCCTTCCTGCGCCTGCCATTTCTCAAATTCCATCTGCTTAAAATATTCAATATACCAAGGTGCCAAATTTGAGGTGGCGTGTCCTGGAACCCCCTCCCATGGAATGTGGCCATTCCAGAGGGCAGCTGTGACCTCCTTGGAGGTCAAGTGAATCACAGTGAGGCTTCAACCACTCCGCCCACACACATGCTGTGTCTTCCATTTCCTGAAGCACCCTGAGGCCCTTGTCCCATTCGATCTGCTCAGGAAGCGATGAGGAACACTGGACAGATTTCACTGGCCTTAATTCACTGGTGCAGGGAATCTAGATTCGAGGGGCTCAGTGTATTCCTGCCCCCGACCTGCTATTGCAGATCCCTCCAACACCTTGGCCAATGCTCTTTGCCAACAAGGTCCTACTTGTCCTTTCAAGTTCAGCTTAAAGGTTGCTTCCTACAGGAAGTCAGCCCCAACTGCTCAAGCTCACCGACTGATGCTATGTCTTCAGAGTCTGCCCGCCCCCCCACTAGACTGGAAGCCCCATGAGGACAGGGACTAAGGTCTGATGTTTTTCACCACCATCCTCTCCCCAACCCCCAGGACAGCACCCAACCCTGGGCATGTGGTAGATGATTTGTAAATATCTGTGGGGTGAATGAGTTCGTGGTGAACATGGTGGTGGAGTCCTTTCAATGCAACACTTGCTGCTGGGGTGGATGAACAAGACACCTTAGGAAAGGAGCATCAGGCCCCCGTTCCTTCCCCCAACACTCTCCACATCATTCATTCATCACAAAGGAATTGAGTAGTTACCATATATCAGGGACAATGATGGAGATAAAGTGGTGAACAAGACATAATCAGACCTGGAGGTTTCTGATCTAGTGGGTGAGGAGAGACACACCTAAAAATGCCTTTAAAATAATAAGGAAAAACAGTTACAG

Thanks, Tobias
