Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Feature request: Compatibility with Starlong #218

Open
adbeggs opened this issue Oct 7, 2023 · 3 comments
Open

Feature request: Compatibility with Starlong #218

adbeggs opened this issue Oct 7, 2023 · 3 comments

Comments

@adbeggs
Copy link

adbeggs commented Oct 7, 2023

Hi

I am trying to do fusion neoantigen prediction using Arriba, however STAR can't cope with long reads and ideally STARlong should be used - do you think this will be able to be supported please?

THanks

Andrew

@suhrig
Copy link
Owner

suhrig commented Oct 24, 2023

To some extent, this is already possible. Technically, Arriba is compatible with STARlong. I just gave it a spin with PacBio long reads and it worked.

The mismappers filter took a very long time to run. It should be disabled for long reads (-f mismappers), since there is a very slim chance of a long read being mapped to the wrong locus anyway.

There weren't any false positives. I am not sure about the false negative rate. Looking at the discarded fusions file, I mainly saw false negatives that were discarded due to lack of support (they had only 1 supporting read). So Arriba reported pretty much everything that can be reported with confidence. So if anything was missed, it was mainly because STAR did not find an alignment. I believe STAR should be able to find most fusions with ease given the long read length. It may have trouble with reads spanning multiple breakpoints (=> unmapped, too short), which may be ameliorated through STAR parameter tweaking. However, STAR will only ever report one chimera per read. Multiple chimeric alignments of the same read are supported by neither STAR nor Arriba.

In addition to disabling the mismappers filter, other Arriba parameters may improve the sensitivity some more. For example, one could reduce the minimum number of supporting reads to 1. I need to give this a try.

What made you think that Arriba is not compatible with STARlong? Did it fail on a sample of yours? Or did it miss fusions in a sample?

@adbeggs
Copy link
Author

adbeggs commented Oct 25, 2023

HI @suhrig - apologies after my post I managed to make it work fine with StarLong - sorry for wasting your time!

@suhrig
Copy link
Owner

suhrig commented Oct 25, 2023

No time wasted. I will make an enhancement that the mismappers filter is skipped for long reads.

If you can share any STAR or Arriba parameter optimizations that improve calling, let me know!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants