Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

match dsODN sequences #201

Open
YichaoOU opened this issue Feb 17, 2022 · 2 comments
Open

match dsODN sequences #201

YichaoOU opened this issue Feb 17, 2022 · 2 comments
Labels
enhancement New feature or request

Comments

@YichaoOU
Copy link

Is your feature request related to a problem? Please describe.
Current dsODN sequence matching doesn't allow mismatches
df_alleles["Aligned_Sequence"].str.find(args.dsODN) > 0

Describe the solution you'd like
we could just do a from skbio.alignment import StripedSmithWaterman to match dsODN.

Yichao

@kclem kclem added the enhancement New feature or request label Feb 18, 2022
@kclem
Copy link
Member

kclem commented Feb 18, 2022

Would you also allow gaps? And how many mismatches?

@YichaoOU
Copy link
Author

For the 34bp dsODN sequence, we use a score cutoff of 30, which could potentially allow up to 7 mismatches or a short gap.

for StripedSmithWaterman, a match is +2 and a mismatch is -3.

<style> </style>
#Match #Mis-Match Score
34 0 68
33 1 63
32 2 58
31 3 53
30 4 48
29 5 43
28 6 38
27 7 33
26 8 28
25 9 23

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

2 participants