Describe the bug
I have found that SeqRepo intermittently returns incorrect sequences. I've loaded millions of small protein sequences and started seeing returned sequences that couldn't be accounted for. To investigate, I wrote a loop that pulls sequence data from SeqRepo and compares each returned sequence to the known sequence. Each "All species match" line is one iteration of checking the same 100 species (100 loaded protein sequences):
SeqRepo makes it through 31 iterations, or 3,100 sequence comparisons, before returning incorrect data:
It then gets through roughly the same number of comparisons before again returning an incorrect sequence:
To Reproduce
Steps to reproduce the behavior:
Load a few million protein sequences, then query several thousand at a time, comparing each returned sequence against the known sequence.
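A minimal sketch of the kind of check loop described above, in case it helps others reproduce this. It is not my exact script. The base URL and the `/seqrepo/1/sequence/` route are assumptions about a locally running seqrepo-rest-service, and `fetch_from_rest`, `find_mismatches`, and the `expected` mapping (alias to known sequence) are hypothetical names used only for illustration.

```python
from concurrent.futures import ThreadPoolExecutor
from typing import Callable, Dict, List, Tuple
from urllib.parse import quote
from urllib.request import urlopen

# Assumption: address and route of a locally running seqrepo-rest-service.
BASE_URL = "http://localhost:5000/seqrepo/1/sequence/"


def fetch_from_rest(alias: str) -> str:
    """Fetch one sequence over HTTP (the URL layout is an assumption)."""
    with urlopen(BASE_URL + quote(alias, safe=""), timeout=30) as resp:
        return resp.read().decode("utf-8")


def find_mismatches(
    fetch: Callable[[str], str],
    expected: Dict[str, str],
    workers: int = 100,
    rounds: int = 50,
) -> List[Tuple[int, str, str]]:
    """Query every alias concurrently, `rounds` times, and collect
    (round number, alias, returned text) for each response that differs
    from the known sequence."""
    aliases = list(expected)
    mismatches: List[Tuple[int, str, str]] = []
    with ThreadPoolExecutor(max_workers=workers) as pool:
        for round_no in range(rounds):
            # pool.map yields results in the same order as `aliases`.
            for alias, got in zip(aliases, pool.map(fetch, aliases)):
                if got != expected[alias]:
                    mismatches.append((round_no, alias, got))
    return mismatches
```

Calling `find_mismatches(fetch_from_rest, expected)` with 100 known sequences approximates the concurrent case; setting `workers=1` gives a serial run for comparison.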
Expected behavior
SeqRepo should always return the same sequence for the same identifier.
Additional context
One bizarre aspect of this is that I can't identify where the incorrect sequences are coming from. If I take one of the incorrect returned sequences and search all of the sequence data I have loaded, I don't find it. I'm also getting data back that isn't even sequence:
It also looks like this is isolated to having many concurrent calls (100 in my case). It doesn't reproduce with serial calls.
I am using this version: seqrepo-rest-service:0.2.2