subseq silent fail on malformatted fq #212

Open
cmsoulette opened this issue May 6, 2024 · 0 comments

seqtk fails silently when given a malformed FASTQ file.

An aligned BAM was converted to FASTQ with pysam, using the .get_forward_sequence() and .get_forward_qualities() methods. The latter returns an array rather than a string, which produces a malformed FASTQ if the value is written out without first being converted to a string. Running seqtk subseq on the malformed file does not raise any error.
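
For anyone hitting the same thing on the pysam side, here is a minimal sketch of the missing conversion step. It assumes the qualities are integer Phred scores that should be written as Phred+33 ASCII; `phred_to_fastq_string` and `format_fastq_record` are hypothetical helper names, not part of pysam or seqtk.

```python
from array import array


def phred_to_fastq_string(quals, offset=33):
    """Encode integer Phred scores as a FASTQ quality string.

    Assumes Phred+33 encoding unless another offset is given.
    """
    return "".join(chr(q + offset) for q in quals)


def format_fastq_record(name, seq, quals):
    """Build one four-line FASTQ record from a sequence and integer qualities."""
    qual_str = phred_to_fastq_string(quals)
    # A valid record has exactly one quality character per base.
    if len(qual_str) != len(seq):
        raise ValueError(
            f"{name}: {len(seq)} bases but {len(qual_str)} quality values"
        )
    return f"@{name}\n{seq}\n+\n{qual_str}\n"


if __name__ == "__main__":
    # Illustrative values only, shaped like what an array of qualities holds.
    quals = array("B", [3, 3, 3, 3, 2])
    print(format_fastq_record("read1", "ACGTA", quals), end="")
```

Writing `str(quals)` instead of the encoded string is what puts the `array('B', [...])` text on the quality line.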

Example FQ entry:

@SRR.ABC.123
AGGGCAATGTACTTCGTTCA.....
+SRR.ABC.123
array('B', [3, 3, 3, 3, 2,....])
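
Until seqtk reports this itself, a pre-check along these lines could flag such records before running subseq. This is only a sketch and not seqtk's own logic: it assumes strict four-line, unwrapped FASTQ records, and `find_malformed_records` is a hypothetical name.

```python
import io


def find_malformed_records(handle):
    """Yield (record_number, header, reason) for records that look malformed.

    Assumes each record is exactly four lines: header, sequence, '+', quality.
    """
    record_no = 0
    while True:
        header = handle.readline()
        if not header:
            break  # end of file
        if not header.strip():
            continue  # skip blank lines between records
        seq = handle.readline().rstrip("\n")
        plus = handle.readline().rstrip("\n")
        qual = handle.readline().rstrip("\n")
        record_no += 1
        header = header.rstrip("\n")
        if not header.startswith("@"):
            yield (record_no, header, "header does not start with '@'")
        elif not plus.startswith("+"):
            yield (record_no, header, "separator line does not start with '+'")
        elif len(seq) != len(qual):
            yield (record_no, header, "sequence and quality lengths differ")


if __name__ == "__main__":
    # Made-up records: the first has a stringified array on the quality line.
    text = (
        "@read1\nACGTA\n+read1\narray('B', [3, 3, 3, 3, 2])\n"
        "@read2\nACGTA\n+\n$$$$#\n"
    )
    for problem in find_malformed_records(io.StringIO(text)):
        print(problem)
```

The length comparison is the check that catches the entry above, since the stringified array is far longer than the sequence line.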
