Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Assessing Read Quality: Improve the visualisation of Phred Quality Scores #220

Open
theheking opened this issue Apr 16, 2021 · 1 comment

Comments

@theheking
Copy link

I think the Details on the FASTQ format section is fairly confusing for beginners. For a better readibility and understanding of the quality scores. I think that the sentence "This quality score is logarithmically based, so a quality score of 10 reflects a base call accuracy of 90%, but a quality score of 20 reflects a base call accuracy of 99%" should be paired with the classic table displaying Quality Score.

Quality Score Probability of Base Error Base Confidence Sanger Encoded ASCII Character
10 0.1 90% "+"
20 0.01 99% "5"
30 0.001 99.9% "?"
40 0.0001 99.99% "I"
@amishaporet
Copy link

Agreed. Also, with the increasing use of NovaSeq as a sequencing platform, it would be good to include an explanation of NovaSeq quality scores. The quality scores are binned and correspond to marginal (<Q15, reported value of 12), medium (~Q20, reported value of 23), high (>Q30, reported value of 37), and a null score for no-calls is reported as 2 (https://www.illumina.com/content/dam/illumina-marketing/documents/products/appnotes/novaseq-hiseq-q30-app-note-770-2017-010.pdf)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants