Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Recommend 1bp SEQ with QUAL="*" as quality-unavailable. #752

Open
wants to merge 2 commits into
base: master
Choose a base branch
from
Open
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Jump to
Jump to file
Failed to load files.
Diff view
Diff view
6 changes: 5 additions & 1 deletion SAMv1.tex
Original file line number Diff line number Diff line change
Expand Up @@ -75,7 +75,7 @@ \section{The SAM Format Specification}
Alternatively and equivalently, SAM files are encoded in UTF-8 but non-ASCII characters are permitted only within certain field values as explicitly specified in the descriptions of those fields.%
\footnote{Hence in particular SAM files must not begin with a byte order mark~(BOM) and lines of text are delimited by ASCII line terminator characters only.
% Unicode identifies VT and FF as line break characters as well, but no one uses them in SAM.
In addition to the local platform's text file line termination conventions, implementations may wish to support \textsc{lf} and \textsc{cr\>lf} for interoperability with other platforms.}
In addition to the local platform's text file line termination conventions, implementations may wish to support \textsc{lf} and \textsc{cr\,lf} for interoperability with other platforms.}

Where it makes a difference, SAM file contents should be read and written using the POSIX\,/\,C locale.
For example, floating-point values in SAM always use `{\tt .}' for the decimal-point character.
Expand Down Expand Up @@ -651,6 +651,10 @@ \subsection{The alignment section: mandatory fields}\label{sec:alnrecord}
wrong}\}$. This field can be a `*' when quality is not stored. If
not a `*', {\sf SEQ} must not be a `*' and the length of the quality string
ought to equal the length of {\sf SEQ}.
\footnote{There is a small ambiguity with a sequence exactly one
base-pair long having quality 9. This is ASCII `*' so it could be
interpreted as either QUAL 9 or as quality unavailable. Given
this ambiguity, we recommend interpreting it as unavailable.}
\end{enumerate}

\subsection{The alignment section: optional fields}\label{sec:alnaux}
Expand Down