sequence paired identity large than 100% #35

Chenglin20170390 · 2022-09-17T02:26:07Z

Hi , I test with follow data, but the sequence identity of matrix large than 1 . I don't know how to explan the result...
./famsa -dist_export -pid -square_matrix test.fa pid.csv
`>P
MMMMMRRRRR

T
MMMRRRRRRR
E
RRRRRRRRRR
F
RRRRRRRRRR`

output

,P,T,E,F
P,10000.000000,2.000000,0.500000,0.500000
T,2.000000,10000.000000,1.166667,1.166667
E,0.500000,1.166667,10000.000000,10000.000000
F,0.500000,1.166667,10000.000000,10000.000000

The text was updated successfully, but these errors were encountered:

agudys · 2022-09-21T19:25:19Z

@Chenglin20170390
Thank you for raporting the issue. Indeed, in -pid mode the matrix contains inverse of the dissimilarities which may result in such strange values. We will fix it in the next release to make sure that identities are from [0,1] interval.

…matching residues divided by the shorter sequence length (#35)

agudys · 2022-10-05T11:56:26Z

@Chenglin20170390
In the latest (2.2.1) version pairwise identity is calculated as the number of matching residues divided by the length of the shorter sequence. Please let me know if everything works as expected.

Adam

agudys self-assigned this Sep 21, 2022

agudys added a commit that referenced this issue Oct 5, 2022

Pairwise identity (-pid switch) properly calculated as the number of …

36557be

…matching residues divided by the shorter sequence length (#35)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

sequence paired identity large than 100% #35

sequence paired identity large than 100% #35

Chenglin20170390 commented Sep 17, 2022 •

edited

agudys commented Sep 21, 2022

agudys commented Oct 5, 2022

sequence paired identity large than 100% #35

sequence paired identity large than 100% #35

Comments

Chenglin20170390 commented Sep 17, 2022 • edited

agudys commented Sep 21, 2022

agudys commented Oct 5, 2022

Chenglin20170390 commented Sep 17, 2022 •

edited