Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

How to conduct clusters based on global structures instead of local substructures? #265

Open
krinoz opened this issue Apr 19, 2024 · 1 comment

Comments

@krinoz
Copy link

krinoz commented Apr 19, 2024

Thanks for your work. However, as I tried to conduct protein structure clustering, the program always returns clusters based on proteins' substructure. I am wondering how can I get the cluster based on global structures?

--alignment-type 1 seems can get the global results based on TM align. Here is the code that I used:
./foldseek/bin/foldseek easy-cluster ./protein res tmp --lddt-threshold 0.5 --c 1 --alignment-type 1

However, the results I get look like:

1ai6.pdb_B 1ai6.pdb_B
1ai6.pdb_B 1ai7.pdb_B
1ai6.pdb_B 1ajn.pdb_B
1ai6.pdb_B 1ajp.pdb_B
1ai6.pdb_B 1ajq.pdb_B
1ai7.pdb_A 1ai7.pdb_A
1ai7.pdb_A 1ai6.pdb_A
1ai7.pdb_A 1ajn.pdb_A
1ai7.pdb_A 1ajp.pdb_A
1ai7.pdb_A 1ajq.pdb_A

And I would like to get the format of the clusters similar to:

1ai6.pdb 1ai6.pdb
1ai6.pdb 1ajn.pdb
1ai6.pdb 1ajq.pdb
1ai7.pdb 1ai7.pdb
1ai7.pdb 1ajp.pdb

Thanks.

@milot-mirdita
Copy link
Member

We are working on a new method that covers multimer clustering. It's not quite ready yet, but we should have something out soon.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants