foldseek easy-cluster iteratively with different batches at different times #249

josephhughes · 2024-03-05T16:42:42Z

Hi,

Is it possible to do foldseek easy-cluster at different points in time with different batches without needing to reprocess everything. For example, I have 10,000 pdb files that I clustered today. Then in 3 weeks time, I add another 10,000 sequences to the folder of pdb files.
When I run foldseek easy-cluster, is there a way for me to tell it that it can use the results of the first 10,000 files to minimise compute?

CRC63 · 2024-03-27T15:02:29Z

Hi,
I am also interested in this possibility. Thanks in advance.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

foldseek easy-cluster iteratively with different batches at different times #249

foldseek easy-cluster iteratively with different batches at different times #249

josephhughes commented Mar 5, 2024

CRC63 commented Mar 27, 2024

foldseek easy-cluster iteratively with different batches at different times #249

foldseek easy-cluster iteratively with different batches at different times #249

Comments

josephhughes commented Mar 5, 2024

CRC63 commented Mar 27, 2024