Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

jellyfish kmer counter memory estimation #53

Open
bgruening opened this issue Dec 23, 2023 · 2 comments
Open

jellyfish kmer counter memory estimation #53

bgruening opened this issue Dec 23, 2023 · 2 comments

Comments

@bgruening
Copy link
Member

It would be nice if we can convert the memory estimation of jellyfish to python.

https://github.com/gmarcais/Jellyfish/blob/43b1ab27abdb8c9399c386cc998bb9fd33648412/include/jellyfish/large_hash_array.hpp#L97

jellyfish mem is giving already a nice estimation.

jellyfish mem --mer-len 27 --size 100M  '/data/dnb09/galaxy_db/files/c/0/2/dataset_c02f0b7f-79d1-498d-ba17-c39374511657.dat'
149933428608 (139G)

the M after --size is Million, G for Billion, k for kilo

@mr-c
Copy link

mr-c commented Dec 23, 2023

k-h-mer has a Python accessible implementation of the same algorithm

@bgruening
Copy link
Member Author

Thanks @mr-c

Are you talking about the mem part? It would be nice to have a simple Python function that can give us an estimate when we provide kmer-len and size. Any pointers where we can find that in the source?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants