
Add entries for ebi-gxa anndata_ops and scanpy #48

Open
wants to merge 1 commit into base: main

Conversation

cat-bro
Collaborator

@cat-bro cat-bro commented Nov 20, 2023

Add scaling rules for some ebi-gxa tools.

Add a general fallback rule for ebi-gxa/scanpy* as well as specific rules for anndata_ops, scanpy_multiplet_scrublet, scanpy_scale_data and scanpy_run_pca.

@bgruening
Member

@pavanvidem ping

@pavanvidem
Member

I don't have much experience with the ebi-gxa tool suite, beyond having used some of its tools in training materials. Maybe @pcm32 has a better understanding.

@pcm32
Member

pcm32 commented Nov 22, 2023

Which units is the input size in?

@cat-bro
Collaborator Author

cat-bro commented Dec 6, 2023

GB

@pcm32
Member

pcm32 commented Dec 6, 2023

I suspect, then, that the multipliers might be too big and will very quickly leave you with no node able to run the job. I left some memory usage stats as supplementary material in a paper; let me try to dig them up. In our pipeline, AnnData files of around 5 GB rarely need more than 20 GB of RAM.
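
To make the concern concrete, here is a minimal sketch of how a linear memory rule behaves as the multiplier grows. It is not the rule from this PR: the function names, the 4 GB floor, the multiplier of 20 and the 64 GB node size are all hypothetical values chosen for illustration. Only the ratio of roughly 4 GB of RAM per GB of input comes from the 5 GB / 20 GB figure above.

```python
from typing import Optional


def scaled_mem_gb(input_size_gb: float, multiplier: float,
                  floor_gb: float = 4.0,
                  cap_gb: Optional[float] = None) -> float:
    """Memory request (GB) from a linear rule: input size times a multiplier.

    floor_gb and cap_gb are hypothetical guard rails, not values from the PR.
    """
    mem = max(floor_gb, input_size_gb * multiplier)
    if cap_gb is not None:
        mem = min(mem, cap_gb)
    return mem


def fits_on_node(mem_gb: float, node_mem_gb: float) -> bool:
    """True if a job asking for mem_gb can be scheduled on a node of that size."""
    return mem_gb <= node_mem_gb


LARGEST_NODE_GB = 64  # hypothetical size of the largest available node

# Multiplier 4 matches the observation above (about 5 GB in, about 20 GB of RAM).
# Multiplier 20 is an arbitrary "too large" value, not the one proposed in the PR.
for multiplier in (4, 20):
    request = scaled_mem_gb(5, multiplier)
    print(multiplier, request, fits_on_node(request, LARGEST_NODE_GB))
```

With these assumed numbers, a 5 GB input fits on the node at a multiplier of 4 but not at 20. Passing `cap_gb` keeps large inputs schedulable, at the cost of under-requesting memory for jobs that really do need more.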

4 participants