Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Memory footprint in Rstudio #343

Open
LucasMcNU opened this issue Jan 26, 2024 · 6 comments
Open

Memory footprint in Rstudio #343

LucasMcNU opened this issue Jan 26, 2024 · 6 comments

Comments

@LucasMcNU
Copy link

When loading .hic files is finished, there is a ton of memory not cleaned up. My .hic file after loading is ~ 3-4 GB, but the memory footprint in my R session is upwards of 25-40 GB. It is causing my Rsession to crash. This is on an institutional HPC.

R version: R/4.2.3 (also seen on R/4.0.0)

using the most recent version of GENOVA (remotes::install_github("robinweide/GENOVA"))

RStudio 2023.06.0+421 "Mountain Hydrangea" Release (583b465ecc45e60ee9de085148cd2f9741cc5214, 2023-06-06) for CentOS 7
Mozilla/5.0 (Macintosh; Intel Mac OS X 10.15; rv:122.0) Gecko/20100101 Firefox/122.0

@teunbrand
Copy link
Collaborator

This remains the case after you call gc()?

@LucasMcNU
Copy link
Author

LucasMcNU commented Jan 26, 2024

yes, after I call gc() there is a decrease in the memory footprint. I am throwing quite a bit of RAM on the HPC towards this issue. To open and dump the contacts from a 1.5 gb .hic file, around ~40-50 GB are used by R studio. This then relaxes to around 25 to 40 GB and stays static after calling gc().

@LucasMcNU
Copy link
Author

Hi,
I wanted to follow up on this - I'm trying to load micro-C data using GENOVA. The data is large (~25-35 GB for .hic file). I'm using an HPC with ~700 GB of available RAM. GENOVA seemingly cant handle something this large, or freezes up. Is there anything you can suggest that would allow loading these contacts short of more memory?

@teunbrand
Copy link
Collaborator

The dev version has load_contacts_subset() to only load ROIs. GENOVA preceded micro-C and wasn't really build with data of that resolution in mind.

@LucasMcNU
Copy link
Author

Thank you! Where can i find usage guidelines for load_contacts_subset()?

@LucasMcNU
Copy link
Author

It looks like support for juicer .hic files hasn't been added yet?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants