Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Memory exhausted - too many files #57

Open
alistairhockey opened this issue Aug 14, 2023 · 2 comments
Open

Memory exhausted - too many files #57

alistairhockey opened this issue Aug 14, 2023 · 2 comments

Comments

@alistairhockey
Copy link

Hi there,

I am trying to run 'intervene upset'on 73 BED files that have ~40,000 intervals each.

intervene upset -i /data/alistairh/projects/SV_calling/data/peaks/DiscRegions/*{1,2,3}.bed --output SV_calling/data/peaks/DiscRegions/results_RT --save-overlaps

However, Intervene uses up all the available memory (62G) before being killed by the server. Is there a setting or a fix to limit the memory use of Intervene so it doesn't get killed by the server? This hasn't been a problem before when I have used intervene for 15-20 BED files.

@asntech
Copy link
Owner

asntech commented Aug 14, 2023

That is a lot of BED files! The memory issue is familiar with bedtools/pybedtools using large datasets. Can you use the latest versions of bedtools and pybedtools? Also, try to sort your bed files using bedtools before running the intervene upset.

We're aiming for a parallel processing option in the upcoming version of Intervene!

@alistairhockey
Copy link
Author

I haven't had any issues with bedtools multiinter - but maybe the sorting has played a part in that!
Also, have you considered having an option for BEDPE files? I would be interested to see if you could modify the script to use 'pairToPair' in place of 'intersect' to get all the BEDPE paired region combinations.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants