Reduce available memory for Picard Mark Duplicates #272

ryan-moreno · 2023-05-01T16:08:39Z

Right now, the pipeline fails for me with the following error:

Error executing process > 'NFCORE_ATACSEQ:ATACSEQ:MERGED_LIBRARY_MARKDUPLICATES_PICARD:PICARD_MARKDUPLICATES (DMSO_REP4)'

Caused by:
  Process `NFCORE_ATACSEQ:ATACSEQ:MERGED_LIBRARY_MARKDUPLICATES_PICARD:PICARD_MARKDUPLICATES (DMSO_REP4)` terminated with an error exit status (1)

Command executed:

  picard \
      -Xmx36g \
      MarkDuplicates \
      --ASSUME_SORTED true --REMOVE_DUPLICATES false --VALIDATION_STRINGENCY LENIENT --TMP_DIR tmp \
      --INPUT DMSO_REP4.mLb.sorted.bam \
      --OUTPUT DMSO_REP4.mLb.mkD.sorted.bam \
      --REFERENCE_SEQUENCE genome.fa \
      --METRICS_FILE DMSO_REP4.mLb.mkD.sorted.MarkDuplicates.metrics.txt
  
  cat <<-END_VERSIONS > versions.yml
  "NFCORE_ATACSEQ:ATACSEQ:MERGED_LIBRARY_MARKDUPLICATES_PICARD:PICARD_MARKDUPLICATES":
      picard: $(echo $(picard MarkDuplicates --version 2>&1) | grep -o 'Version:.*' | cut -f2- -d:)
  END_VERSIONS

Command exit status:
  1

Command output:
  Error occurred during initialization of VM
  Could not reserve enough space for 37748736KB object heap

I ran into this issue with the cutandrun pipeline and the rnaseq pipeline. In both cases, the fix was to allocate only a fraction of the available memory when launching the process. Here is the relevant change in the cutandrun pipeline. Thanks @drpatelh for the change in the cutandrun repo.

PR checklist

This comment contains a description of changes (with reason).
If you've fixed a bug or added code that should be tested, add tests!
If you've added a new tool - have you followed the pipeline conventions in the contribution docs- [ ] If necessary, also make a PR on the nf-core/atacseq branch on the nf-core/test-datasets repository.
Make sure your code lints (nf-core lint).
Ensure the test suite passes (nextflow run . -profile test,docker --outdir <OUTDIR>).
Usage Documentation in docs/usage.md is updated.
Output Documentation in docs/output.md is updated.
CHANGELOG.md is updated.
README.md is updated (including new tool citations and authors/contributors).

Dev -> Master for v2.0 release

ryan-moreno · 2023-05-01T21:26:45Z

To make this run with my setup, I also had to hard code the memory allocated in /atacseq/modules/nf-core/picard/collectmultiplemetrics/main.nf as 2g. I'm not sure the proper way to deal with that.

drpatelh and others added 5 commits November 30, 2022 20:15

Merge pull request nf-core#212 from nf-core/dev

0add188

Dev -> Master for v2.0 release

Limit task memory to 80%

eb07e47

Merge branch 'dev-mem'

f565f97

Reduce available memory for mark duplicates

db14fd7

Change reduction back to 80%

ed379da

ryan-moreno marked this pull request as ready for review May 1, 2023 16:10

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Reduce available memory for Picard Mark Duplicates #272

Reduce available memory for Picard Mark Duplicates #272

ryan-moreno commented May 1, 2023

ryan-moreno commented May 1, 2023

Reduce available memory for Picard Mark Duplicates #272

Are you sure you want to change the base?

Reduce available memory for Picard Mark Duplicates #272

Conversation

ryan-moreno commented May 1, 2023

PR checklist

ryan-moreno commented May 1, 2023