You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Describe the bug
When running large drmaa jobs if a job errors (killed for example) Snakemake will throw a missing file error, and the exit ungracefully with a key error with the rule name.
Logs
Output of Snakemake as the jobs fails
MissingOutputException in line 25 of https://github.com/mrvollger/Rhodonite/raw/v0.12-alpha/workflow/rules/trf.smk:
Job Missing files after 60 seconds:
results/HG01978.pat/trf/183-of-200/183-of-200.dat
This might be due to filesystem latency. If that is the case, consider to increase the wait time with --
latency-wait.
Job id: 8509 completed successfully, but some output files are missing. 8509
Traceback (most recent call last):
File "/net/eichler/vol26/15000/nobackups/mvollger/miniconda3/envs/snakemake/lib/python3.9/site-package
s/snakemake/__init__.py", line 699, in snakemake
success = workflow.execute(
File "/net/eichler/vol26/15000/nobackups/mvollger/miniconda3/envs/snakemake/lib/python3.9/site-package
s/snakemake/workflow.py", line 1073, in execute
success = self.scheduler.schedule()
File "/net/eichler/vol26/15000/nobackups/mvollger/miniconda3/envs/snakemake/lib/python3.9/site-package
s/snakemake/scheduler.py", line 440, in schedule
self._finish_jobs()
File "/net/eichler/vol26/15000/nobackups/mvollger/miniconda3/envs/snakemake/lib/python3.9/site-package
s/snakemake/scheduler.py", line 540, in _finish_jobs
self.running.remove(job)
KeyError: run_split_trf
Log file of the job says it was killed (not enough memory).
Minimal example
This seems to only happen when I submit a very large number of jobs (>~5000) so a minimal example is hard to create.
Additional context
If I downgrade to snakemake 6.7 the problem seems to go away. I wonder if this PR may have something to do with it? #1156
The text was updated successfully, but these errors were encountered:
Snakemake version
Any version >=6.8.0
Describe the bug
When running large drmaa jobs if a job errors (killed for example) Snakemake will throw a missing file error, and the exit ungracefully with a key error with the rule name.
Logs
Output of Snakemake as the jobs fails
Log file of the job says it was killed (not enough memory).
Minimal example
This seems to only happen when I submit a very large number of jobs (>~5000) so a minimal example is hard to create.
Additional context
If I downgrade to snakemake 6.7 the problem seems to go away. I wonder if this PR may have something to do with it?
#1156
The text was updated successfully, but these errors were encountered: