-
Notifications
You must be signed in to change notification settings - Fork 49
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
flux queue idle hangs with no active jobs #5964
Comments
Although dmesg shows that it ran successfullly:
The gdb dump above shows that Edit: |
This also shows the problem:
When a queue is stopped, any pending sched.alloc requests for jobs in that queue are canceled. They are then resent when the queue is started. However, I don't see this occurring during a queue reconfiguration. So if a job has an outstanding alloc request in a queue that is deleted, it might still be outstanding after |
Closing. The problem as stated in this issue's description was resolved by flux-framework/flux-sched#1209. |
Problem: when flux (0.62.0) was shut down on fluke, the
flux queue idle
command executed during cleanup hung.A manual run of
flux queue idle
also hangs.flux module stats job-manager
shows zero active jobs# flux module stats job-manager { "journal": { "listeners": 1 }, "active_jobs": 0, "inactive_jobs": 52, "max_jobid": 891940360861779968 }
But the debugger shows
ctx->alloc.alloc_pending_count = 1
. This value must be zero in order for the queue to be called idle.The text was updated successfully, but these errors were encountered: