New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Increase backoff limit in K8s jobs #5032
Comments
I've been trying to configure the backoff in the k8s executor template, but every time I add the label it is ignored. I have tried it as you can see in the screenshot, but also inside pod and container fields. I also have tried backoff restart policies, but with all of them I have the same result reference: https://kubernetes.io/docs/concepts/workloads/controllers/job/ |
Hi,@wangxiaoyou1993 |
@dcahillm1 @edulodgify k8s_executor_config: ... job_config: active_deadline_seconds: 120 backoff_limit: 3 ttl_seconds_after_finished: 86400 https://docs.mage.ai/production/configuring-production-settings/compute-resource#kubernetes-executor |
Hi @artche we have been trying to implement the fix but it seems that is not working, at least for us. I don't know if could be due to we are not using this kind of configuration
we are using k8s configuration template like
We have tried to use the job_config inside the container field, inside the pod field and outside of all the fields as it seems to be in what you shared. |
Is your feature request related to a problem? Please describe.
Some of our pipelines have been failing due to: " Job has reached the specified backoff limit"
Describe the solution you'd like
Is it possible to increase the backoff limit?
The text was updated successfully, but these errors were encountered: