Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

volcano controller repeatedly create pod after job had deleted #3357

Open
Wang-Kai opened this issue Mar 18, 2024 · 0 comments
Open

volcano controller repeatedly create pod after job had deleted #3357

Wang-Kai opened this issue Mar 18, 2024 · 0 comments
Labels
kind/bug Categorizes issue or PR as related to a bug.

Comments

@Wang-Kai
Copy link

Wang-Kai commented Mar 18, 2024

What happened:

I created a job which has just one task and one pod. After the job, podgroup and pod are running phase, i update pod and then delete job almost at the same time. The operation may cause the job to be deleted at etcd, but it is still present at volcano controller cache. So, the K8s garbage-collector deleted the pod due to its owner having deleted it, and the volcano controller created the pod due to job alway in cache.

As the user can see, pods are repeatedly created and deleted.

What you expected to happen:

volcano controller removes data from cache completely after job has deleted, and doesn't need to re-add to cache while updating pod.

How to reproduce it (as minimally and precisely as possible):

Just update pod and delete job as close to the time as possible.

Anything else we need to know?:

controller parameters:

    - --logtostderr=false
    - --leader-elect=true
    - --lock-object-namespace=volcano-system
    - --enable-healthz=true
    - -v=3
    - --log-dir=/home/
    - --log-file-max-size=0
    - --kube-api-qps=2000
    - --kube-api-burst=3000
    - --worker-threads=300
    - 2>&1

Environment:

  • Volcano Version: v1.8.2
  • Kubernetes version (use kubectl version): v1.20
  • Cloud provider or hardware configuration: 128c 256G
  • OS (e.g. from /etc/os-release):Debian GNU/Linux 9 (stretch)
  • Kernel (e.g. uname -a):Linux 5.10.0-103-bili-colo
  • Install tools: kubectl
@Wang-Kai Wang-Kai added the kind/bug Categorizes issue or PR as related to a bug. label Mar 18, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
kind/bug Categorizes issue or PR as related to a bug.
Projects
None yet
Development

No branches or pull requests

1 participant