[PodGroup update phase Error] PodGroup cannot Update phase from Scheduling to Scheduled when gangMember larger than two #2030

Closed
PeterChg opened this issue May 7, 2024 · 4 comments
Labels: area/koord-scheduler, kind/bug (Create a report to help us improve)

Comments

@PeterChg
Contributor

PeterChg commented May 7, 2024

What happened:
The PodGroup cannot update its phase from Scheduling to Scheduled when there are more than two gang members, regardless of whether all gang members have been successfully scheduled and are running.

....
status:
  phase: Scheduling
  running: 4
  scheduleStartTime: "2024-05-06T13:59:43Z"
  scheduled: 3

What you expected to happen:
When all gang members have been successfully scheduled and are running, the PodGroup should update its phase from Scheduling to Scheduled.
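To make the expectation concrete, here is a minimal Go sketch of the phase rule described above. It is not the koord-scheduler implementation. The `podGroupStatus` type and both helper functions are hypothetical, and `minMember = 4` is an assumed gang size that the report does not state. The one check that needs no assumption is that a running pod must already have been scheduled, so `running: 4` alongside `scheduled: 3` points to a lagging `scheduled` counter.

```go
package main

import "fmt"

// podGroupStatus mirrors only the status fields quoted in this report.
// It is a hypothetical stand-in, not the real PodGroup API type.
type podGroupStatus struct {
	Phase     string
	Scheduled int32
	Running   int32
}

// countersInconsistent reports whether more pods are counted as running
// than as scheduled. A pod cannot run before it is scheduled, so this
// indicates that the scheduled counter has fallen behind.
func countersInconsistent(st podGroupStatus) bool {
	return st.Running > st.Scheduled
}

// expectedPhase returns the phase the reporter expects: once at least
// minMember pods are scheduled, Scheduling should become Scheduled.
// minMember is an assumption here; the report does not give its value.
func expectedPhase(st podGroupStatus, minMember int32) string {
	if st.Phase == "Scheduling" && st.Scheduled >= minMember {
		return "Scheduled"
	}
	return st.Phase
}

func main() {
	// Status as pasted in the report.
	reported := podGroupStatus{Phase: "Scheduling", Scheduled: 3, Running: 4}
	fmt.Println(countersInconsistent(reported)) // true

	// Same status with the scheduled counter caught up (assumed minMember = 4).
	corrected := podGroupStatus{Phase: "Scheduling", Scheduled: 4, Running: 4}
	fmt.Println(expectedPhase(corrected, 4)) // Scheduled
}
```

Under these assumptions, the reported status stays in Scheduling only because `scheduled` never reaches the gang size, even though every member is already running.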

How to reproduce it (as minimally and precisely as possible):
Enable gang scheduling and create a job with more than two pods. The more pods the job has, the more likely the problem is to occur.

Anything else we need to know?:
Occurrence version: 1.3.0

Environment:

  • App version: 1.3.0
  • Kubernetes version (use kubectl version): 1.22.3
  • Install details (e.g. helm install args):
  • Node environment (for koordlet/runtime-proxy issue):
    • Containerd/Docker version:
    • OS version:
    • Kernel version:
    • Cgroup driver: cgroupfs/systemd
  • Others:
@PeterChg PeterChg added the kind/bug Create a report to help us improve label May 7, 2024
@PeterChg PeterChg changed the title [PodGroup update phase Error] PodGroup cannot Update phase from Scheduling to Scheduled when gangMember larger than one [PodGroup update phase Error] PodGroup cannot Update phase from Scheduling to Scheduled when gangMember larger than two May 7, 2024
@saintube
Member

saintube commented May 7, 2024

PTAL /cc @ZiMengSheng @xulinfei1996

@ZiMengSheng
Contributor

/assign xulinfei1996

@ZiMengSheng ZiMengSheng added this to the v1.5 milestone May 7, 2024
@jasonliu747
Member

/close
Fixed by #2032

@koordinator-bot

@jasonliu747: Closing this issue.

In response to this:

/close
Fixed by #2032
