Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Scheduler: Observe/Improve event handling throughput #124566

Open
3 tasks
Tracked by #122597
sanposhiho opened this issue Apr 26, 2024 · 5 comments
Open
3 tasks
Tracked by #122597

Scheduler: Observe/Improve event handling throughput #124566

sanposhiho opened this issue Apr 26, 2024 · 5 comments
Assignees
Labels
kind/feature Categorizes issue or PR as related to a new feature. needs-triage Indicates an issue or PR lacks a `triage/foo` label and requires one. sig/scheduling Categorizes an issue or PR as relevant to SIG Scheduling.

Comments

@sanposhiho
Copy link
Member

/kind feature
/assign
/sig scheduling


We don't have any observability around event handling (= requeueing) throughput. Adding observability can be a monitor for a degradation around it.
Also, based on the metric we add, we can find how fast event handling should be on a certain large scale, like we have 300 pods/s target in scheduling throughput. And then we may or may not need to improve the event handling throughput accordingly.

  • 1. Add an observability for event handling throughput.
  • 2. Benchmark and decide the ideal throughput.
  • 3. Improve the throughput based on (2). (possible actions: simplify slower QHint(s), etc)

Reference

@k8s-ci-robot k8s-ci-robot added kind/feature Categorizes issue or PR as related to a new feature. sig/scheduling Categorizes an issue or PR as relevant to SIG Scheduling. labels Apr 26, 2024
@k8s-ci-robot
Copy link
Contributor

This issue is currently awaiting triage.

If a SIG or subproject determines this is a relevant issue, they will accept it by applying the triage/accepted label and provide further guidance.

The triage/accepted label can be added by org members by writing /triage accepted in a comment.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

@k8s-ci-robot k8s-ci-robot added the needs-triage Indicates an issue or PR lacks a `triage/foo` label and requires one. label Apr 26, 2024
@sanposhiho
Copy link
Member Author

Registered as Beta requirement: QueueingHint.

@utam0k
Copy link
Member

utam0k commented Apr 28, 2024

I'd like to take part in this issue if possible. Is there any good issue for newer ones?

@sanposhiho
Copy link
Member Author

@utam0k

Let me dig in first.
Later maybe I'll be able to split it into some tasks that can be done in parallel; then, I'll let you know here.

@utam0k
Copy link
Member

utam0k commented Apr 29, 2024

Sure!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
kind/feature Categorizes issue or PR as related to a new feature. needs-triage Indicates an issue or PR lacks a `triage/foo` label and requires one. sig/scheduling Categorizes an issue or PR as relevant to SIG Scheduling.
Projects
None yet
Development

No branches or pull requests

3 participants