KEP-9270: Introduce configuration for the MultiKueue Incremental Dispatcher#10877
KEP-9270: Introduce configuration for the MultiKueue Incremental Dispatcher#10877Mostafahassen1 wants to merge 19 commits into
Conversation
✅ Deploy Preview for kubernetes-sigs-kueue canceled.
|
olekzabl
left a comment
There was a problem hiding this comment.
Hi, thank you for your PR!
This is my first round of comments; I plan to come back soon.
Co-authored-by: Olek Zabłocki <olekz@google.com>
Co-authored-by: Olek Zabłocki <olekz@google.com>
Co-authored-by: Olek Zabłocki <olekz@google.com>
@mimowo You know the customs better; what would you say? (P.S. I think a separate KEP makes much sense if we want to have a new dedicated feature gate - but do we? see below) |
No problem at all! I’m completely open to following the Kueue standards here. If you think it’s better to update KEP-693 instead of having a new one |
I'm sorry, I'm puzzled by this question (i.e. "update old or write new or both"). For me, right now, this feels a delicate judgement call... so for now I'd keep it as is and wait to see what the other reviewers will say. |
olekzabl
left a comment
There was a problem hiding this comment.
LGTM; I just have one more round of tiny comments.
Now I'm feeling we're just blocked by this thread.
|
/release-note-none |
|
/retitle KEP-9270: Introduce configuration for the MultiKueue Incremental Dispatcher |
|
[APPROVALNOTIFIER] This PR is NOT APPROVED This pull-request has been approved by: Mostafahassen1, olekzabl The full list of commands accepted by this bot can be found here. DetailsNeeds approval from an approver in each of these files:Approvers can indicate their approval by writing |
|
/lgtm |
|
LGTM label has been added. DetailsGit tree hash: 4d86aed56a9e407dda87bb843a6dcc364bf223f4 |
What type of PR is this?
/kind kep
/area multikueue
What this PR does / why we need it:
This PR proposes a KEP to introduce the
stepSizeconfiguration parameter for the MultiKueue Incremental Dispatcher.Currently, the Incremental Dispatcher can either query worker clusters one-by-one (which is slow) or all-at-once (which creates a heavy API load and risks race conditions). This KEP outlines a design to allow administrators to batch these queries by specifying a
stepSize, finding the optimal balance between dispatch speed and system stability.Which issue(s) this PR fixes:
Fixes #9270
Special notes for your reviewer:
This PR contains the initial KEP document (
README.mdandkep.yaml) for the feature proposed in #9270.Once this KEP is reviewed and approved (status: implementable), I will open follow-up PRs containing the API additions to
MultiKueueConfigand the core logic updates to the Workload Controller.Does this PR introduce a user-facing change?