feat: increase Workload PodSet limit to 10 #11388
Signed-off-by: yuluo-yx <yuluo08290126@gmail.com>
/hold
I'd like to figure out whether increasing the number of PodSets degrades TAS performance.
Agree. 👀
/ok-to-test
The performance of TAS (or of the scheduler in general) does not degrade unless a user actually requests more PodSets. Also, users can scale Kueue up (add more CPU/memory to the kueue deployment), but they cannot work around validation short of forking Kueue. To mitigate the concern, maybe we can start with a smaller number such as 10 or 12. That would also offset the additional "technical" PodSets for Ray: in #11260 we want to introduce a new PodSet for the RedisCleanup job, which would otherwise mean one less PodSet for "Workers". WDYT?
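To make the validation point concrete, here is a minimal sketch of a hard cap on the PodSet count, assuming the value 10 this PR settles on. The names `maxPodSets` and `validatePodSetCount` are hypothetical; how Kueue actually enforces the limit is not shown in this thread.

```go
package main

import "fmt"

// maxPodSets is a hypothetical constant holding the limit discussed in this PR.
const maxPodSets = 10

// validatePodSetCount is an illustrative check, not Kueue's real validation.
// It rejects any Workload that declares more PodSets than the cap allows.
func validatePodSetCount(n int) error {
	if n > maxPodSets {
		return fmt.Errorf("workload has %d PodSets, at most %d are allowed", n, maxPodSets)
	}
	return nil
}
```

Because a check like this runs at admission time, adding CPU or memory to the controller does not help a user who needs one more PodSet; only changing the cap does.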
In flavor assignment, the worst-case computation cost is N × M, where N is the number of flavors and M is the number of PodSets. The TAS calculation costs more on top of that because it also considers the tree topology structure. But I am assuming that a limit of 10 would have less impact.
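The N × M bound can be illustrated with a simplified first-fit loop. This is only a sketch under stated assumptions: `assignFlavors` is a hypothetical function, each PodSet is reduced to a single integer request, each flavor to a single integer capacity, and Kueue's real flavor assigner and the TAS topology walk are not modeled.

```go
package main

// assignFlavors is a hypothetical, simplified stand-in for flavor assignment.
// For each PodSet it tries the flavors in order and takes the first one with
// enough remaining capacity. It returns the chosen flavor index per PodSet
// (-1 if nothing fits) and the number of fit checks performed.
func assignFlavors(podSetRequests, flavorCapacity []int) (assigned []int, checks int) {
	// Copy the capacities so the caller's slice is not modified.
	remaining := append([]int(nil), flavorCapacity...)
	assigned = make([]int, len(podSetRequests))
	for i, req := range podSetRequests {
		assigned[i] = -1
		for f := range remaining {
			checks++
			if remaining[f] >= req {
				remaining[f] -= req
				assigned[i] = f
				break
			}
		}
	}
	return assigned, checks
}
```

In this model the worst case is a workload where no PodSet fits any flavor: every PodSet is checked against every flavor, so 10 PodSets and 3 flavors cost 30 checks. Raising the PodSet cap only raises the bound on M; workloads with fewer PodSets pay the same as before.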
@yuluo-yx could you adjust the PR to 10?
Got it, I'll update tonight.
/retitle feat: increase Workload PodSet limit to 10
Signed-off-by: yuluo-yx <yuluo08290126@gmail.com>
/unhold
What type of PR is this?
/kind feature
Which issue(s) this PR fixes:
Fixes #11379
Does this PR introduce a user-facing change?