feat: add new kep to provider cpuidle capability for besteffort QoS pod #5904
hahahaheihei wants to merge 2 commits into kubernetes:master
@ffromani CC |
/ok-to-test |
> replaces: []
>
> # The target maturity stage in the current dev cycle for this KEP.
> stage: stable
This should be "beta": the field tracks the current feature maturity, which starts as beta.
> The feature require cgroupV2 & linux kernel 5.4 or later(cpu.idle enable)
> Users can configure the use of the CPU.idle function according to their needs or continue using cpu.shares/cpu.weight.
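A minimal sketch of how the prerequisites quoted above could be probed on a node. It is an illustration under stated assumptions, not the KEP's implementation: the helper names (`isCgroupV2`, `cpuIdleAvailable`), the `/sys/fs/cgroup` mount point, and the systemd-style BestEffort slice path are all hypothetical choices made for this example.

```go
package main

import (
	"fmt"
	"os"
	"path/filepath"
)

// fileExists reports whether path exists and is not a directory.
func fileExists(path string) bool {
	info, err := os.Stat(path)
	return err == nil && !info.IsDir()
}

// isCgroupV2 reports whether root looks like a cgroup v2 (unified) mount.
// The root of a unified hierarchy exposes a cgroup.controllers file.
func isCgroupV2(root string) bool {
	return fileExists(filepath.Join(root, "cgroup.controllers"))
}

// cpuIdleAvailable reports whether the given cgroup directory exposes a
// cpu.idle file, i.e. whether the running kernel offers the knob there.
func cpuIdleAvailable(cgroupDir string) bool {
	return fileExists(filepath.Join(cgroupDir, "cpu.idle"))
}

func main() {
	// Assumption: conventional mount point and systemd cgroup driver layout.
	root := "/sys/fs/cgroup"
	bestEffort := filepath.Join(root, "kubepods.slice", "kubepods-besteffort.slice")

	fmt.Printf("cgroup v2: %v\n", isCgroupV2(root))
	fmt.Printf("cpu.idle available: %v\n", cpuIdleAvailable(bestEffort))
}
```

If either probe returns false, the node would presumably keep using cpu.shares/cpu.weight, as the quoted text allows.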
> The feature prototype code is ready. [code](https://github.com/kubernetes/kubernetes/pull/136458)
The KEP represents the design of the feature, so a PoC can help illustrate the design. However, the code is transitory (it evolves and changes), while the design should be stable over time.
I don't think the PR helps much here.
> Safe Colocation: It allows safe colocation of BestEffort batch jobs with latency-sensitive services without the risk of performance degradation for the latter
> ### Non-Goals
I guess the non-goals are:
- changing the cgroup weighting, except for the minimal changes required for this work
- reviewing QoS class handling in general
> the new design , besteffort levels pod cpu.idle values set to 1
|  |
How do we plan to implement the enable/disable toggle hinted at in the "Proposal" above?
> cat besteffort qos level cpu.idle value
> 0 mean the feature is disabled
> 1 mean the feature is enabled
We need automated tests here.
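As a hedged illustration of what an automated check could build on, the manual `cat` verification quoted above can be expressed as a small parsing helper. This is only a sketch: `parseCPUIdle` is a hypothetical name, and the 0/1 meaning is taken directly from the quoted KEP text rather than from any existing kubelet code.

```go
package main

import (
	"fmt"
	"strconv"
	"strings"
)

// parseCPUIdle interprets the content of a cpu.idle file following the
// quoted KEP text: 0 means the feature is disabled, 1 means it is enabled.
// Any other content is reported as an error.
func parseCPUIdle(content string) (bool, error) {
	v, err := strconv.Atoi(strings.TrimSpace(content))
	if err != nil {
		return false, fmt.Errorf("unexpected cpu.idle content %q: %w", content, err)
	}
	switch v {
	case 0:
		return false, nil
	case 1:
		return true, nil
	default:
		return false, fmt.Errorf("unexpected cpu.idle value %d", v)
	}
}

func main() {
	// Sample inputs mimic what reading the file would return.
	for _, sample := range []string{"0\n", "1\n", "2\n"} {
		enabled, err := parseCPUIdle(sample)
		fmt.Printf("%q -> enabled=%v err=%v\n", sample, enabled, err)
	}
}
```

A test could read the BestEffort cgroup's cpu.idle file, pass its content to such a helper, and compare the result with the expected toggle state.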
/retest |
Thank you so much for contributing to Kubernetes. I'm gathering information to be able to help you better.
Hi @hahahaheihei, after reviewing the test failures (pull-enhancements-test and verify), here's what's needed for them to pass:
- Fix the kep.yaml: Change stage: stable to stage: beta (as suggested by @ffromani). Make sure the latest-milestone field matches the current Kubernetes version.
- Complete the PRR questionnaire: In the README.md file, find the Production Readiness Review Questionnaire section. You must answer all the questions (Enablement, Rollback, Monitoring, etc.). The tests are failing because the validator sees that these sections are empty or contain the default answers.
- Update the Table of Contents: Run
- Cleanup: Remove the link to the prototype code from README.md, as the KEP should focus on the design, not a temporary implementation.

Thank you again for your help, have a great day.
/assign |
/cc |
@hahahaheihei Is there any update?
The design remains unchanged, with only the document format and obvious errors modified. |
Hi @hahahaheihei @ffromani, since this KEP missed the v1.36 release window, I would love to see how we can keep the momentum going. Is it possible to target the upcoming v1.37 release?
Yes, totally (note that it is too early to set the labels, and we'd need a sig-node lead anyway).
add new kep about enable cpu.idle for besteffort QoS pod