KEP-2594 implementable for alpha #2946
Conversation
rahulkjoshi commented Sep 6, 2021 (edited)
- One-line PR description: PR Review for KEP 2594 and marking implementable for alpha
- Issue link: Enhancing NodeIPAM to support multiple ClusterCIDRs #2593
Hi @rahulkjoshi. Thanks for your PR. I'm waiting for a kubernetes member to verify that this patch is reasonable to test. If it is, they should reply with `/ok-to-test`. Once the patch is verified, the new status will be reflected by the `ok-to-test` label. I understand the commands that are listed here. Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.
/assign @wojtek-t
Thanks! /lgtm
The branch was force-pushed from 174c029 to 5fd48a4 (compare).
@rahulkjoshi - the PRR for Alpha looks reasonable - I just added a couple of smaller comments. But I also added a couple of comments on the proposal itself. PTAL
- [ ] (R) Production readiness review approved
- [X] (R) Graduation criteria is in place
- [X] (R) Production readiness review completed
- [X] (R) Production readiness review approved
Sorry - I can't comment on unchanged lines, so commenting here:
L226: Will this be a built-in API or a CRD?
L241: I suggest using NodeSelector instead of LabelSelector:
https://github.com/kubernetes/kubernetes/blob/master/staging/src/k8s.io/api/core/v1/types.go#L2643
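For illustration, a rough sketch of what a spec using `v1.NodeSelector` could look like (the type and field names here are assumptions, not the KEP's proposed API):

```go
package v1alpha1

import (
	v1 "k8s.io/api/core/v1"
)

// ClusterCIDRConfigSpec is an illustrative sketch only; the field names
// are assumptions, not the final API shape.
type ClusterCIDRConfigSpec struct {
	// NodeSelector (core/v1) selects the nodes this config applies to.
	// Unlike metav1.LabelSelector, it also supports matchFields terms and
	// OR-ing of multiple NodeSelectorTerms.
	// +optional
	NodeSelector *v1.NodeSelector `json:"nodeSelector,omitempty"`

	// Example CIDR fields, included only to make the sketch self-contained.
	// +optional
	IPv4CIDR string `json:"ipv4CIDR,omitempty"`
	// +optional
	IPv6CIDR string `json:"ipv6CIDR,omitempty"`
}
```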
L299: I think that we need to mention how we handle cases where there aren't any free IPs anymore.
So basically the algorithm should be sth like:
When trying to allocate a pod-cidr for a node:
(1) discard all ClusterCIDRConfigs that have no free IPs left or that don't match the selector.
(2) From the remaining ones, prefer ....
<--- update ---> I see that being discussed below - it would be more intuitive to merge it into sth like I mentioned above.
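A minimal sketch of that two-step selection, using stand-in types (`clusterCIDRConfig`, `matchesNode`, and the `specificity` tie-break are illustrative, not the KEP's actual names or preference rule):

```go
package main

import (
	"fmt"
	"sort"
)

// clusterCIDRConfig is a minimal stand-in for illustration only.
type clusterCIDRConfig struct {
	name        string
	matchesNode bool // result of evaluating the node selector against the node
	freeCIDRs   int  // unallocated per-node CIDRs remaining in this range
	specificity int  // placeholder preference weight, e.g. selector specificity
}

// pickConfig applies the order described above: (1) discard configs that
// don't match the node or have no free IPs, (2) prefer among the rest.
func pickConfig(configs []clusterCIDRConfig) (clusterCIDRConfig, error) {
	var candidates []clusterCIDRConfig
	for _, c := range configs {
		if !c.matchesNode || c.freeCIDRs == 0 {
			continue // step (1): discard
		}
		candidates = append(candidates, c)
	}
	if len(candidates) == 0 {
		return clusterCIDRConfig{}, fmt.Errorf("no matching ClusterCIDRConfig with free IPs")
	}
	// step (2): placeholder preference - more specific configs first.
	sort.Slice(candidates, func(i, j int) bool {
		return candidates[i].specificity > candidates[j].specificity
	})
	return candidates[0], nil
}

func main() {
	cfgs := []clusterCIDRConfig{
		{name: "default", matchesNode: true, freeCIDRs: 0},
		{name: "rack-a", matchesNode: true, freeCIDRs: 4, specificity: 2},
	}
	if c, err := pickConfig(cfgs); err == nil {
		fmt.Println("allocating from", c.name) // "rack-a": "default" has no free IPs
	}
}
```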
L364: It might be a mental shortcut, but we should be clear - periodic polling doesn't scale well - the controller should be watching (and potentially polling from the local cache to check, though even that isn't necessarily needed with our informer machinery).
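For reference, the standard watch-based wiring looks roughly like this (shown with the existing Node informer, since a ClusterCIDRConfig informer doesn't exist yet; the handler body is illustrative):

```go
package main

import (
	"fmt"
	"time"

	v1 "k8s.io/api/core/v1"
	"k8s.io/client-go/informers"
	"k8s.io/client-go/kubernetes"
	"k8s.io/client-go/tools/cache"
	"k8s.io/client-go/tools/clientcmd"
)

func main() {
	// Build a client from the local kubeconfig (illustrative bootstrap).
	config, err := clientcmd.BuildConfigFromFlags("", clientcmd.RecommendedHomeFile)
	if err != nil {
		panic(err)
	}
	clientset := kubernetes.NewForConfigOrDie(config)

	// Shared informer: the controller reacts to watch events and reads from the
	// informer's local cache instead of periodically polling the API server.
	factory := informers.NewSharedInformerFactory(clientset, 30*time.Minute)
	nodeInformer := factory.Core().V1().Nodes().Informer()
	nodeInformer.AddEventHandler(cache.ResourceEventHandlerFuncs{
		AddFunc: func(obj interface{}) {
			node := obj.(*v1.Node)
			fmt.Printf("node %s added - would be enqueued for pod CIDR allocation\n", node.Name)
		},
	})

	stop := make(chan struct{})
	defer close(stop)
	factory.Start(stop)
	cache.WaitForCacheSync(stop, nodeInformer.HasSynced)
	<-stop
}
```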
L452: I think I'm not fully following - are you saying that: whether we apply IPv4, IPv6 or both is decided purely on their availability in the ClusterCIDRConfig object?
[If not, can you clarify? If so, can we also make it more explicit?]
L500: if the IPs from the previously existing created-from-flags-\<hash\> are in use, then the finalizer will block its deletion. Does that block initialization of the new IPAM controller? That sounds like a problem to me...
L515: This shouldn't be different from regular operation - on a failed operation we should simply re-queue the node (with backoff) to retry [we shouldn't need any additional logic]. (This pattern is used in a couple of other controllers.)
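That pattern, sketched with the client-go workqueue (the `syncNode` body here is a placeholder):

```go
package main

import (
	"fmt"

	"k8s.io/client-go/util/workqueue"
)

// syncNode is a placeholder for the controller's real reconcile function.
func syncNode(nodeName string) error {
	return fmt.Errorf("no free pod CIDR for %s yet", nodeName)
}

// processNextItem shows the standard requeue-with-backoff loop: on failure
// the key is re-added through the rate limiter; on success the backoff
// history for that key is cleared. No extra retry logic is needed.
func processNextItem(queue workqueue.RateLimitingInterface) bool {
	key, quit := queue.Get()
	if quit {
		return false
	}
	defer queue.Done(key)

	if err := syncNode(key.(string)); err != nil {
		queue.AddRateLimited(key) // retry later with exponential backoff
		return true
	}
	queue.Forget(key) // success: reset backoff for this key
	return true
}

func main() {
	queue := workqueue.NewRateLimitingQueue(workqueue.DefaultControllerRateLimiter())
	queue.Add("node-1")
	processNextItem(queue)
}
```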
L732: I would say that we should have a feature gate too. So basically:
- a feature gate that decides whether the new controller can even be started
- and on top of that a flag that decides which controller to use
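A rough sketch of that two-level gating (the gate name, flag value, and allocator constructors are assumptions for illustration, not what the KEP will necessarily ship):

```go
package main

import (
	utilfeature "k8s.io/apiserver/pkg/util/feature"
	"k8s.io/component-base/featuregate"
)

// MultiCIDRRangeAllocator is a hypothetical gate name used for illustration.
const MultiCIDRRangeAllocator featuregate.Feature = "MultiCIDRRangeAllocator"

func init() {
	// Register the hypothetical gate so the Enabled() lookup below works.
	if err := utilfeature.DefaultMutableFeatureGate.Add(map[featuregate.Feature]featuregate.FeatureSpec{
		MultiCIDRRangeAllocator: {Default: false, PreRelease: featuregate.Alpha},
	}); err != nil {
		panic(err)
	}
}

func startLegacyRangeAllocator() error    { return nil } // placeholder
func startMultiCIDRRangeAllocator() error { return nil } // placeholder

// startNodeIPAMController: the feature gate decides whether the new
// controller may start at all, and on top of that a flag selects which
// allocator is actually used.
func startNodeIPAMController(cidrAllocatorType string) error {
	if !utilfeature.DefaultFeatureGate.Enabled(MultiCIDRRangeAllocator) {
		return startLegacyRangeAllocator()
	}
	if cidrAllocatorType == "MultiCIDRRangeAllocator" {
		return startMultiCIDRRangeAllocator()
	}
	return startLegacyRangeAllocator()
}

func main() { _ = startNodeIPAMController("RangeAllocator") }
```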
L758: Not really restart nodes - but rather recreate them.
L773: nit: "No - the (...) tests will be added before graduating the feature to Alpha."
[The tests don't yet exist, right?]
I've made changes to reflect the comments above. I hope I clarified everything.
For L500: The bit that's probably getting lost is that the controller chooses a random hash each time it starts up. So if it needs to create a new object and delete an old one, it need not block. However, users will still have to recreate their Nodes -- there's no way around that.
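A small sketch of that startup behavior, with hypothetical names (the `created-from-flags-` handling below is assumed for illustration, not quoted from the KEP):

```go
package main

import (
	"crypto/rand"
	"encoding/hex"
	"fmt"
	"strings"
)

// newFlagConfigName picks a fresh random suffix on every controller start.
func newFlagConfigName() string {
	b := make([]byte, 4)
	if _, err := rand.Read(b); err != nil {
		panic(err)
	}
	return "created-from-flags-" + hex.EncodeToString(b)
}

// reconcileFlagConfigs sketches the flow: create the new object, then issue
// deletes for older created-from-flags-* objects. Their finalizers may hold
// the actual removal until the IPs are released, but the new controller does
// not block on that.
func reconcileFlagConfigs(existing []string) (create string, deleteOld []string) {
	create = newFlagConfigName()
	for _, name := range existing {
		if strings.HasPrefix(name, "created-from-flags-") && name != create {
			deleteOld = append(deleteOld, name)
		}
	}
	return create, deleteOld
}

func main() {
	create, old := reconcileFlagConfigs([]string{"created-from-flags-a1b2c3d4"})
	fmt.Println("create:", create, "delete later (finalizer may delay):", old)
}
```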
Thank you - that's perfect!
/ok-to-test
The branch was force-pushed from 5fd48a4 to e159865 (compare).
/lgtm
[APPROVALNOTIFIER] This PR is APPROVED. This pull-request has been approved by: rahulkjoshi, thockin, wojtek-t. The full list of commands accepted by this bot can be found here. The pull request process is described here.
Needs approval from an approver in each of these files:
Approvers can indicate their approval by writing `/approve` in a comment.