Maintaining openshift/kubernetes

OpenShift is based on upstream Kubernetes. With every release of Kubernetes that is intended to be shipped as OCP, it is necessary to incorporate the upstream changes while ensuring that our downstream customizations are maintained.

Rebasing for releases < 4.6

The instructions in this document apply to OpenShift releases 4.6 and above. For previous releases, please see the rebase enhancement.

Maintaining this document

An openshift/kubernetes rebase is a complex process involving many manual and potentially error-prone steps. If, while performing a rebase, you find areas where the documented procedure is unclear or missing detail, please update this document and include the change in the rebase PR. This will ensure that the instructions are as comprehensive and accurate as possible for the person performing the next rebase.

Rebase Checklists

The checklists provided below highlight the key responsibilities of someone performing an openshift/kubernetes rebase.

In preparation for submitting a PR to the openshift fork of kubernetes, the following should be true:

The new rebase branch has been created from the upstream tag
The new rebase branch includes relevant carries from target branch
Dependencies have been updated
Hyperkube dockerfile version has been updated
make update has been invoked and the results committed
make executes without error
make verify executes without error
make test executes without error
The upstream tag is pushed to openshift/kubernetes to ensure that build artifacts are versioned correctly - Upstream tooling uses the value of the most recent tag (e.g. v1.20.0) in the branch history as the version of the binaries it builds. - Pushing the tag is easy as

git push git@github.com:openshift/kubernetes.git refs/tags/v1.20.0

Details to include in the description of the PR:

A link to the rebase spreadsheet for the benefit for reviewers

After the rebase PR has merged to openshift/kubernetes, vendor the changes into openshift/origin to ensure that the openshift-tests binary reflects the upstream test changes introduced by the rebase:

Find the SHA of the merge commit after your PR lands in openshift/kubernetes
Run hack/update-kube-vendor.sh <o/k SHA> in a clone of the origin repo and commit the results
Run make update and commit the results
Submit as a PR to origin

As a final step, send an email to the aos-devel mailing list announcing the rebase. Make sure to include:

The new version of upstream Kubernetes that OpenShift is now based on
Link(s) to upstream changelog(s) detailing what has changed since the last rebase landed
A reminder to component maintainers to bump their dependencies
Relevant details of the challenges involved in landing the rebase that could benefit from a wider audience.

Getting started

Before incorporating upstream changes you may want to:

Read this document
Get familiar with tig (text-mode interface for git)
Find the best tool for resolving merge conflicts
Use diff3 conflict resolution strategy (https://blog.nilbus.com/take-the-pain-out-of-git-conflict-resolution-use-diff3/)
Teach Git to remember how you’ve resolved a conflict so that the next time it can resolve it automatically (https://git-scm.com/book/en/v2/Git-Tools-Rerere)

Send email annoucing you're starting work

To better spread the information send the following email:

Title: k8s <version> bump is starting...

I'm starting the process of updating our fork to bring in
the latest available version of kubernetes. This means that
every PR landing in openshift/kubernetes should go through
extra scrutiny and only 2 exceptions allow merging PRs in the
upcoming time:
1. High priority backports which require landing master first
to start the backport process.
2. Critical PRs unblocking the org.
In both cases make sure to reach out to me for final approval.

There is no ETA yet, but feel free to reach out to me with
any questions.

Preparing the local repo clone

Clone from a personal fork of kubernetes via a pushable (ssh) url:

git clone git@github.com:<user id>/kubernetes

Add a remote for upstream and fetch its branches:

git remote add --fetch upstream https://github.com/kubernetes/kubernetes

Add a remote for the openshift fork and fetch its branches:

git remote add --fetch openshift https://github.com/openshift/kubernetes

Creating a new local branch for the new rebase

Branch the target k8s.io/kubernetes release tag (e.g. v1.20.0) to a new local branch

git checkout -b rebase-1.20.0 v1.20.0

Merge openshift(master) branch into the rebase-1.20.0 branch with merge strategy ours. It discards all changes from the other branch (openshift/master) and create a merge commit. This leaves the content of your branch unchanged, and when you next merge with the other branch, Git will only consider changes made from this point forward. (Do not confuse this with ours conflict resolution strategy for recursive merge strategy, -X option.)

git merge -s ours openshift/master

Creating a spreadsheet of carry commits from the previous release

Given the upstream tag (e.g. v1.19.2) of the most recent rebase and the name of the branch that is targeted for rebase (e.g. openshift/master), generate a tsv file containing the set of carry commits that need to be considered for picking:

echo 'Comment Sha\tAction\tClean\tSummary\tCommit link\tPR link' > ~/Documents/v1.19.2.tsv

git log $( git merge-base openshift/master v1.19.2 )..openshift/master --ancestry-path --reverse --no-merges --pretty='tformat:%x09%h%x09%x09%x09%s%x09https://github.com/openshift/kubernetes/commit/%h?w=1' | grep -E $'\t''UPSTREAM: .*'$'\t' | sed -E 's~UPSTREAM: ([0-9]+)(:.*)~UPSTREAM: \1\2\thttps://github.com/kubernetes/kubernetes/pull/\1~' >> ~/Documents/v1.19.2.tsv

This tsv file can be imported into a google sheets spreadsheet to track the progress of picking commits to the new rebase branch. The spreadsheet can also be a way of communicating with rebase reviewers. For an example of this communication, please see the the spreadsheet used for the 1.19 rebase.

Picking commits from the previous rebase branch to the new branch

Go through the spreadsheet and for every commit set one of the appropriate actions:

p, to pick the commit
s, to squash it (add a comment with the sha of the target)
d, to drop the commit (if it is not obvious, comment why)

Set up conditional formatting in the google sheet to color these lines appropriately.

Commits carried on rebase branches have commit messages prefixed as follows:

UPSTREAM: <carry>:
- A persistent carry that should probably be picked for the subsequent rebase branch.
- In general, these commits are used to modify behavior for consistency or compatibility with openshift.
UPSTREAM: <drop>:
- A carry that should probably not be picked for the subsequent rebase branch.
- In general, these commits are used to maintain the codebase in ways that are branch-specific, like the update of generated files or dependencies.
UPSTREAM: 77870:
- The number identifies a PR in upstream kubernetes (i.e. https://github.com/kubernetes/kubernetes/pull/<pr id>)
- A commit with this message should only be picked into the subsequent rebase branch if the commits of the referenced PR are not included in the upstream branch.
- To check if a given commit is included in the upstream branch, open the referenced upstream PR and check any of its commits for the release tag (e.g. v.1.20.0) targeted by the new rebase branch. For example:

With these guidelines in mind, pick the appropriate commits from the previous rebase branch into the new rebase branch. Create a new filter view in the spreadsheet to allow you get a view where Action==p || Action==s and copy paste the shas to git cherry-pick command. Use tr '\n' ' ' <<< "<line_separated_commits>" to get a space separated list from the copy&paste.

Where it makes sense to do so, squash carried changes that are tightly coupled to simplify future rebases. If the commit message of a carry does not conform to expectations, feel free to revise and note the change in the spreadsheet row for the commit.

If you first pick all the pick+squash commits first and push them for review it is easier for you and your reviewers to check the code changes and you squash it at the end.

When filling in Clean column in the spreadsheet make sure to use the following number to express the complexity of the pick:

0 - clean
1 - format fixups
2 - code fixups
3 - logic changes

Explicit commit rules:

Anything touching openshift-hack/, openshift specific READMEs or similar files should be squashed to 1 commit named "UPSTREAM: : Add OpenShift specific files"
Updating generated files coming from kubernetes should be <drop> commit
Generated changes should never be mixed with non-generated changes. If a carry is ever seen to contain generated changes, those changes should be dropped.

Update the hyperkube image version to the release tag

The hyperkube dockerfile hard-codes the Kubernetes version in an image label. It's necessary to manually set this label to the new release tag. Prefix the commit summary with UPSTREAM: <carry>: (squash) and squash it before merging the rebase PR.

Updating dependencies

Once the commits are all picked from the previous rebase branch, and your PR is mostly ready, each of the following repositories need to be updated to depend on the upstream tag targeted by the rebase:

https://github.com/openshift/api
https://github.com/openshift/apiserver-library-go
https://github.com/openshift/client-go
https://github.com/openshift/library-go

Often these repositories are updated in parallel by other team members, so make sure to ask around before starting the work of bumping their dependencies.

Once the above repos have been updated to depend on the target release, it will be necessary to update go.mod to point to the appropriate revision of these repos by running hack/pin-dependency.sh for each of them and then running hack/update-vendor.sh (as per the upstream documentation).

Make sure to commit the result of a vendoring update with UPSTREAM: <drop>: bump(*). If you have already bumped the dependencies to get the repo to compile, don't forget to squash the commits before merging the PR.

Updating dependencies for pending bumps

The upstream hack/pin-dependency.sh script only supports setting dependency for the original repository. To pin to a fork branch that has not yet been merged (i.e. to test a rebase ahead of shared library bumps having merged), the following go mod invocations are suggested:

go mod edit -replace github.com/openshift/<lib>=github.com/<username>/<lib>@SHA
go mod tidy && go mod vendor

Alternatively, you can edit go.mod file manually with your favourite editor and use search&replace.

Review test annotation rules

The names of upstream e2e tests are annotated according to the a set of declarative rules. These annotations are used to group tests into suites and to skip tests that are known not to be incompatible with some or all configurations of OpenShift.

When performing a rebase, it is important to review the rules to ensure they are still relevant:

Ensure that [Disabled:Alpha] rules are appropriate for the current kube level. Alpha features that are not enabled by default should be targeted by this annotation to ensure that tests of those features are skipped.
Add new skips (along with a bz to track resolution) where e2e tests fail consistently.

Test failures representing major issues affecting cluster capability will generally need to be addressed before merge of the rebase PR, but minor issues (e.g. tests that fail to execute correctly but don't appear to reflect a regression in behavior) can often be skipped and addressed post-merge.

Updating generated files

Update generated files by running make update
- This step depends on etcd being installed in the path, which can be accomplished by running hack/install-etcd.sh.
- Alternatively, run it in the same container as CI is using for build_root that already has the etcd at correct version

podman run -it --rm -v $( pwd ):/go/k8s.io/kubernetes:Z --workdir=/go/k8s.io/kubernetes registry.ci.openshift.org/openshift/release:rhel-8-release-golang-1.16-openshift-4.8 make update OS_RUN_WITHOUT_DOCKER=yes

Commit the resulting changes as UPSTREAM: <drop>: make update.

Building and testing

Build the code with make
Test the code with make test
- Where test failures are encountered and can't be trivially resolved, the spreadsheet can be used to track those failures to their resolution. The example spreadsheet should have a sheet that demonstrates this tracking.
- Where a test failure proves challenging to fix without specialized knowledge, make sure to coordinate with the team(s) responsible for area(s) of focus exhibiting test failure. If in doubt, ask for help!
Verify the code with make verify

Reacting to new commits

Inevitably, a rebase will take long enough that new commits will end up being merged to the targeted openshift/kubernetes branch after the rebase is underway. The following strategy is suggested to minimize the cost of incorporating these new commits:

rename existing rebase branch (e.g. 1.20.0-beta.2 -> 1.20.0-beta.2-old)
create new rebase branch from HEAD of master
merge the target upstream tag (e.g. 1.20.0-beta.2) with strategy ours
pick all carries from renamed rebase branch (e.g. 1.20.0-beta.2-old)
pick new carries from the openshift/kubernetes target branch
add details of the new carries to the spreadsheet
update generated files

With good tooling, the cost of this procedure should be ~10 minutes at most. Re-picking carries should not result in conflicts since the base of the rebase branch will be the same as before. The only potential sources of conflict will be the newly added commits.

Send email announcing upcoming merge

Second email should be send close O(~3 days) to merging the bump:

Title: k8s <version> bump landing...

<URL to your o/k PR> is bumping k8s to version <version>.
The following repositories have been already bumped as well:

<URLs to api/client-go/library-go/apiserver-library-go/operators>

Followup work has been assigned to appropriate teams
through bugzillas linked in the code. Please treat
them as the highest priority after landing the bump.

Finally, this means we are blocking ALL PRs to our
kubernetes fork.

After sending the email block the merge queue, see below.

Blocking the merge queue

Close to merging a rebase it is good practice to block any merges to openshift/kubernetes fork. To do that follow these steps:

Open new issues in openshift/kubernetes
Use Master Branch Frozen For Kubernetes Merging | branch:master as issue title
Add tide/merge-blocker label to issues (you might need group lead for this)
All PR's (including the rebase) are now forbidden to merge to master branch
Before landing the rebase PR, close this issue

Send email announcing work done

Last email should be send after merging the bump as a reply to previous:

<URL to your o/k PR> just merged.
It'll take some time to get newer kublet, but in the meantime we'll
continue to monitor CI. I encourage everyone to hold off from
merging any major changes to our kubernetes fork to provide clear CI
signal for the next 2-3 days.

Followup work

Update cluster-kube-apiserver-operator pre-release-lifecycle alert's removed_release version similarly to openshift/cluster-kube-apiserver-operator#1092.

Updating with `git merge` (experimental)

This is the preferred way to update to patch releases of kubernetes

After the initial bump as described above it is possible to update to newer released version using git merge. To do that follow these steps:

Push appropriate upstream tag you're updating to:

git push git@github.com:openshift/kubernetes.git refs/tags/v1.21.1

Revert any commits that were merged into kubernetes between previous update and current one.
git merge <new_version>, example git merge v1.21.1. Most likely you'll encounter conflicts, but most are around go.sum and go.mod, coming from newer versions. Manually verify them, and in most cases pick the newer versions of deps. This will be properly update when re-runing hack/update-vendor.sh in the next step.
Update openshift dependencies and re-run hack/update-vendor.sh.
Run make update see Updating generated files.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

REBASE.openshift.md

REBASE.openshift.md

Maintaining openshift/kubernetes

Rebasing for releases < 4.6

Maintaining this document

Rebase Checklists

Getting started

Send email annoucing you're starting work

Preparing the local repo clone

Creating a new local branch for the new rebase

Creating a spreadsheet of carry commits from the previous release

Picking commits from the previous rebase branch to the new branch

Update the hyperkube image version to the release tag

Updating dependencies

Updating dependencies for pending bumps

Review test annotation rules

Updating generated files

Building and testing

Reacting to new commits

Send email announcing upcoming merge

Blocking the merge queue

Send email announcing work done

Followup work

Updating with `git merge` (experimental)

Files

REBASE.openshift.md

Latest commit

History

REBASE.openshift.md

File metadata and controls

Maintaining openshift/kubernetes

Rebasing for releases < 4.6

Maintaining this document

Rebase Checklists

Getting started

Send email annoucing you're starting work

Preparing the local repo clone

Creating a new local branch for the new rebase

Creating a spreadsheet of carry commits from the previous release

Picking commits from the previous rebase branch to the new branch

Update the hyperkube image version to the release tag

Updating dependencies

Updating dependencies for pending bumps

Review test annotation rules

Updating generated files

Building and testing

Reacting to new commits

Send email announcing upcoming merge

Blocking the merge queue

Send email announcing work done

Followup work

Updating with git merge (experimental)

Updating with `git merge` (experimental)