Initial multi-stream enhancement #1692

sdodson · 2024-10-04T15:00:01Z

Copied from https://hackmd.io/q1Txm0twTYatSvSf1vNFeA?view where there's some comments which warrant understanding.

cgwalters

Looks sane to me overall

cgwalters · 2024-10-08T17:57:40Z

enhancements/rhcos/multi-stream.md

+At install time the **cluster creator** will either specify the desired os
+for `ControlPlane` and `Compute` or not, if they provide no value the installer
+is to render the current default stream into relevant resources.


Is there strong value in exposing this via install config versus just supporting it as "day 0" machineconfig?

I guess it doesn't have to be, for all platforms we already allow overriding the boot image and you could patch MCP manifests as well.

cgwalters · 2024-10-08T17:59:49Z

enhancements/rhcos/multi-stream.md

+Whenever compute resources aren't elastic should we support a special mode where the host
+OS is reinstalled across versions specifically NOT preserving any data / config?
+
+#### Single-node Deployments or MicroShift


Today MicroShift is pretty different in that the user chooses the base OS version already.

cgwalters · 2024-10-08T18:01:40Z

enhancements/rhcos/multi-stream.md

+When the installer is built for OCP valid `osStream` must start with "rhcos"
+and match the name of a file in data/data/coreos/
+
+Somewhere in MCO's templates/ add streams/{rhcos-9,rhcos-10} anything outside


What I suspect we may need to add to the MCO is a conditional like this for user provided machineconfig.

If we hold that stream is configured at MCP level and that value is effectively immutable, wouldn't they just provide MachineConfig that matches the labels for each pool?

Need to vet the viability of stream being immutable with how we would potentially handle in-place major OS upgrades. My thinking for now is that entails moving a node from one pool to another rather than reconfiguring a pool for a different stream.

cgwalters · 2024-10-08T18:03:17Z

enhancements/rhcos/multi-stream.md

+recorded in /etc/os-release or other available facilities. There's probably
+some systemd magic here or something. This probably also pushes more static


Yes, we I think we need to make it easy dispatch on the OS major/minor; this gets into a lot of details around our use of VERSION_ID and how that looks...and the fact that today our versions include the OCP version and the OS version...

cgwalters · 2024-10-08T18:06:31Z

There's intersections with OLM here right? Does it already exist as a way for operators to declare their compatible OS versions?

openshift-ci · 2024-10-09T03:37:45Z

[APPROVALNOTIFIER] This PR is NOT APPROVED

This pull-request has been approved by:
Once this PR has been reviewed and has the lgtm label, please ask for approval from sdodson. For more information see the Kubernetes Code Review Process.

The full list of commands accepted by this bot can be found here.

Needs approval from an approver in each of these files:

OWNERS

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

sdodson · 2024-10-09T03:53:45Z

There's intersections with OLM here right? Does it already exist as a way for operators to declare their compatible OS versions?

@cgwalters dd32891 adds my opinion but I've put it as an open question because I think it warrants additional discussion.

I had previously asked if we had a way to determine which operators requested RBAC necessary to run privileged containers and we may or may not be able to assess that. I'm hoping to hear back on that soon so we can perform some more targeted analysis. However I assume that those who truly become part of the OS will have motivation not to ignore RHEL10, such as GPU management operators who embed drivers. They may skimp on the openshift operator side but it seems like they won't forego RHEL10 drivers all together and hopefully the additional operator work is minimal.

cgwalters · 2024-10-10T18:28:02Z

enhancements/rhcos/multi-stream.md

+* As an OpenShift admin adding newer hardware to an existing cluster I want the
+new hardware to boot, run, and update from a specific OS stream.
+
+* As an OpenShift admin wishing to migrate existing hosts to a newer stream I


We debated this live, but I want to write things down here for consistency. I have no opposition to making it obvious/easy for admins to use CAPI to spawn separate RHEL-$next hosts for testing, etc.

But I am pretty confident in stating that we can support seamless in-place upgrades from 9 to 10 for the majority of e.g. cloud-deployed clusters. Take our own Prow clusters for example. I bet we can just flip the flag to run those on rhel10 and watch it roll out in place by default and it would Just Work.

Initial multi-stream enhancement

1220aaa

Copied from https://hackmd.io/q1Txm0twTYatSvSf1vNFeA?view where there's some comments which warrant understanding.

openshift-ci bot added the do-not-merge/work-in-progress Indicates that a PR should not merge because it is a work in progress. label Oct 4, 2024

cgwalters reviewed Oct 8, 2024

View reviewed changes

sdodson added 2 commits October 8, 2024 23:35

SNO and Microshift

207df98

Clarify a few places

6be2776

Add open question about OLM catalog content

dd32891

cgwalters reviewed Oct 10, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Initial multi-stream enhancement #1692

Initial multi-stream enhancement #1692

sdodson commented Oct 4, 2024

cgwalters left a comment

cgwalters Oct 8, 2024

sdodson Oct 9, 2024

cgwalters Oct 8, 2024

cgwalters Oct 8, 2024

sdodson Oct 9, 2024

cgwalters Oct 8, 2024

cgwalters commented Oct 8, 2024 •

edited

Loading

openshift-ci bot commented Oct 9, 2024

sdodson commented Oct 9, 2024 •

edited

Loading

cgwalters Oct 10, 2024

		recorded in /etc/os-release or other available facilities. There's probably
		some systemd magic here or something. This probably also pushes more static

Initial multi-stream enhancement #1692

Are you sure you want to change the base?

Initial multi-stream enhancement #1692

Conversation

sdodson commented Oct 4, 2024

cgwalters left a comment

Choose a reason for hiding this comment

cgwalters Oct 8, 2024

Choose a reason for hiding this comment

sdodson Oct 9, 2024

Choose a reason for hiding this comment

cgwalters Oct 8, 2024

Choose a reason for hiding this comment

cgwalters Oct 8, 2024

Choose a reason for hiding this comment

sdodson Oct 9, 2024

Choose a reason for hiding this comment

cgwalters Oct 8, 2024

Choose a reason for hiding this comment

cgwalters commented Oct 8, 2024 • edited Loading

openshift-ci bot commented Oct 9, 2024

sdodson commented Oct 9, 2024 • edited Loading

cgwalters Oct 10, 2024

Choose a reason for hiding this comment

cgwalters commented Oct 8, 2024 •

edited

Loading

sdodson commented Oct 9, 2024 •

edited

Loading