
Add ManagedClusterVersion CRD #1578

Draft: wants to merge 1 commit into master

Conversation

@2uasimojo (Member) commented Feb 27, 2024

Propose an enhancement to introduce a new CRD, ManagedClusterVersion. This is a namespaced object to be used by fleet management software to provide a common view into managed clusters' version/upgrade information.

HIVE-2366
HIVE-2428
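For concreteness, a ManagedClusterVersion CR under this proposal might look roughly like the following. The schema is not spelled out in this excerpt, so the group/version and every field name here are illustrative guesses, loosely mirroring ClusterVersion's desired-version/available-updates shape and the manager-prefixed naming discussed later in this thread:

```yaml
apiVersion: fleet.example.io/v1alpha1   # hypothetical group/version
kind: ManagedClusterVersion
metadata:
  name: hive-mycluster        # manager-prefixed name (assumed scheme)
  namespace: mycluster        # namespaced, alongside the ClusterDeployment
status:
  desired:
    version: 4.15.2
  availableUpdates:
    - version: 4.15.3
  conditions:
    - type: Progressing
      status: "False"
```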

@openshift-ci openshift-ci bot added the do-not-merge/work-in-progress Indicates that a PR should not merge because it is a work in progress. label Feb 27, 2024
openshift-ci bot (Contributor) commented Feb 27, 2024

Skipping CI for Draft Pull Request.
If you want CI signal for your change, please convert it to an actual PR.
You can still manually trigger a test run with /test all

openshift-ci bot (Contributor) commented Feb 27, 2024

[APPROVALNOTIFIER] This PR is NOT APPROVED

This pull-request has been approved by:
Once this PR has been reviewed and has the lgtm label, please ask for approval from 2uasimojo. For more information see the Kubernetes Code Review Process.

The full list of commands accepted by this bot can be found here.

Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment


If SNO/MicroShift clusters are part of a fleet, their fleet manager may
broker their ClusterVersion objects in the manner described [above](#workflow-description).
In this scenario they are the same as any other OpenShift spoke.
Contributor

The intent is to use ACM to manage some aspects of MicroShift deployments. I don't know if hive is involved in the integration between ACM and MicroShift.

MicroShift does not have a ClusterVersion API because upgrades are not driven by the CVO. MicroShift uses a ConfigMap to report its version data.

If hive is part of the integration of ACM and MicroShift, will hive have a separate implementation of where to get the version details for MicroShift?

If hive is not present, would something else need to create the ManagedClusterVersion CR in the ACM hub cluster? What will do that?
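Since MicroShift reports its version through a ConfigMap rather than a ClusterVersion object, whatever creates the ManagedClusterVersion CR for a MicroShift spoke would need to translate between the two shapes. A minimal sketch of that translation, assuming key names ("major", "minor", "version") that are not a documented contract:

```python
def version_from_microshift_configmap(data):
    """Normalize a MicroShift version ConfigMap's .data into a
    ClusterVersion-like record. The key names ("major", "minor",
    "version") are assumptions, not a documented contract."""
    return {
        "desiredVersion": data.get("version", ""),
        "major": data.get("major", ""),
        "minor": data.get("minor", ""),
    }

# Hypothetical .data payload of such a ConfigMap:
cm_data = {"major": "4", "minor": "14", "version": "4.14.5"}
print(version_from_microshift_configmap(cm_data)["desiredVersion"])  # → 4.14.5
```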

Member Author

MicroShift does not have a ClusterVersion API

I didn't realize.

MicroShift uses a ConfigMap to report its version data.

Does that ConfigMap have the same scope of information as ClusterVersion? Not that we need to get into it here, but if so... why wouldn't we be using CVO?

If hive is part of the integration of ACM and MicroShift

It's not. Uh, unless Assisted supports MicroShift? Does it?

If hive is not present, would something else need to create the ManagedClusterVersion CR in the ACM hub cluster? What will do that?

Yes, exactly the point of making this CRD common rather than scoped to hive. In the case of hypershift, the idea is for hypershift to do it. If there are other fleet manager thingies in the world, they would (or could) do the same.

In the ACM scenario, both hive and hypershift would be present, each managing their own subset of clusters, each generating ManagedClusterVersion CRs for their subset, resulting in (identically-schemaed) objects for every spoke the ACM instance manages. That's the dream :)

Member Author

I confirmed that Assisted doesn't do MicroShift, so I think we're in the clear here in terms of hive (not) having to understand the ConfigMap thing.

That doesn't mean ACM doesn't/won't support MicroShift, but since upgrades there are such a different beast, I don't imagine they'll be using this mechanism at all. I'll update accordingly.

enhancement, the \*CM layer is responsible for driving upgrades. To do so,
it needs visibility into the spoke cluster's ClusterVersion data. Today the
only mechanisms available for accessing this information entail logging into,
or running an agent in, the spoke cluster. This is not ideal:
Member

What is the example of the agent? Klusterlet (https://operatorhub.io/operator/klusterlet) ?

Member Author

I think so, yes.

Except does klusterlet have a way to initiate communication with the hub, or does it only pull from the hub?

I get confused with the different OCMs -- is this the one ACM uses? Does Hypershift use this one as well?

In any case, is it worth mentioning/discussing in the document?

want a common way to view version and upgrade information, regardless of the
software layer between me and the spokes, so that I can simplify my code,
reduce my test surface, and spend less on maintenance.

Member

As SREs, we want to get the recommended version information from the cluster-version-operator, because it has the capability to evaluate conditional update risks and come up with recommended updates.

Member

We might need to expand this a little more. Let me know if you need more context on this.

Member Author

I'll add your suggestion above. What else are you thinking?

Propose an enhancement to introduce a new CRD, ManagedClusterVersion.
This is a *namespaced* object to be used by fleet management software to
provide a common view into managed clusters' version/upgrade
information.

HIVE-2366
HIVE-2428
@2uasimojo force-pushed the HIVE-2428/ManagedClusterVersion branch from 0e67aa6 to 16f08dd on March 6, 2024 at 23:41

# ManagedClusterVersion CRD

## Summary
Contributor

This summary looks similar to https://github.com/kubernetes/enhancements/tree/master/keps/sig-multicluster/4322-cluster-inventory. Determining whether that API is appropriate here, and how we would build extensions, could assist both efforts.

Member Author

Okay, I've read that KEP. It seems intentionally vague and non-prescriptive, and also not far enough along to obviate the need for us to invent pieces that are out of its scope. As currently proposed, the ManagedClusterVersion CRD is not intended to replace or wrap the ClusterDeployment/HostedCluster, nor to satisfy most of the use cases described (or hinted at) in the KEP. IMHO attempting to design in anticipation of that goal would a) be impossible; and b) inflate the effort and extend the timeline untenably.

I can see this EP incorporating a spec.clusterManager.name field and a matching x-k8s.io/cluster-manager label on the proposed CRD, if you think that's a good idea.

Re generated names: I can see value in prefixing the name of the ManagedClusterVersion CRD with the name of its manager (hive-$cdname/hypershift-$hcname) to preclude conflicts in cases where a single hub is managing spokes under different managers. However, I don't see value in adding a unique slug. In fact, I see it being beneficial not to do that, as I can map deterministically between the two CRDs without needing to rely on further labels/fields. Thoughts?
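The deterministic mapping argued for above can be sketched in a few lines. The function names and the manager/owner terminology here are invented for illustration; the point is just that a prefix-only scheme (hive-$cdname, hypershift-$hcname) is reversible without extra labels or fields:

```python
def mcv_name(manager, owner_name_str):
    """Derive a ManagedClusterVersion name from its manager and the
    owning CR's name (ClusterDeployment or HostedCluster) -- the
    hive-$cdname / hypershift-$hcname scheme discussed above.
    No unique slug, so the mapping stays reversible."""
    return f"{manager}-{owner_name_str}"

def owner_name(manager, name):
    """Invert mcv_name: recover the owning CR's name."""
    prefix = manager + "-"
    if not name.startswith(prefix):
        raise ValueError(f"{name} was not generated by {manager}")
    return name[len(prefix):]

print(mcv_name("hive", "mycluster"))         # → hive-mycluster
print(owner_name("hive", "hive-mycluster"))  # → mycluster
```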

want a common way to view version and upgrade information, regardless of the
software layer between me and the spokes, so that I can simplify my code,
reduce my test surface, and spend less on maintenance.
* As a Site Reliability Engineer (SRE) I want to get the recommended version
Contributor

Does it make more sense to develop a tool for SRE that sits outside of a cluster and scans clusters instead of running the agent in every cluster?

Member Author

That's actually exactly what this proposal is all about. Hive and hypershift are exactly such tools today: they sit on a hub cluster and collect data from the spokes*. This proposal is about adding ClusterVersion data to what is collected, and doing it in a CRD that both hive and hypershift (and others) can share.

*Though TBH I don't know whether hypershift does it via an in-cluster agent that reports back to the hub. Hive for sure does not -- the controller on the hub polls spoke clusters via clients constructed from admin kubeconfigs.
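The hub-side pattern described here can be sketched minimally as follows. All names (SpokeClient, the record shape, the dict store) are invented for illustration; hive's actual controllers are Go, and this only shows the poll-and-mirror idea, not real client machinery:

```python
class SpokeClient:
    """Stand-in for a client constructed from a spoke's admin kubeconfig."""
    def __init__(self, name, cluster_version):
        self.name = name
        self._cv = cluster_version

    def get_cluster_version(self):
        # In reality this would be an API call against the spoke.
        return self._cv

def sync_managed_cluster_versions(spokes, hub_store):
    """Poll every spoke and upsert a ManagedClusterVersion-like record,
    keyed by spoke name, into the hub's store (a plain dict here)."""
    for spoke in spokes:
        hub_store[spoke.name] = {"clusterVersion": spoke.get_cluster_version()}
    return hub_store

store = {}
spokes = [SpokeClient("spoke-a", {"desired": "4.15.2"}),
          SpokeClient("spoke-b", {"desired": "4.14.9"})]
sync_managed_cluster_versions(spokes, store)
print(sorted(store))  # → ['spoke-a', 'spoke-b']
```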

@openshift-bot

Inactive enhancement proposals go stale after 28d of inactivity.

See https://github.com/openshift/enhancements#life-cycle for details.

Mark the proposal as fresh by commenting /remove-lifecycle stale.
Stale proposals rot after an additional 7d of inactivity and eventually close.
Exclude this proposal from closing by commenting /lifecycle frozen.

If this proposal is safe to close now please do so with /close.

/lifecycle stale

@openshift-ci openshift-ci bot added the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Apr 16, 2024
@openshift-bot

Stale enhancement proposals rot after 7d of inactivity.

See https://github.com/openshift/enhancements#life-cycle for details.

Mark the proposal as fresh by commenting /remove-lifecycle rotten.
Rotten proposals close after an additional 7d of inactivity.
Exclude this proposal from closing by commenting /lifecycle frozen.

If this proposal is safe to close now please do so with /close.

/lifecycle rotten
/remove-lifecycle stale

@openshift-ci openshift-ci bot added lifecycle/rotten Denotes an issue or PR that has aged beyond stale and will be auto-closed. and removed lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. labels Apr 23, 2024
@2uasimojo (Member Author) left a comment

/remove-lifecycle rotten



@openshift-ci openshift-ci bot removed the lifecycle/rotten Denotes an issue or PR that has aged beyond stale and will be auto-closed. label Apr 23, 2024
@openshift-bot

Inactive enhancement proposals go stale after 28d of inactivity.

See https://github.com/openshift/enhancements#life-cycle for details.

Mark the proposal as fresh by commenting /remove-lifecycle stale.
Stale proposals rot after an additional 7d of inactivity and eventually close.
Exclude this proposal from closing by commenting /lifecycle frozen.

If this proposal is safe to close now please do so with /close.

/lifecycle stale

@openshift-ci openshift-ci bot added the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label May 22, 2024
@openshift-bot

Stale enhancement proposals rot after 7d of inactivity.

See https://github.com/openshift/enhancements#life-cycle for details.

Mark the proposal as fresh by commenting /remove-lifecycle rotten.
Rotten proposals close after an additional 7d of inactivity.
Exclude this proposal from closing by commenting /lifecycle frozen.

If this proposal is safe to close now please do so with /close.

/lifecycle rotten
/remove-lifecycle stale

@openshift-ci openshift-ci bot added lifecycle/rotten Denotes an issue or PR that has aged beyond stale and will be auto-closed. and removed lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. labels May 29, 2024
@openshift-bot

Rotten enhancement proposals close after 7d of inactivity.

See https://github.com/openshift/enhancements#life-cycle for details.

Reopen the proposal by commenting /reopen.
Mark the proposal as fresh by commenting /remove-lifecycle rotten.
Exclude this proposal from closing again by commenting /lifecycle frozen.

/close

@openshift-ci openshift-ci bot closed this Jun 6, 2024
openshift-ci bot (Contributor) commented Jun 6, 2024

@openshift-bot: Closed this PR.

In response to this:

Rotten enhancement proposals close after 7d of inactivity.

See https://github.com/openshift/enhancements#life-cycle for details.

Reopen the proposal by commenting /reopen.
Mark the proposal as fresh by commenting /remove-lifecycle rotten.
Exclude this proposal from closing again by commenting /lifecycle frozen.

/close

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository.

@2uasimojo
Member Author

/cc @derekwaynecarr @csrwng

@2uasimojo
Member Author

/cc @jnpacker @vkareh @JoelSpeed @berenss

@openshift-ci openshift-ci bot requested a review from berenss July 24, 2024 18:48
@2uasimojo
Member Author

/cc @jupierce

@openshift-ci openshift-ci bot requested a review from jupierce July 24, 2024 18:51
@2uasimojo
Member Author

Note to self: address how the CRD is lifecycled on a given hub. Maybe each controller ensures it is at least the max version it can handle: upgrade if lower, no-op if it is already greater or equal.
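That lifecycling rule can be sketched in a few lines. Representing CRD versions as comparable tuples is a simplification of my own; real CRD versioning (v1alpha1, v1beta1, ...) would need its own ordering:

```python
def desired_crd_action(installed, supported):
    """Sketch of the lifecycling idea above: each controller ensures the
    shared CRD is at least the newest schema version it understands.
    Upgrade if the installed version is lower (or absent); otherwise
    no-op, so a newer peer's install is never downgraded."""
    if installed is None or installed < supported:
        return "upgrade"
    return "no-op"

print(desired_crd_action((1, 0), (1, 2)))  # → upgrade
print(desired_crd_action((1, 3), (1, 2)))  # → no-op
```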

@LalatenduMohanty
Member

/remove-lifecycle rotten

@LalatenduMohanty
Member

/lifecycle frozen

@openshift-ci openshift-ci bot removed the lifecycle/rotten Denotes an issue or PR that has aged beyond stale and will be auto-closed. label Aug 27, 2024
openshift-ci bot (Contributor) commented Aug 27, 2024

@LalatenduMohanty: The lifecycle/frozen label cannot be applied to Pull Requests.

In response to this:

/lifecycle frozen

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository.

@2uasimojo
Member Author

/reopen
/remove-lifecycle rotten
/lifecycle frozen

@openshift-ci openshift-ci bot reopened this Aug 27, 2024
openshift-ci bot (Contributor) commented Aug 27, 2024

@2uasimojo: Reopened this PR.

In response to this:

/reopen
/remove-lifecycle rotten
/lifecycle frozen

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository.

openshift-ci bot (Contributor) commented Aug 27, 2024

@2uasimojo: The lifecycle/frozen label cannot be applied to Pull Requests.

In response to this:

/reopen
/remove-lifecycle rotten
/lifecycle frozen

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository.

@openshift-bot

Inactive enhancement proposals go stale after 28d of inactivity.

See https:/openshift/enhancements#life-cycle for details.

Mark the proposal as fresh by commenting /remove-lifecycle stale.
Stale proposals rot after an additional 7d of inactivity and eventually close.
Exclude this proposal from closing by commenting /lifecycle frozen.

If this proposal is safe to close now please do so with /close.

/lifecycle stale

@openshift-ci openshift-ci bot added the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Sep 25, 2024
@2uasimojo
Member Author

/remove-lifecycle stale

@openshift-ci openshift-ci bot removed the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Sep 25, 2024