
[Feature] Support for anti-affinity/affinity rules for the created machines #175

Closed
sidharthsurana opened this issue Jan 22, 2019 · 35 comments
Assignees
Labels
kind/feature Categorizes issue or PR as related to a new feature. lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. priority/important-longterm Important over the long term, but may not be staffed and/or may need multiple releases to complete.

Comments

@sidharthsurana
Contributor

vSphere DRS supports defining anti-affinity/affinity rules for VMs. This feature is to add support for the user to specify affinity/anti-affinity grouping for the VMs.

Use case: A user creates 3 Machine objects and wants all 3 VMs to run on different hosts to improve resiliency against host failures. This can easily be realized by creating an anti-affinity rule for the 3 VMs.
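
For context, a minimal govmomi sketch (assuming an authenticated client) of what creating such a DRS VM anti-affinity rule looks like at the vSphere API level. This is an illustration only, not the provider's implementation; clusterPath, ruleName and vmRefs are placeholders supplied by the caller.

    // Sketch: create a DRS VM anti-affinity rule for the VMs backing a set of
    // Machine objects. Assumes an authenticated govmomi client; clusterPath,
    // ruleName and vmRefs are caller-supplied placeholders.
    package drsrules

    import (
        "context"

        "github.com/vmware/govmomi"
        "github.com/vmware/govmomi/find"
        "github.com/vmware/govmomi/vim25/types"
    )

    func AddAntiAffinityRule(ctx context.Context, c *govmomi.Client, clusterPath, ruleName string, vmRefs []types.ManagedObjectReference) error {
        finder := find.NewFinder(c.Client, false)

        // clusterPath is a full inventory path, e.g. "/DC0/host/Cluster0" (placeholder).
        cluster, err := finder.ClusterComputeResource(ctx, clusterPath)
        if err != nil {
            return err
        }

        spec := &types.ClusterConfigSpecEx{
            RulesSpec: []types.ClusterRuleSpec{{
                ArrayUpdateSpec: types.ArrayUpdateSpec{Operation: types.ArrayUpdateOperationAdd},
                Info: &types.ClusterAntiAffinityRuleSpec{
                    ClusterRuleInfo: types.ClusterRuleInfo{
                        Name:      ruleName,
                        Enabled:   types.NewBool(true),
                        Mandatory: types.NewBool(true),
                    },
                    Vm: vmRefs, // references to the VMs backing the Machines
                },
            }},
        }

        // Reconfigure the cluster to add the rule and wait for the task.
        task, err := cluster.Reconfigure(ctx, spec, true)
        if err != nil {
            return err
        }
        return task.Wait(ctx)
    }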

@frapposelli
Member

/kind feature
/priority important-longterm
/assign @sidharthsurana

@k8s-ci-robot k8s-ci-robot added kind/feature Categorizes issue or PR as related to a new feature. priority/important-longterm Important over the long term, but may not be staffed and/or may need multiple releases to complete. labels Feb 1, 2019
@frapposelli frapposelli added this to the Next milestone Feb 1, 2019
@sflxn sflxn modified the milestones: Next, v1alpha1 Feb 1, 2019
@sflxn sflxn modified the milestones: v1alpha1, Next Mar 6, 2019
@sflxn

sflxn commented Mar 6, 2019

This could potentially get done for v1alpha1, but it isn't a feature that absolutely needs to go in for this release. If it gets done in time and the PR has been reviewed, we can make a judgment call at that time.

@fejta-bot

Issues go stale after 90d of inactivity.
Mark the issue as fresh with /remove-lifecycle stale.
Stale issues rot after an additional 30d of inactivity and eventually close.

If this issue is safe to close now please do so with /close.

Send feedback to sig-testing, kubernetes/test-infra and/or fejta.
/lifecycle stale

@k8s-ci-robot k8s-ci-robot added the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Jun 4, 2019
@moshloop

moshloop commented Jun 7, 2019

Any reason why this can't be supported without DRS? I'm not sure DRS works across multiple clusters, and scheduling across multiple clusters in a vCenter seems like a reasonable thing to do.

@akutz
Contributor

akutz commented Jun 7, 2019

This may need to be punted to v1alpha2. If not, we need to resource this work yesterday.

cc @frapposelli

@frapposelli
Member

@moshloop affinity/anti-affinity rules are defined within a DRS-enabled cluster, and a cluster is also a fault-domain boundary for vSphere. Deploying across multiple clusters should be possible today using MachineSets.

@akutz this is potentially a noop (see kubernetes/cloud-provider-vsphere#179), but even if not, it definitely needs to be punted to v1alpha2.

@fejta-bot

Stale issues rot after 30d of inactivity.
Mark the issue as fresh with /remove-lifecycle rotten.
Rotten issues close after an additional 30d of inactivity.

If this issue is safe to close now please do so with /close.

Send feedback to sig-testing, kubernetes/test-infra and/or fejta.
/lifecycle rotten

@k8s-ci-robot k8s-ci-robot added lifecycle/rotten Denotes an issue or PR that has aged beyond stale and will be auto-closed. and removed lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. labels Jul 7, 2019
@moshloop

moshloop commented Jul 8, 2019

/remove-lifecycle rotten

@k8s-ci-robot k8s-ci-robot removed the lifecycle/rotten Denotes an issue or PR that has aged beyond stale and will be auto-closed. label Jul 8, 2019
@sujeet-banerjee

Any reason why this can't be supported without DRS? I'm not sure DRS works across multiple clusters, and scheduling across multiple clusters in a vCenter seems like a reasonable thing to do.

Short answer: No. Without DRS, it won't be effective.

Long answer:
#1 One may use VM-Host rules without DRS enabled on a vSphere cluster. However, the worker nodes will not be automatically migrated (vMotioned) to maintain a balanced spread of nodes across hosts (i.e. HA in the true sense). And the CRDs do not expose host details.
#2 That also means vMotion must be enabled for the hosts.
More details: https://docs.vmware.com/en/VMware-vSphere/6.7/com.vmware.vsphere.resmgmt.doc/GUID-7D3ABD21-4524-42E9-B7FE-6AAF6766433B.html

I have been working on a design proposal for supporting anti-affinity/affinity. Attached:
Spec_changes_for_AntiAffinity.docx
Test_n_Demo.pdf

@moshloop

moshloop commented Aug 6, 2019

@sujeet-banerjee Can you create a google doc for commenting and review?

vMotion is something I believe should be turned off for Kubernetes clusters, as it conflicts with Kubernetes' view of the system. If a host fails, Kubernetes should reschedule the pods that become NotReady; the Cluster API MachineSet should then detect unresponsive nodes and create new ones.

The same applies to DRS: it should be turned off, and Kubernetes-native components like the descheduler used to move and rebalance workloads.

Without DRS or vMotion, machine affinity/anti-affinity can easily be implemented when creating a machine by listing all available hosts/clusters and running through the rules list, in the same way pod affinity is implemented; a rough sketch of that idea follows below.
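
To make that concrete, here is a hedged Go sketch of such a DRS-free placement pass: list candidate hosts, exclude those already running a VM from the same anti-affinity group, and pick the least loaded of the remainder. The Host type and PickHost helper are hypothetical, for illustration only, not an agreed design.

    // Sketch of DRS-free anti-affinity placement, as described above: list the
    // candidate hosts, drop those already running a VM from the same
    // anti-affinity group, and pick the least-loaded host that remains.
    // The Host type and PickHost helper are hypothetical.
    package placement

    import (
        "errors"
        "sort"
    )

    type Host struct {
        Name    string
        VMCount int // number of VMs currently on the host
    }

    // PickHost chooses a host for a new machine in a given anti-affinity group.
    // usedByGroup is the set of host names already running a VM from that group.
    func PickHost(candidates []Host, usedByGroup map[string]bool) (Host, error) {
        var eligible []Host
        for _, h := range candidates {
            if !usedByGroup[h.Name] {
                eligible = append(eligible, h)
            }
        }

        // Hard (required) anti-affinity: fail if every host already runs a VM
        // from the group. A soft (preferred) rule would fall back to the full
        // candidate list here instead.
        if len(eligible) == 0 {
            return Host{}, errors.New("no host satisfies the anti-affinity rule")
        }

        // Prefer the least-loaded eligible host, i.e. a simple spread policy.
        sort.Slice(eligible, func(i, j int) bool {
            return eligible[i].VMCount < eligible[j].VMCount
        })
        return eligible[0], nil
    }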

@andrewsykim
Member

Can you create a google doc for commenting and review?

+1 to this, please use Google docs so we can comment/review :)

@sujeet-banerjee

sujeet-banerjee commented Aug 7, 2019

use Google Docs...

Could you point me to the shared location where I could add the doc?

@moshloop

Without DRS or vMotion, machine affinity/antiAffinity can easily be implemented when creating a machine by listing all available hosts/clusters and running through rules list in the same way pod affinity is implemented.

I somewhat disagree on a few things:
#1 In my opinion, it's not a good idea to tie/specify host details in the affinity definition (Machine/MachineSet CRDs). As I understand it, ESXi hosts may be added or taken down at the will of the end users within a vSphere cluster.
#2 Similarly, enabling/disabling DRS should be the end users' choice. An end user may want to use DRS features, and CAPI/Kubernetes should not be restrictive against using them.

@sujeet-banerjee

Proposal Doc: https://docs.google.com/document/d/1fNm53l5K0OfPrGc3zhjNDYzVQNBEc85uEJfNwtiDQrY

I would invite folks to review the proposal and add suggestions (if any).

Thanks,
Sujeet

@davidopp

davidopp commented Aug 8, 2019

I believe
kubernetes/enhancements#997
kubernetes/enhancements#1127
are related?

@brysonshepherd

@moshloop
Wouldn't it be better to vMotion, rather than build a whole new node on another host? To me that would lead to a longer time with pending pods.

@moshloop

moshloop commented Aug 8, 2019

@brysonshepherd vMotion requires shared storage, which is really expensive, slow, and unreliable (compared to just starting a new VM).

@moshloop

moshloop commented Aug 8, 2019

it's not a good idea to tie/specify host details in the affinity definition

Anti-affinity using a topology key would work as new hosts, clusters, and datacenters are added and removed:

e.g.

    vmAntiAffinity:
      requiredDuringSchedulingIgnoredDuringExecution:
        topologyKey: "vmware.io/hostname"

and to distribute nodes across multiple clusters:

    vmAntiAffinity:
      requiredDuringSchedulingIgnoredDuringExecution:
        topologyKey: "vmware.io/clustername"
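
If a field along those lines were added to this provider, a hypothetical Go API type could back the YAML above; none of these names exist in the codebase today, this is only a sketch of the shape the snippet implies.

    // Hypothetical API types mirroring the YAML above; these fields do not
    // exist in the current VSphereMachine API and are shown only to make the
    // proposed shape concrete.
    package api

    // VMAntiAffinity describes how the VMs created for Machines should be spread.
    type VMAntiAffinity struct {
        // RequiredDuringSchedulingIgnoredDuringExecution holds a hard anti-affinity
        // term, evaluated only at VM creation time (no vMotion afterwards).
        // Pod anti-affinity uses a list of terms here; a single term keeps the
        // sketch aligned with the YAML above.
        RequiredDuringSchedulingIgnoredDuringExecution *VMAffinityTerm `json:"requiredDuringSchedulingIgnoredDuringExecution,omitempty"`
    }

    // VMAffinityTerm selects the topology level to spread across, e.g.
    // "vmware.io/hostname" or "vmware.io/clustername".
    type VMAffinityTerm struct {
        TopologyKey string `json:"topologyKey"`
    }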

@brysonshepherd

brysonshepherd commented Aug 8, 2019

@moshloop

Some users may pay for that higher level of service to get higher uptime.

In my organization's current use case, DRS and vMotion are needed.

@moshloop

moshloop commented Aug 8, 2019

higher uptime

Shared storage is much more likely to reduce uptime than to increase it, especially when your application is stateless and doesn't need persistent storage. Even when an application requires clustered or replicated storage, the control plane (Kubernetes) should not share the same fault domain.

vMotion has its advantages and is often the easiest solution in terms of operational overhead, but if you want to achieve the highest levels of resilience and uptime, your Kubernetes nodes shouldn't be using it.

@brysonshepherd

@moshloop I'm not saying that vMotion is what I want; it is just faster than making a new node. If making a new node (along with rescheduling/starting up the pods) were faster, then that is what I would do. But it isn't, at least not that I'm aware of.

@moshloop

moshloop commented Aug 8, 2019

The time to recover a pod doesn't need to be instant, just reasonable. You should be running multiple pods with enough headroom to survive the loss of one pod. If you lose a physical ESXi host, the VM doesn't get vMotioned; it gets booted on a different host, potentially goes through disk crash recovery, and rejoins the cluster. In the meantime, the pods running on that node may have already been detected as down and rescheduled.

@fejta-bot

Issues go stale after 90d of inactivity.
Mark the issue as fresh with /remove-lifecycle stale.
Stale issues rot after an additional 30d of inactivity and eventually close.

If this issue is safe to close now please do so with /close.

Send feedback to sig-testing, kubernetes/test-infra and/or fejta.
/lifecycle stale

@k8s-ci-robot k8s-ci-robot added the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Nov 19, 2019
@akutz
Contributor

akutz commented Dec 17, 2019

ping @pdaigle

@fejta-bot

Stale issues rot after 30d of inactivity.
Mark the issue as fresh with /remove-lifecycle rotten.
Rotten issues close after an additional 30d of inactivity.

If this issue is safe to close now please do so with /close.

Send feedback to sig-testing, kubernetes/test-infra and/or fejta.
/lifecycle rotten

@k8s-ci-robot k8s-ci-robot added lifecycle/rotten Denotes an issue or PR that has aged beyond stale and will be auto-closed. and removed lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. labels Jan 16, 2020
@moshloop

/remove-lifecycle stale

@moshloop

/remove-lifecycle rotten

@k8s-ci-robot k8s-ci-robot removed the lifecycle/rotten Denotes an issue or PR that has aged beyond stale and will be auto-closed. label Jan 17, 2020
jayunit100 pushed a commit to jayunit100/cluster-api-provider-vsphere that referenced this issue Feb 26, 2020
- Set keyname on instances
- Better handle certificate missing from machine status in GetKubeConfig
@fejta-bot

Issues go stale after 90d of inactivity.
Mark the issue as fresh with /remove-lifecycle stale.
Stale issues rot after an additional 30d of inactivity and eventually close.

If this issue is safe to close now please do so with /close.

Send feedback to sig-testing, kubernetes/test-infra and/or fejta.
/lifecycle stale

@k8s-ci-robot k8s-ci-robot added the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Apr 16, 2020
@yastij yastij removed the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Apr 16, 2020
@vincepri
Member

/lifecycle frozen

@k8s-ci-robot k8s-ci-robot added the lifecycle/frozen Indicates that an issue or PR should not be auto-closed due to staleness. label Apr 16, 2020
@jayunit100
Contributor

Naively looking at this and wondering:

Since @srm09 added https://github.com/kubernetes-sigs/cluster-api-provider-vsphere/pull/1182/files, would it be possible to hijack VerifyAffinityRule(ctx computeClusterContext, clusterName, hostGroupName, vmGroupName string) (Rule, error) somehow, to do some kind of affinity/anti-affinity VM spreading not at a regional level, but just at an ESXi host level?

@srm09
Contributor

srm09 commented Jan 30, 2022

@jayunit100 I am not sure I quite understand what you are looking for here. Currently, multi-AZ allows you to attach VMs to specific hosts, if the host groups are set up to point to a particular host.
Can you elaborate on this ask?

@vincepri vincepri removed the lifecycle/frozen Indicates that an issue or PR should not be auto-closed due to staleness. label Jan 31, 2022
@k8s-triage-robot

The Kubernetes project currently lacks enough contributors to adequately respond to all issues and PRs.

This bot triages issues and PRs according to the following rules:

  • After 90d of inactivity, lifecycle/stale is applied
  • After 30d of inactivity since lifecycle/stale was applied, lifecycle/rotten is applied
  • After 30d of inactivity since lifecycle/rotten was applied, the issue is closed

You can:

  • Mark this issue or PR as fresh with /remove-lifecycle stale
  • Mark this issue or PR as rotten with /lifecycle rotten
  • Close this issue or PR with /close
  • Offer to help out with Issue Triage

Please send feedback to sig-contributor-experience at kubernetes/community.

/lifecycle stale

@k8s-ci-robot k8s-ci-robot added the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label May 1, 2022
@srm09
Contributor

srm09 commented May 10, 2022

/close
This has been implemented as the multi-AZ feature using the VSphereDeploymentZone and VSphereFailureDomain CRDs.

@k8s-ci-robot
Contributor

@srm09: Closing this issue.

In response to this:

/close
This has been implemented as the multi-AZ feature using the VSphereDeploymentZone and VSphereFailureDomain CRDs.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

@sujeet-banerjee

Does VSphereDeploymentZone tie the machines (VMs) to specific ESXi hosts? As I understand it, ESXi hosts may be added or taken down at the will of the end users within a vSphere cluster. I'm not sure it's a good idea to tie/specify host details in the affinity definition (Machine/MachineSet CRDs).
