distributed provisioning: unset "selected-node" for nodes which have no driver running #544
Comments
This isn't ideal because "provisioning" will be started by the central provisioner for all nodes and then must be made to fail for those which do have a driver, which will emit additional events.
This is conceptually very similar to setting […]. If so, then this is probably the right solution for this issue because it avoids the problem entirely. There's a slight race (node has the right labels, is selected for a PVC, labels get removed, driver no longer runs -> PVC stuck), but that should be rare and can be documented as a caveat for admins.
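For reference, a minimal client-go sketch (an assumed helper, not code from external-provisioner) of how "does this node currently run the driver" can be checked via the node's CSINode object, which kubelet updates as the driver's node plugin registers and unregisters; the node and driver names are illustrative:

```go
// Package example: hypothetical helper for checking CSI driver registration.
package example

import (
	"context"

	metav1 "k8s.io/apimachinery/pkg/apis/meta/v1"
	"k8s.io/client-go/kubernetes"
)

// nodeHasDriver reports whether the named node's CSINode object lists the
// given CSI driver, i.e. whether the driver's node plugin is registered there.
func nodeHasDriver(ctx context.Context, cs kubernetes.Interface, nodeName, driverName string) (bool, error) {
	csiNode, err := cs.StorageV1().CSINodes().Get(ctx, nodeName, metav1.GetOptions{})
	if err != nil {
		return false, err
	}
	for _, d := range csiNode.Spec.Drivers {
		if d.Name == driverName {
			return true, nil
		}
	}
	return false, nil
}
```

Relying on CSINode instead of labels avoids the label-removal race mentioned above, since the entry disappears only when the node plugin actually unregisters.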
Issues go stale after 90d of inactivity. If this issue is safe to close now please do so with /close. Send feedback to sig-contributor-experience at kubernetes/community. /lifecycle stale
/remove-lifecycle stale
Issues go stale after 90d of inactivity. If this issue is safe to close now please do so with /close. Send feedback to sig-contributor-experience at kubernetes/community. /lifecycle stale
The Kubernetes project currently lacks enough active contributors to adequately respond to all issues and PRs. This bot triages issues and PRs according to the following rules:

- After 90d of inactivity, lifecycle/stale is applied
- After 30d of inactivity since lifecycle/stale was applied, lifecycle/rotten is applied
- After 30d of inactivity since lifecycle/rotten was applied, the issue is closed

You can:

- Mark this issue as fresh with /remove-lifecycle rotten
- Close this issue with /close
- Offer to help out with Issue Triage

Please send feedback to sig-contributor-experience at kubernetes/community. /lifecycle rotten
/remove-lifecycle rotten
The Kubernetes project currently lacks enough contributors to adequately respond to all issues and PRs. This bot triages issues and PRs according to the following rules:

- After 90d of inactivity, lifecycle/stale is applied
- After 30d of inactivity since lifecycle/stale was applied, lifecycle/rotten is applied
- After 30d of inactivity since lifecycle/rotten was applied, the issue is closed

You can:

- Mark this issue as fresh with /remove-lifecycle stale
- Close this issue with /close
- Offer to help out with Issue Triage

Please send feedback to sig-contributor-experience at kubernetes/community. /lifecycle stale
/remove-lifecycle stale
The Kubernetes project currently lacks enough contributors to adequately respond to all issues and PRs. This bot triages issues and PRs according to the following rules:

- After 90d of inactivity, lifecycle/stale is applied
- After 30d of inactivity since lifecycle/stale was applied, lifecycle/rotten is applied
- After 30d of inactivity since lifecycle/rotten was applied, the issue is closed

You can:

- Mark this issue as fresh with /remove-lifecycle stale
- Close this issue with /close
- Offer to help out with Issue Triage

Please send feedback to sig-contributor-experience at kubernetes/community. /lifecycle stale
/remove-lifecycle stale
Due to a bug in the scheduler a node with no driver instance might be picked and the volume is stuck in pending as the "no capacity -> reschedule" recovery is never triggered [[0]](kubernetes/kubernetes#122109), [[1]](kubernetes-csi/external-provisioner#544).

See #400

Co-authored-by: lukasmetzner <[email protected]>
Co-authored-by: Julian Tölle <[email protected]>
We modified the response for `NodeGetInfo` to return an additional Topology Segment. We assumed that this only “adds” new info, but in practice it breaks the spec. When trying to schedule a volume to nodes, the container orchestration system should verify that the node fulfills at least one accessible topology of the volume, where “fulfills” means that all supplied segments match. This is not implemented in the same way between Kubernetes and Nomad:

- **Kubernetes**: requirements are fulfilled if the volume specifies a subset of the node's topology
- **Nomad**: requirements are fulfilled if the volume specifies all of the node's topology

We made these changes to work around a bug in the Kubernetes scheduler ([here](kubernetes-csi/external-provisioner#544)) where nodes without the CSI plugin would still be considered for scheduling, but then creating and attaching the volume fails with no automatic reconciliation of this error.
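For illustration, here is a hedged sketch of what such a response looks like at the CSI level, using the CSI spec's Go bindings; the segment keys and values are made up, not the driver's actual labels:

```go
// Package driver: illustrative NodeGetInfo response with an extra topology segment.
package driver

import (
	csi "github.com/container-storage-interface/spec/lib/go/csi"
)

// nodeGetInfoResponse builds a NodeGetInfo response whose accessible topology
// contains one extra segment beyond the one used for placement decisions.
func nodeGetInfoResponse(nodeID string) *csi.NodeGetInfoResponse {
	return &csi.NodeGetInfoResponse{
		NodeId: nodeID,
		AccessibleTopology: &csi.Topology{
			Segments: map[string]string{
				// Segment the volumes actually request (illustrative key).
				"csi.example.com/location": "region-a",
				// Additional segment: Kubernetes still matches a volume that
				// only requests the location key (subset match), while Nomad
				// requires the volume to list every segment reported here.
				"csi.example.com/server-type": "shared",
			},
		},
	}
}
```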
When deploying external-provisioner alongside the CSI driver on each node, there is one problem: if the scheduler picks a node which has no driver instance, then the volume is stuck because the usual "no capacity -> reschedule" recovery is never triggered.
A custom scheduler extension and capacity tracking can minimize the risk, but cannot prevent this entirely.
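As a rough illustration only (not the provisioner's actual code), "unsetting selected-node" amounts to removing the `volume.kubernetes.io/selected-node` annotation from the stuck PVC so that scheduling can be retried; a minimal client-go sketch, with names and error handling simplified:

```go
// Package recovery: hypothetical helper that clears the selected-node annotation.
package recovery

import (
	"context"
	"fmt"

	metav1 "k8s.io/apimachinery/pkg/apis/meta/v1"
	"k8s.io/apimachinery/pkg/types"
	"k8s.io/client-go/kubernetes"
)

// Annotation set by the scheduler for volumes with delayed binding.
const selectedNodeAnnotation = "volume.kubernetes.io/selected-node"

// clearSelectedNode removes the annotation via a JSON merge patch (a null
// value deletes the key), allowing the PVC to be scheduled onto another node.
func clearSelectedNode(ctx context.Context, cs kubernetes.Interface, namespace, pvcName string) error {
	patch := []byte(fmt.Sprintf(`{"metadata":{"annotations":{%q:null}}}`, selectedNodeAnnotation))
	_, err := cs.CoreV1().PersistentVolumeClaims(namespace).Patch(
		ctx, pvcName, types.MergePatchType, patch, metav1.PatchOptions{})
	return err
}
```

A merge patch is used here instead of an update so that concurrent changes to other PVC fields are not overwritten.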
Possible solutions: