Kubernetes worker node upgrade should respect PodDisruptionBudget

See this idea on ideas.ibm.com

We are using Kubernetes PodDisruptionBudget to prevent voluntary disruptions of services. This should include the drain of worker nodes during worker node upgrade. While a manual kubectl drain fails in case of insufficient running replicas, IBM Cloud worker node upgrade ignores the PodDisruptionBudget and proceeds with the shutdown of the worker node.

Knowing that, I have to check every service regarding a defined PDB and whether the upgrade will cause a disruption in sense of PDB or not. The other option is to drain every node manually to see if it complies with the PDB.

So please make the automations for the worker node upgrade aware of the actual work load and respect the PDB and not only the ConfigMap ibm-cluster-update-configuration with the unavailability rules on worker node level.

Proposed solution:
provide a second workload aware upgrade option, so the user can choose (and can still use the existing forced upgrade option)

start draining the worker nodes as long as the worker node unavailability rules allow
if a drain fails, cancel the upgrade of the worker node and uncordon the worker node
proceed with remaining worker nodes as long as the worker node unavailability rules allow
inform user about failed drains and possible reasons (PDB, gracefully termination period) and solutions (just retry, manual drain, move pods, forced upgrade option)

Idea priority

High

Post comment

By clicking the "Post Comment" or "Submit Idea" button, you are agreeing to the IBM Ideas Portal Terms of Use.
Do not place IBM confidential, company confidential, or personal information into any field.

Shape the future of IBM!

Search existing ideas

Post your ideas

Specific links you will want to bookmark for future use

Kubernetes worker node upgrade should respect PodDisruptionBudget

Please enter your email address

RELATED IDEAS

Kubernetes worker node upgrade should respect PodDisruptionBudget