Default Pod Topology Spread Constraints¶
Important
- This guardrail is enabled by default as since Welkin Kubespray v2.25.0-ck8s1 and Welkin Cluster API v0.3.0.
Problem¶
A healthy security posture requires you to ensure your application tolerates failures. This involves two things:
- Running your application replicated with at least two Pods. Specifically, this implies that your Deployment has
.spec.replicas
of at least 2. - Ensuring that the Pods are spread across failure domains. The latter was usually achieved by setting correct
topologySpreadConstraints
.
In Welkin, dealing with (1) above is still the Application Developer's responsibility.
However, with Welkin, you don't need to deal with (2).
Solution¶
Welkin comes with strong cluster-level default topologySpreadConstraints
.
Single-Zone Clusters¶
If your Cluster is hosted on a single zone, then your administrator will have configured the following default topologySpreadConstraints
:
- maxSkew: 1
topologyKey: kubernetes.io/hostname
whenUnsatisfiable: ScheduleAnyway
This means that the Kubernetes scheduler will try to spread Pods of the same Deployment across Nodes. If this is not possible, it will still try to run the Pod on any Node.
This implies that your application is more likely to tolerate a Node going down.
Multi-Zone Clusters¶
If your Cluster is hosted on at least three zones in the same region, then your administrator will have configured the following default topologySpreadConstraints
:
- maxSkew: 1
topologyKey: topology.kubernetes.io/zone
whenUnsatisfiable: ScheduleAnyway
This means that the Kubernetes scheduler will try to spread Pods of the same Deployment across zones. If this is not possible, it will still try to run the Pod on any zone.
This implies that your application is more likely to tolerate a zone going down.
What if I need to customize my topologySpreadConstraints
?¶
Simply override this in your application Helm Chart. The user demo provides an example on how to achieve this.