Reference Architecture

The following diagram depicts a production-grade reference architecture, recommended for high available & scale out data center environments:

Remarks:

Availability Zones: Spread Nest management appliances and worker nodes across different Availability Zones (Failure Domains) for high availability: either accross distinct Racks or distinct data centers.
- Be aware that etcd requires an odd number of instances to achieve quorum, so an odd-number of Availability Zones is recommended (for example three racks or data centers).
Group servers into Machine Pools and spread them across Availability Zones too; this allows to use them as Update Domains for graceful rollover during updates.
For CSI-Storage, we recommend using external IP-based storage arrays (significantly simpler and usually better supported by Kubernetes than FibreChannel setups)
Interconnect the servers, management appliances and storage arrays using a Network Fabric with Spine-Leaf Topology.
Use a dedicated site as backup location.

With Elastic Node Pools

The diagram above shows the bare metal foundation using Machines and Machine Pools, where each server is directly assigned to a cluster as a worker node. The following extends this setup with Elastic Node Pools, where the bare metal servers are grouped into a shared Machine Pool and clusters carve virtual workers from it:

Reference Architecture with Elastic Node Pools

Reference Architecture ​

With Elastic Node Pools ​

Reference Architecture

With Elastic Node Pools