open-consul/website/content/docs/k8s/architecture.mdx

---
layout: docs
page_title: Consul on Kubernetes Control Plane Architecture
description: >-
  When running on Kubernetes, Consul’s control plane architecture does not change significantly. Server agents are deployed as a StatefulSet with a persistent volume, while client agents run as a k8s DaemonSet with an exposed API port.
---


# Architecture

This topic describes the architecture, components, and resources associated with Consul deployments to Kubernetes. Consul employs the same architectural design on Kubernetes as it does with other platforms (see [Architecture](/docs/architecture)), but Kubernetes provides additional benefits that make operating a Consul cluster easier. 

Refer to the standard [production deployment guide](https://learn.hashicorp.com/consul/datacenter-deploy/deployment-guide) for important information, regardless of the deployment platform.

## Server Agents

The server agents are deployed as a `StatefulSet` and use persistent volume
claims to store the server state. This also ensures that the
[node ID](/docs/agent/config/config-files#node_id) is persisted so that servers
can be rescheduled onto new IP addresses without causing issues. The server agents
are configured with
[anti-affinity](https://kubernetes.io/docs/concepts/configuration/assign-pod-node/#affinity-and-anti-affinity)
rules so that they are placed on different nodes. A readiness probe is
configured that marks the pod as ready only when it has established a leader.

A Kubernetes `Service` is registered to represent the servers and exposes ports that are required to communicate to the Consul server pods.
The servers utilize the DNS address of this service to join a Consul cluster, without requiring any other access to the Kubernetes cluster. Additional consul servers may also utilize non-ready endpoints which are published by the Kubernetes service, so that servers can utilize the service for joining during bootstrap and upgrades.

Additionally, a **PodDisruptionBudget** is configured so the Consul server
cluster maintains quorum during voluntary operational events. The maximum
unavailable is `(n/2)-1` where `n` is the number of server agents.

-> **Note:** Kubernetes and Helm do not delete Persistent Volumes or Persistent
Volume Claims when a
[StatefulSet is deleted](https://kubernetes.io/docs/concepts/workloads/controllers/statefulset/#stable-storage),
so this must done manually when removing servers.

## Client Agents

The client agents are run as a **DaemonSet**. This places one agent
(within its own pod) on each Kubernetes node.
The clients expose the Consul HTTP API via a static port (8500)
bound to the host port. This enables all other pods on the node to connect
to the node-local agent using the host IP that can be retrieved via the
Kubernetes downward API. See
[accessing the Consul HTTP API](/docs/k8s/installation/install#accessing-the-consul-http-api)
for an example.

We do not use a **NodePort** Kubernetes service because requests to node ports get randomly routed
to any pod in the service and we need to be able to route directly to the Consul
client running on our node.

-> **Note:** There is no way to bind to a local-only
host port. Therefore, any other node can connect to the agent. This should be
considered for security. For a properly production-secured agent with TLS
and ACLs, this is safe.

We run Consul clients as a **DaemonSet** instead of running a client in each
application pod as a sidecar because this would turn
a pod into a "node" in Consul and also causes an explosion of resource usage
since every pod needs a Consul agent. Service registration should be handled via the
catalog syncing feature with Services rather than pods.

-> **Note:** Due to a limitation of anti-affinity rules with DaemonSets,
a client-mode agent runs alongside server-mode agents in Kubernetes. This
duplication wastes some resources, but otherwise functions perfectly fine.
-												docs: Consul K8s Overview update (#12575)

* docs: Consul K8s Overview update

Co-authored-by: trujillo-adam <47586768+trujillo-adam@users.noreply.github.com>

											
										
										
											2022-03-18 19:01:41 +00:00
+								---
 								layout: docs
-												/docs/k8s

											
										
										
											2022-09-14 22:26:14 +00:00
+								page_title: Consul on Kubernetes Control Plane Architecture
-												docs: Consul K8s Overview update (#12575)

* docs: Consul K8s Overview update

Co-authored-by: trujillo-adam <47586768+trujillo-adam@users.noreply.github.com>

											
										
										
											2022-03-18 19:01:41 +00:00
+								description: >-
-												Spacing and title fixes

											
										
										
											2022-09-16 15:28:32 +00:00
+								  When running on Kubernetes, Consul’s control plane architecture does not change significantly. Server agents are deployed as a StatefulSet with a persistent volume, while client agents run as a k8s DaemonSet with an exposed API port.
-												docs: Consul K8s Overview update (#12575)

* docs: Consul K8s Overview update

Co-authored-by: trujillo-adam <47586768+trujillo-adam@users.noreply.github.com>

											
										
										
											2022-03-18 19:01:41 +00:00
+								---
 								# Architecture
 								This topic describes the architecture, components, and resources associated with Consul deployments to Kubernetes. Consul employs the same architectural design on Kubernetes as it does with other platforms (see [Architecture](/docs/architecture)), but Kubernetes provides additional benefits that make operating a Consul cluster easier.
-												revert links to learn

											
										
										
											2022-09-06 15:35:01 +00:00
+								Refer to the standard [production deployment guide](https://learn.hashicorp.com/consul/datacenter-deploy/deployment-guide) for important information, regardless of the deployment platform.
-												docs: Consul K8s Overview update (#12575)

* docs: Consul K8s Overview update

Co-authored-by: trujillo-adam <47586768+trujillo-adam@users.noreply.github.com>

											
										
										
											2022-03-18 19:01:41 +00:00
 								## Server Agents
 								The server agents are deployed as a `StatefulSet` and use persistent volume
 								claims to store the server state. This also ensures that the
-												website: content updates for developer (#14419)

Co-authored-by: Ashlee Boyer <ashlee.boyer@hashicorp.com>
Co-authored-by: Ashlee M Boyer <43934258+ashleemboyer@users.noreply.github.com>
Co-authored-by: Tu Nguyen <im2nguyen@gmail.com>
Co-authored-by: Tu Nguyen <im2nguyen@users.noreply.github.com>
Co-authored-by: HashiBot <62622282+hashibot-web@users.noreply.github.com>
Co-authored-by: Kevin Wang <kwangsan@gmail.com>

											
										
										
											2022-09-14 22:45:42 +00:00
+								[node ID](/docs/agent/config/config-files#node_id) is persisted so that servers
-												docs: Consul K8s Overview update (#12575)

* docs: Consul K8s Overview update

Co-authored-by: trujillo-adam <47586768+trujillo-adam@users.noreply.github.com>

											
										
										
											2022-03-18 19:01:41 +00:00
+								can be rescheduled onto new IP addresses without causing issues. The server agents
 								are configured with
 								[anti-affinity](https://kubernetes.io/docs/concepts/configuration/assign-pod-node/#affinity-and-anti-affinity)
 								rules so that they are placed on different nodes. A readiness probe is
 								configured that marks the pod as ready only when it has established a leader.
-												docs: Fix spelling errors across site (#12973)


											
										
										
											2022-05-10 14:28:33 +00:00
+								A Kubernetes `Service` is registered to represent the servers and exposes ports that are required to communicate to the Consul server pods.
-												docs: Consul K8s Overview update (#12575)

* docs: Consul K8s Overview update

Co-authored-by: trujillo-adam <47586768+trujillo-adam@users.noreply.github.com>

											
										
										
											2022-03-18 19:01:41 +00:00
+								The servers utilize the DNS address of this service to join a Consul cluster, without requiring any other access to the Kubernetes cluster. Additional consul servers may also utilize non-ready endpoints which are published by the Kubernetes service, so that servers can utilize the service for joining during bootstrap and upgrades.
 								Additionally, a **PodDisruptionBudget** is configured so the Consul server
 								cluster maintains quorum during voluntary operational events. The maximum
 								unavailable is `(n/2)-1` where `n` is the number of server agents.
 								-> **Note:** Kubernetes and Helm do not delete Persistent Volumes or Persistent
 								Volume Claims when a
 								[StatefulSet is deleted](https://kubernetes.io/docs/concepts/workloads/controllers/statefulset/#stable-storage),
 								so this must done manually when removing servers.
 								## Client Agents
 								The client agents are run as a **DaemonSet**. This places one agent
 								(within its own pod) on each Kubernetes node.
 								The clients expose the Consul HTTP API via a static port (8500)
 								bound to the host port. This enables all other pods on the node to connect
 								to the node-local agent using the host IP that can be retrieved via the
 								Kubernetes downward API. See
 								[accessing the Consul HTTP API](/docs/k8s/installation/install#accessing-the-consul-http-api)
 								for an example.
 								We do not use a **NodePort** Kubernetes service because requests to node ports get randomly routed
 								to any pod in the service and we need to be able to route directly to the Consul
 								client running on our node.
 								-> **Note:** There is no way to bind to a local-only
 								host port. Therefore, any other node can connect to the agent. This should be
 								considered for security. For a properly production-secured agent with TLS
 								and ACLs, this is safe.
 								We run Consul clients as a **DaemonSet** instead of running a client in each
 								application pod as a sidecar because this would turn
 								a pod into a "node" in Consul and also causes an explosion of resource usage
 								since every pod needs a Consul agent. Service registration should be handled via the
 								catalog syncing feature with Services rather than pods.
 								-> **Note:** Due to a limitation of anti-affinity rules with DaemonSets,
 								a client-mode agent runs alongside server-mode agents in Kubernetes. This
 								duplication wastes some resources, but otherwise functions perfectly fine.