open-vault/website/content/docs/concepts/integrated-storage/autopilot.mdx

---
layout: docs
page_title: Integrated Storage Autopilot
description: Learn about the autopilot subsystem of integrated raft storage in Vault.
---

# Autopilot

Autopilot enables automated workflows for managing Raft clusters. The current
feature set includes 3 main features: Server Stabilization, Dead Server Cleanup
and State API. These three features were introduced in Vault 1.7. The Enterprise
feature set includes 2 main features: Automated Upgrades and Redundancy Zones.
These two features were introduced in Vault 1.11.

## Server Stabilization

Server stabilization helps to retain the stability of the Raft cluster by safely
joining new voting nodes to the cluster. When a new voter node is joined to an
existing cluster, autopilot adds it as a non-voter instead, and waits for a
pre-configured amount of time to monitor it's health. If the node remains to be
healthy for the entire duration of stabilization, then that node will be
promoted as a voter. The server stabilization period can be tuned using
`server_stabilization_time` (see below).

## Dead Server Cleanup

Dead server cleanup automatically removes nodes deemed unhealthy from the
Raft cluster, avoiding the manual operator intervention. This feature can be
tuned using the `cleanup_dead_servers`, `dead_server_last_contact_threshold`,
and `min_quorum` (see below).

## State API

State API provides detailed information about all the nodes in the Raft cluster
in a single call. This API can be used for monitoring for cluster health.

### Follower Health

Follower node health is determined by 2 factors.

- Its ability to heartbeat to leader node at regular intervals. Tuned using
  `last_contact_threshold` (see below).
- Its ability to keep up with data replication from the leader node. Tuned using
  `max_trailing_logs` (see below).

### Default Configuration

By default, Autopilot will be enabled with clusters using Vault 1.7+,
although dead server cleanup is not enabled by default. Upgrade of
Raft clusters deployed with older versions of Vault will also transition to use
Autopilot automatically.

Autopilot exposes a [configuration
API](/api-docs/system/storage/raftautopilot#set-configuration) to manage its
behavior. Autopilot gets initialized with the following default values.

- `cleanup_dead_servers` - `false`

- `dead_server_last_contact_threshold` - `24h`

- `min_quorum` - This doesn't default to anything and will need to be set to at
  least 3 when `cleanup_dead_servers` is set as `true`.

- `max_trailing_logs` - `1000`

- `last_contact_threshold` - `10s`

- `server_stabilization_time` - `10s`


~> **Note**: Autopilot in Vault does similar things to what autopilot does in
[Consul](https://www.consul.io/). However, the configuration in these 2 systems
differ in terms of default values and thresholds; some additional parameters
might also show up in Vault in comparison to Consul as well. Autopilot in Vault
and Consul use different technical underpinnings requiring these differences, to
provide the autopilot functionality.

## Automated Upgrades

Automated Upgrades lets you automatically upgrade a cluster of Vault nodes to a new version as
updated server nodes join the cluster. Once the number of nodes on the new version is
equal to or greater than the number of nodes on the old version, Autopilot will promote
the newer versioned nodes to voters, demote the older versioned nodes to non-voters,
and initiate a leadership transfer from the older version leader to one of the newer
versioned nodes. After the leadership transfer completes, the older versioned non-voting
nodes can be removed from the cluster.

## Redundancy Zones

Redundancy Zones provide both scaling and resiliency benefits by deploying non-voting
nodes alongside voting nodes on a per availability zone basis. When using redundancy zones,
each zone will have exactly one voting node and as many additional non-voting nodes as desired.
If the voting node in a zone fails, a non-voting node will be automatically promoted to
voter. If an entire zone is lost, a non-voting node from another zone will be promoted to voter,
maintaining quorum. These non-voting nodes function not only as hot standbys, but also
increase read scalability.

## Replication

DR secondary and Performance secondary clusters have their own Autopilot configurations, managed
independently of their primary.

The [Autopilot API](/api/system/storage/raftautopilot) uses DR operation tokens for
authorization when executed against a DR secondary cluster.

## Tutorial

Refer to the following tutorials to learn more.

- [Integrated Storage Autopilot](https://learn.hashicorp.com/tutorials/vault/raft-autopilot)
- [Fault Tolerance with Redundancy Zones](https://learn.hashicorp.com/tutorials/vault/raft-redundancy-zones)
- [Automate Upgrades with Vault Enterprise](https://learn.hashicorp.com/tutorials/vault/raft-upgrade-automation)
Autopilot Docs (#11167) 2021-03-24 14:29:10 +00:00			`---`
			`layout: docs`
			`page_title: Integrated Storage Autopilot`
			`description: Learn about the autopilot subsystem of integrated raft storage in Vault.`
			`---`

			`# Autopilot`

			`Autopilot enables automated workflows for managing Raft clusters. The current`
			`feature set includes 3 main features: Server Stabilization, Dead Server Cleanup`
Autopilot enterprise docs (#15589) 2022-06-07 21:32:45 +00:00			`and State API. These three features were introduced in Vault 1.7. The Enterprise`
			`feature set includes 2 main features: Automated Upgrades and Redundancy Zones.`
			`These two features were introduced in Vault 1.11.`
Autopilot Docs (#11167) 2021-03-24 14:29:10 +00:00
			`## Server Stabilization`

			`Server stabilization helps to retain the stability of the Raft cluster by safely`
			`joining new voting nodes to the cluster. When a new voter node is joined to an`
			`existing cluster, autopilot adds it as a non-voter instead, and waits for a`
			`pre-configured amount of time to monitor it's health. If the node remains to be`
			`healthy for the entire duration of stabilization, then that node will be`
			`promoted as a voter. The server stabilization period can be tuned using`
			`server_stabilization_time` (see below).

			`## Dead Server Cleanup`

			`Dead server cleanup automatically removes nodes deemed unhealthy from the`
			`Raft cluster, avoiding the manual operator intervention. This feature can be`
			tuned using the `cleanup_dead_servers`, `dead_server_last_contact_threshold`,
			and `min_quorum` (see below).

			`## State API`

			`State API provides detailed information about all the nodes in the Raft cluster`
			`in a single call. This API can be used for monitoring for cluster health.`

			`### Follower Health`

			`Follower node health is determined by 2 factors.`
feat(website): migrates nav data format and updates docs pages (#11242) * migrates nav data format and updates docs pages * removes sidebar_title from content files 2021-04-06 17:49:04 +00:00
Autopilot Docs (#11167) 2021-03-24 14:29:10 +00:00			`- Its ability to heartbeat to leader node at regular intervals. Tuned using`
feat(website): migrates nav data format and updates docs pages (#11242) * migrates nav data format and updates docs pages * removes sidebar_title from content files 2021-04-06 17:49:04 +00:00			`last_contact_threshold` (see below).
Autopilot Docs (#11167) 2021-03-24 14:29:10 +00:00			`- Its ability to keep up with data replication from the leader node. Tuned using`
feat(website): migrates nav data format and updates docs pages (#11242) * migrates nav data format and updates docs pages * removes sidebar_title from content files 2021-04-06 17:49:04 +00:00			`max_trailing_logs` (see below).
Autopilot Docs (#11167) 2021-03-24 14:29:10 +00:00
			`### Default Configuration`

			`By default, Autopilot will be enabled with clusters using Vault 1.7+,`
			`although dead server cleanup is not enabled by default. Upgrade of`
			`Raft clusters deployed with older versions of Vault will also transition to use`
			`Autopilot automatically.`

			`Autopilot exposes a [configuration`
Fix broken links referencing to API docs (#14565) * Fix all '/api/' to '/api-docs/' * Minor fixes * Undo some of the unintentional changes 2022-03-18 01:14:48 +00:00			`API](/api-docs/system/storage/raftautopilot#set-configuration) to manage its`
Autopilot Docs (#11167) 2021-03-24 14:29:10 +00:00			`behavior. Autopilot gets initialized with the following default values.`

			- `cleanup_dead_servers` - `false`

			- `dead_server_last_contact_threshold` - `24h`

			- `min_quorum` - This doesn't default to anything and will need to be set to at
feat(website): migrates nav data format and updates docs pages (#11242) * migrates nav data format and updates docs pages * removes sidebar_title from content files 2021-04-06 17:49:04 +00:00			least 3 when `cleanup_dead_servers` is set as `true`.
Autopilot Docs (#11167) 2021-03-24 14:29:10 +00:00
			- `max_trailing_logs` - `1000`

			- `last_contact_threshold` - `10s`

			- `server_stabilization_time` - `10s`

Document autopilot config differences at a high level (#15000) Co-authored-by: Loann Le <84412881+taoism4504@users.noreply.github.com> 2022-07-11 21:37:44 +00:00
			`~> Note: Autopilot in Vault does similar things to what autopilot does in`
			`[Consul](https://www.consul.io/). However, the configuration in these 2 systems`
			`differ in terms of default values and thresholds; some additional parameters`
			`might also show up in Vault in comparison to Consul as well. Autopilot in Vault`
			`and Consul use different technical underpinnings requiring these differences, to`
			`provide the autopilot functionality.`

Autopilot enterprise docs (#15589) 2022-06-07 21:32:45 +00:00			`## Automated Upgrades`

			`Automated Upgrades lets you automatically upgrade a cluster of Vault nodes to a new version as`
			`updated server nodes join the cluster. Once the number of nodes on the new version is`
			`equal to or greater than the number of nodes on the old version, Autopilot will promote`
			`the newer versioned nodes to voters, demote the older versioned nodes to non-voters,`
			`and initiate a leadership transfer from the older version leader to one of the newer`
			`versioned nodes. After the leadership transfer completes, the older versioned non-voting`
			`nodes can be removed from the cluster.`

			`## Redundancy Zones`

			`Redundancy Zones provide both scaling and resiliency benefits by deploying non-voting`
			`nodes alongside voting nodes on a per availability zone basis. When using redundancy zones,`
			`each zone will have exactly one voting node and as many additional non-voting nodes as desired.`
			`If the voting node in a zone fails, a non-voting node will be automatically promoted to`
			`voter. If an entire zone is lost, a non-voting node from another zone will be promoted to voter,`
			`maintaining quorum. These non-voting nodes function not only as hot standbys, but also`
			`increase read scalability.`

Autopilot Docs (#11167) 2021-03-24 14:29:10 +00:00			`## Replication`

Docs/autopilot typo fix and DR autopilot clarification (#12596) * autopilot cleanup and parameter typo cleanup * clarify DR AP lives now * typo myself cleanup * trigger ci Co-authored-by: aphorise <aphorise@gmail.com> Co-authored-by: taoism4504 <loann@hashicorp.com> 2022-08-19 21:30:13 +00:00			`DR secondary and Performance secondary clusters have their own Autopilot configurations, managed`
Autopilot Docs (#11167) 2021-03-24 14:29:10 +00:00			`independently of their primary.`

Docs/autopilot typo fix and DR autopilot clarification (#12596) * autopilot cleanup and parameter typo cleanup * clarify DR AP lives now * typo myself cleanup * trigger ci Co-authored-by: aphorise <aphorise@gmail.com> Co-authored-by: taoism4504 <loann@hashicorp.com> 2022-08-19 21:30:13 +00:00			`The [Autopilot API](/api/system/storage/raftautopilot) uses DR operation tokens for`
			`authorization when executed against a DR secondary cluster.`
[Docs] Add relevant Learn tutorial links (#11440) * Add cross-links to matching Learn tutorials * Undo automatic format update by Atom * Fix a typo * Fix a grammartical error 2021-04-22 17:09:31 +00:00
Vault documentation: updated all references from Learn to Tutorial (#15514) * updated learn to tutorial * correct spelling 2022-05-20 01:04:46 +00:00			`## Tutorial`
[Docs] Add relevant Learn tutorial links (#11440) * Add cross-links to matching Learn tutorials * Undo automatic format update by Atom * Fix a typo * Fix a grammartical error 2021-04-22 17:09:31 +00:00
Add a note referring to automated upgrade (#16444) * Add a note referring to automated upgrade * Update website/content/docs/upgrading/upgrade-to-1.11.x.mdx Co-authored-by: Loann Le <84412881+taoism4504@users.noreply.github.com> * Update website/content/docs/upgrading/upgrade-to-1.11.x.mdx Co-authored-by: Loann Le <84412881+taoism4504@users.noreply.github.com> * Update website/content/docs/upgrading/upgrade-to-1.11.x.mdx Co-authored-by: Loann Le <84412881+taoism4504@users.noreply.github.com> Co-authored-by: Loann Le <84412881+taoism4504@users.noreply.github.com> 2022-07-25 22:03:55 +00:00			`Refer to the following tutorials to learn more.`

			`- [Integrated Storage Autopilot](https://learn.hashicorp.com/tutorials/vault/raft-autopilot)`
			`- [Fault Tolerance with Redundancy Zones](https://learn.hashicorp.com/tutorials/vault/raft-redundancy-zones)`
			`- [Automate Upgrades with Vault Enterprise](https://learn.hashicorp.com/tutorials/vault/raft-upgrade-automation)`