open-vault/website/source/docs/enterprise/replication/index.html.md

---
layout: "docs"
page_title: "Vault Enterprise Replication"
sidebar_current: "docs-vault-enterprise-replication"
description: |-
  Vault Enterprise has support for Replication, allowing critical data to be replicated across clusters to support horizontally scaling and disaster recovery workloads.

---

# Vault Replication

## Overview

Many organizations have infrastructure that spans multiple datacenters. Vault
provides the critical services of identity management, secrets storage, and
policy management.  This functionality is expected to be highly available and
to scale as the number of clients and their functional needs increase; at the
same time, operators would like to ensure that a common set of policies are
enforced globally, and a consistent set of secrets and keys are exposed to
applications that need to interoperate. 

Vault replication addresses both of these needs in providing consistency, 
scalability, and highly-available disaster recovery. 

## Architecture

The core unit of Vault replication is a **cluster**, which is comprised of a 
collection of Vault nodes (an active and its corresponding HA nodes). Multiple Vault 
clusters communicate in a one-to-many near real-time flow.

Replication operates on a leader/follower model, wherein a leader cluster (known as a 
**primary**) is linked to a series of follower **secondary** clusters. The primary 
cluster acts as the system of record and asynchronously replicates most Vault data.

All communication between primaries and secondaries is end-to-end encrypted
with mutually-authenticated TLS sessions, setup via replication tokens which are
exchanged during bootstrapping.

What data is replicated between the primary and secondary depends on the type of
replication that is configured between the primary and secondary. These types
of relationships are either **disaster recovery** or **performance**
relationships.

## Performance Replication and Disaster Recovery (DR) Replication

*Performance Replication*: 
In performance replication, secondaries keep track of their own tokens and leases 
but share the underlying configuration, policies, and supporting secrets (K/V values,
encryption keys for `transit`, etc). 

If a user action would modify underlying shared state, the secondary forwards the request 
to the primary to be handled; this is transparent to the client. In practice, most 
high-volume workloads (reads in the `kv` backend, encryption/decryption operations
in `transit`, etc.) can be satisfied by the local secondary, allowing Vault to scale
relatively horizontally with the number of secondaries rather than vertically as 
in the past.

*Disaster Recovery (DR) Replication*:
In disaster recovery (or DR) replication, secondaries share the same underlying configuration,
policy, and supporting secrets  (K/V values, encryption keys for `transit`, etc) infrastructure
as the primary. They also share the same token and lease infrastructure as the primary, as
they are designed to allow for continuous operations with applications connecting to the
original primary on the election of the DR secondary. 

DR is designed to be a mechanism to protect against catastrophic failure of entire clusters. 
They do not forward service read or write requests until they are elected and become a new primary. 

| Capability                                                                                                               	| Disaster Recovery 	| Performance                                                              	|
|--------------------------------------------------------------------------------------------------------------------------	|-------------------	|--------------------------------------------------------------------------	|
| Mirrors the secrets infrastructure of a primary cluster                                                                  	| Yes               	| Yes                                                                      	|
| Mirrors the configuration of a primary cluster’s backends (i.e.: auth backends, storage backends, secret backends, etc.) 	| Yes               	| Yes                                                                      	|
| Contains a local replica of secrets on the secondary and allows the secondary to forward writes                          	| No                	| Yes                                                                      	|
| Mirrors the token auth infrastructure for applications or users interacting with the primary cluster                     	| Yes               	| No. Upon promotion, applications must re-auth tokens with a new primary. 	|

For more information on the capabilities of performance and disaster recovery replication, see the Vault Replication [API Documentation](/api/system/replication.html).

## Internals

Details on the internal design of the replication feature can be found in the
[replication
internals](/docs/internals/replication.html)
document.

## Security Model

Vault is trusted all over the world to keep secrets safe. As such, we have put
extreme focus to detail to our replication model as well.

### Primary/Secondary Communication

When a cluster is marked as the primary it generates a self-signed CA
certificate. On request, and given a user-specified identifier, the primary
uses this CA certificate to generate a private key and certificate and packages
these, along with some other information, into a replication bootstrapping
bundle, a.k.a. a secondary activation token. The certificate is used to perform
TLS mutual authentication between the primary and that secondary.

This CA certificate is never shared with secondaries, and no secondary ever has
access to any other secondary’s certificate. In practice this means that
revoking a secondary’s access to the primary does not allow it continue
replication with any other machine; it also means that if a primary goes down,
there is full administrative control over which cluster becomes primary. An
attacker cannot spoof a secondary into believing that a cluster the attacker
controls is the new primary without also being able to administratively direct
the secondary to connect by giving it a new bootstrap package (which is an
ACL-protected call).

Vault makes use of Application Layer Protocol Negotiation on its cluster port.
This allows the same port to handle both request forwarding and replication,
even while keeping the certificate root of trust and feature set different.

### Secondary Activation Tokens

A secondary activation token is an extremely sensitive item and as such is
protected via response wrapping. Experienced Vault users will note that the
wrapping format for replication bootstrap packages is different from normal
response wrapping tokens: it is a signed JWT. This allows the replication token
to carry the redirect address of the primary cluster as part of the token. In
most cases this means that simply providing the token to a new secondary is
enough to activate replication, although this can also be overridden when the
token is provided to the secondary.

Secondary activation tokens should be treated like Vault root tokens. If
disclosed to a bad actor, that actor can gain access to all Vault data. It
should therefore be treated with utmost sensitivity.  Like all
response-wrapping tokens, once the token is used successfully (in this case, to
activate a secondary) it is useless, so it is only necessary to safeguard it
from one machine to the next.  Like with root tokens, HashiCorp recommends that
when a secondary activation token is live, there are multiple eyes on it from
generation until it is used.

Once a secondary is activated, its cluster information is stored safely behind
its encrypted barrier.

## Setup and Best Practices

A [setup guide](/guides/replication.html) is
available to help you get started; this guide also contains best practices
around operationalizing the replication feature.

## API

The Vault replication component has a full HTTP API. Please see the
[Vault Replication API](/api/system/replication.html) for more
details.
-												Vault_Enterprise_WWW (#2327)


											
										
										
											2017-03-15 18:31:14 +00:00
+								---
 								layout: "docs"
 								page_title: "Vault Enterprise Replication"
-												Fix layout for replication

											
										
										
											2017-03-16 10:50:33 +00:00
+								sidebar_current: "docs-vault-enterprise-replication"
-												Vault_Enterprise_WWW (#2327)


											
										
										
											2017-03-15 18:31:14 +00:00
+								description: |-
-												Update index.html.md

Updated replication docs for DR

											
										
										
											2017-08-15 02:02:02 +00:00
+								  Vault Enterprise has support for Replication, allowing critical data to be replicated across clusters to support horizontally scaling and disaster recovery workloads.
-												Vault_Enterprise_WWW (#2327)


											
										
										
											2017-03-15 18:31:14 +00:00
 								---
 								# Vault Replication
 								## Overview
 								Many organizations have infrastructure that spans multiple datacenters. Vault
 								provides the critical services of identity management, secrets storage, and
 								policy management.  This functionality is expected to be highly available and
 								to scale as the number of clients and their functional needs increase; at the
 								same time, operators would like to ensure that a common set of policies are
 								enforced globally, and a consistent set of secrets and keys are exposed to
-												Update index.html.md

Updated replication docs for DR

											
										
										
											2017-08-15 02:02:02 +00:00
+								applications that need to interoperate.
-												Vault_Enterprise_WWW (#2327)


											
										
										
											2017-03-15 18:31:14 +00:00
-												Update index.html.md

Updated replication docs for DR

											
										
										
											2017-08-15 02:02:02 +00:00
+								Vault replication addresses both of these needs in providing consistency,
 								scalability, and highly-available disaster recovery.
-												Vault_Enterprise_WWW (#2327)


											
										
										
											2017-03-15 18:31:14 +00:00
 								## Architecture
-												Update index.html.md

Updated replication docs for DR

											
										
										
											2017-08-15 02:02:02 +00:00
+								The core unit of Vault replication is a **cluster**, which is comprised of a
 								collection of Vault nodes (an active and its corresponding HA nodes). Multiple Vault
 								clusters communicate in a one-to-many near real-time flow.
-												Vault_Enterprise_WWW (#2327)


											
										
										
											2017-03-15 18:31:14 +00:00
-												Update index.html.md

Updated replication docs for DR

											
										
										
											2017-08-15 02:02:02 +00:00
+								Replication operates on a leader/follower model, wherein a leader cluster (known as a
 								**primary**) is linked to a series of follower **secondary** clusters. The primary
 								cluster acts as the system of record and asynchronously replicates most Vault data.
-												Vault_Enterprise_WWW (#2327)


											
										
										
											2017-03-15 18:31:14 +00:00
 								All communication between primaries and secondaries is end-to-end encrypted
-												Update index.html.md

Updated replication docs for DR

											
										
										
											2017-08-15 02:02:02 +00:00
+								with mutually-authenticated TLS sessions, setup via replication tokens which are
-												Vault_Enterprise_WWW (#2327)


											
										
										
											2017-03-15 18:31:14 +00:00
+								exchanged during bootstrapping.
-												Update index.html.md

Updated replication docs for DR

											
										
										
											2017-08-15 02:02:02 +00:00
+								What data is replicated between the primary and secondary depends on the type of
 								replication that is configured between the primary and secondary. These types
 								of relationships are either **disaster recovery** or **performance**
 								relationships.
 								## Performance Replication and Disaster Recovery (DR) Replication
 								*Performance Replication*:
 								In performance replication, secondaries keep track of their own tokens and leases
 								but share the underlying configuration, policies, and supporting secrets (K/V values,
 								encryption keys for `transit`, etc).
 								If a user action would modify underlying shared state, the secondary forwards the request
 								to the primary to be handled; this is transparent to the client. In practice, most
-												Rename "generic" secret backend to "kv" (#3292)


											
										
										
											2017-09-15 13:02:29 +00:00
+								high-volume workloads (reads in the `kv` backend, encryption/decryption operations
-												Update index.html.md

Updated replication docs for DR

											
										
										
											2017-08-15 02:02:02 +00:00
+								in `transit`, etc.) can be satisfied by the local secondary, allowing Vault to scale
 								relatively horizontally with the number of secondaries rather than vertically as
 								in the past.
 								*Disaster Recovery (DR) Replication*:
 								In disaster recovery (or DR) replication, secondaries share the same underlying configuration,
 								policy, and supporting secrets  (K/V values, encryption keys for `transit`, etc) infrastructure
 								as the primary. They also share the same token and lease infrastructure as the primary, as
 								they are designed to allow for continuous operations with applications connecting to the
 								original primary on the election of the DR secondary.
 								DR is designed to be a mechanism to protect against catastrophic failure of entire clusters.
 								They do not forward service read or write requests until they are elected and become a new primary.
 								| Capability                                                                                                               	| Disaster Recovery 	| Performance                                                              	|
 								|--------------------------------------------------------------------------------------------------------------------------	|-------------------	|--------------------------------------------------------------------------	|
 								| Mirrors the secrets infrastructure of a primary cluster                                                                  	| Yes               	| Yes                                                                      	|
 								| Mirrors the configuration of a primary cluster’s backends (i.e.: auth backends, storage backends, secret backends, etc.) 	| Yes               	| Yes                                                                      	|
 								| Contains a local replica of secrets on the secondary and allows the secondary to forward writes                          	| No                	| Yes                                                                      	|
 								| Mirrors the token auth infrastructure for applications or users interacting with the primary cluster                     	| Yes               	| No. Upon promotion, applications must re-auth tokens with a new primary. 	|
 								For more information on the capabilities of performance and disaster recovery replication, see the Vault Replication [API Documentation](/api/system/replication.html).
-												Vault_Enterprise_WWW (#2327)


											
										
										
											2017-03-15 18:31:14 +00:00
 								## Internals
 								Details on the internal design of the replication feature can be found in the
 								[replication
-												Use relative links

											
										
										
											2017-03-16 19:04:36 +00:00
+								internals](/docs/internals/replication.html)
-												Vault_Enterprise_WWW (#2327)


											
										
										
											2017-03-15 18:31:14 +00:00
+								document.
 								## Security Model
 								Vault is trusted all over the world to keep secrets safe. As such, we have put
 								extreme focus to detail to our replication model as well.
 								### Primary/Secondary Communication
 								When a cluster is marked as the primary it generates a self-signed CA
 								certificate. On request, and given a user-specified identifier, the primary
 								uses this CA certificate to generate a private key and certificate and packages
 								these, along with some other information, into a replication bootstrapping
 								bundle, a.k.a. a secondary activation token. The certificate is used to perform
 								TLS mutual authentication between the primary and that secondary.
 								This CA certificate is never shared with secondaries, and no secondary ever has
 								access to any other secondary’s certificate. In practice this means that
 								revoking a secondary’s access to the primary does not allow it continue
 								replication with any other machine; it also means that if a primary goes down,
 								there is full administrative control over which cluster becomes primary. An
 								attacker cannot spoof a secondary into believing that a cluster the attacker
 								controls is the new primary without also being able to administratively direct
 								the secondary to connect by giving it a new bootstrap package (which is an
 								ACL-protected call).
 								Vault makes use of Application Layer Protocol Negotiation on its cluster port.
 								This allows the same port to handle both request forwarding and replication,
 								even while keeping the certificate root of trust and feature set different.
 								### Secondary Activation Tokens
 								A secondary activation token is an extremely sensitive item and as such is
 								protected via response wrapping. Experienced Vault users will note that the
 								wrapping format for replication bootstrap packages is different from normal
 								response wrapping tokens: it is a signed JWT. This allows the replication token
 								to carry the redirect address of the primary cluster as part of the token. In
 								most cases this means that simply providing the token to a new secondary is
 								enough to activate replication, although this can also be overridden when the
 								token is provided to the secondary.
 								Secondary activation tokens should be treated like Vault root tokens. If
 								disclosed to a bad actor, that actor can gain access to all Vault data. It
 								should therefore be treated with utmost sensitivity.  Like all
 								response-wrapping tokens, once the token is used successfully (in this case, to
 								activate a secondary) it is useless, so it is only necessary to safeguard it
 								from one machine to the next.  Like with root tokens, HashiCorp recommends that
 								when a secondary activation token is live, there are multiple eyes on it from
 								generation until it is used.
 								Once a secondary is activated, its cluster information is stored safely behind
 								its encrypted barrier.
 								## Setup and Best Practices
-												Add rekeying guide & move guides to top-level (#2935)


											
										
										
											2017-06-29 13:43:43 +00:00
+								A [setup guide](/guides/replication.html) is
-												Vault_Enterprise_WWW (#2327)


											
										
										
											2017-03-15 18:31:14 +00:00
+								available to help you get started; this guide also contains best practices
 								around operationalizing the replication feature.
 								## API
-												minor docs update

											
										
										
											2017-04-10 13:46:25 +00:00
+								The Vault replication component has a full HTTP API. Please see the
-												/docs/http -> /api

											
										
										
											2017-03-17 18:06:03 +00:00
+								[Vault Replication API](/api/system/replication.html) for more
-												Reformat replication API

											
										
										
											2017-03-16 18:57:06 +00:00
+								details.