* Auto-join support for IPv6 discovery
The go-discover library returns IP addresses, not URLs. It just so
happens that url.Parse (net/url) accepts "127.0.0.1", even though that
isn't a valid URL. Instead, we construct the URL ourselves, taking care
to check whether the address is IPv6 and, if so, to put it in explicit
(bracketed) form.
Fixes #12323
* feedback: addrs & ipv6 test
Rename addrs to clusterIPs to improve clarity and intent
Tighten up our IPv6 address detection to be more correct and to ensure
the address is actually in explicit form
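A minimal sketch of the approach described above, under stated assumptions: `joinURL` and its parameters are illustrative stand-ins, not the actual Vault code.

```go
package main

import (
	"fmt"
	"net"
)

// joinURL builds a cluster join URL from an IP address returned by
// go-discover. IPv6 addresses must be wrapped in brackets (explicit
// form) before a port can be appended.
func joinURL(scheme, ip string, port int) string {
	host := ip
	if parsed := net.ParseIP(ip); parsed != nil && parsed.To4() == nil {
		host = "[" + ip + "]" // explicit IPv6 form
	}
	return fmt.Sprintf("%s://%s:%d", scheme, host, port)
}

func main() {
	fmt.Println(joinURL("https", "127.0.0.1", 8200))   // https://127.0.0.1:8200
	fmt.Println(joinURL("https", "2001:db8::1", 8200)) // https://[2001:db8::1]:8200
}
```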
* k8s doc: update for 0.9.1 and 0.8.0 releases (#10825)
* k8s doc: update for 0.9.1 and 0.8.0 releases
* Update website/content/docs/platform/k8s/helm/configuration.mdx
Co-authored-by: Theron Voran <tvoran@users.noreply.github.com>
* Autopilot initial commit
* Move autopilot related backend implementations to its own file
* Abstract promoter creation
* Add nil check for health
* Add server state oss no-ops
* Config ext stub for oss
* Make way for non-voters
* s/health/state
* s/ReadReplica/NonVoter
* Add synopsis and description
* Remove struct tags from AutopilotConfig
* Use var for config storage path
* Handle nil config when reading
* Enable testing autopilot by using inmem cluster
* First passing test
* Only report the server as known if it is present in raft config
* Autopilot defaults to on for all existing and new clusters
* Add locking to some functions
* Persist initial config
* Clarify the command usage doc
* Add health metric for each node
* Fix audit logging issue
* Don't set DisablePerformanceStandby to true in test
* Use node id label for health metric
* Log updates to autopilot config
* Less aggressively consume config loading failures
* Return a mutable config
* Return early from known servers if the raft config cannot be pulled
* Update metrics name
* Reduce log level for potentially noisy log
* Add knob to disable autopilot
* Don't persist if default config is in use
* Autopilot: Dead server cleanup (#10857)
* Dead server cleanup
* Initialize channel in any case
* Fix a bunch of tests
* Fix panic
* Add follower locking in heartbeat tracker
* Add LastContactFailureThreshold to config
* Add log when marking node as dead
* Update follower state locking in heartbeat tracker
* Avoid follower states being nil
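To illustrate the follower-state locking and nil-guarding these commits refer to, here is a minimal sketch; the `FollowerStates` type and its fields are hypothetical stand-ins for Vault's internals.

```go
package autopilot

import (
	"sync"
	"time"
)

// FollowerState holds per-node heartbeat data; stored by pointer so
// updates don't copy the struct.
type FollowerState struct {
	LastHeartbeat time.Time
}

// FollowerStates guards the follower map with a lock so the heartbeat
// tracker and autopilot can access it concurrently.
type FollowerStates struct {
	mu        sync.RWMutex
	followers map[string]*FollowerState
}

// Update records a heartbeat, lazily initializing the map so callers
// never dereference a nil follower-states map.
func (s *FollowerStates) Update(nodeID string) {
	s.mu.Lock()
	defer s.mu.Unlock()
	if s.followers == nil {
		s.followers = make(map[string]*FollowerState)
	}
	f, ok := s.followers[nodeID]
	if !ok {
		f = &FollowerState{}
		s.followers[nodeID] = f
	}
	f.LastHeartbeat = time.Now()
}
```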
* Pull test to its own file
* Add execution status to state response
* Optionally enable autopilot in some tests
* Updates
* Added API function to fetch autopilot configuration
* Add test for default autopilot configuration
* Configuration tests
* Add State API test
* Update test
* Added TestClusterOptions.PhysicalFactoryConfig
* Update locking
* Adjust locking in heartbeat tracker
* s/last_contact_failure_threshold/left_server_last_contact_threshold
* Add disabling autopilot as a core config option
* Disable autopilot in some tests
* s/left_server_last_contact_threshold/dead_server_last_contact_threshold
* Set the last heartbeat of followers to now when setting up the active node
* Don't use config defaults from CLI command
* Remove config file support
* Remove HCL test as well
* Persist only supplied config; merge supplied config with default to operate
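The persist/merge split might look like this sketch; the `AutopilotConfig` fields are hypothetical, chosen to echo knobs named in nearby commits.

```go
package autopilot

import "time"

type AutopilotConfig struct {
	DeadServerLastContactThreshold time.Duration
	MinQuorum                      uint
}

// mergeConfig overlays operator-supplied values on the defaults. Only the
// supplied config is persisted, so defaults can evolve across upgrades
// unless explicitly overridden.
func mergeConfig(defaults, supplied *AutopilotConfig) *AutopilotConfig {
	merged := *defaults
	if supplied == nil {
		return &merged
	}
	if supplied.DeadServerLastContactThreshold != 0 {
		merged.DeadServerLastContactThreshold = supplied.DeadServerLastContactThreshold
	}
	if supplied.MinQuorum != 0 {
		merged.MinQuorum = supplied.MinQuorum
	}
	return &merged
}
```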
* Use pointer to structs for storing follower information
* Test update
* Retrieve non-voter status from the config bucket and set it up when a node comes up
* Manage desired suffrage
* Consider bucket being created already
* Move desired suffrage to its own entry
* s/DesiredSuffrageKey/LocalNodeConfigKey
* s/witnessSuffrage/recordSuffrage
* Fix test compilation
* Handle local node config post a snapshot install
* Commit to storage first; then record suffrage in fsm
* No need to handle the local node config being nil case post snapshot restore
* Reconcile autopilot config when a new leader takes over duty
* Grab fsm lock when recording suffrage
* s/Suffrage/DesiredSuffrage in FollowerState
* Instantiate autopilot only in leader
* Default to old ways in more scenarios
* Make API gracefully handle 404
* Address some feedback
* Make IsDead an atomic.Value
* Simplify follower heartbeat tracking
* Use uber.atomic
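The IsDead change, sketched with go.uber.org/atomic as the commits indicate; the surrounding struct is illustrative.

```go
package autopilot

import "go.uber.org/atomic"

// Follower marks liveness with an atomic bool so readers don't need to
// take the follower-states lock just to check whether a node is dead.
type Follower struct {
	IsDead *atomic.Bool
}

func newFollower() *Follower {
	return &Follower{IsDead: atomic.NewBool(false)}
}

func (f *Follower) markDead()  { f.IsDead.Store(true) }
func (f *Follower) dead() bool { return f.IsDead.Load() }
```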
* Don't have multiple causes for having autopilot disabled
* Don't remove node from follower states if we fail to remove the dead server
* Autopilot server removals map (#11019)
* Don't remove node from follower states if we fail to remove the dead server
* Use map to track dead server removals
* Use lock and map
* Use delegate lock
* Adjust when to remove entry from map
* Only hold the lock while accessing map
* Fix race
* Don't set default min_quorum
* Fix test
* Ensure follower states is not nil before starting autopilot
* Fix race
Co-authored-by: Jason O'Donnell <2160810+jasonodonnell@users.noreply.github.com>
Co-authored-by: Theron Voran <tvoran@users.noreply.github.com>
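The removals map from #11019, roughly: a node stays in the follower states until its raft removal actually succeeds, and the delegate lock is held only around map access so the (slow) removal itself runs unlocked. A sketch with illustrative names:

```go
package autopilot

import "sync"

type Delegate struct {
	dl               sync.Mutex
	inflightRemovals map[string]bool
}

// RemoveDeadServer deduplicates concurrent removal attempts for the same
// node and keeps the node tracked if the removal fails.
func (d *Delegate) RemoveDeadServer(id string, remove func(string) error) error {
	d.dl.Lock()
	if d.inflightRemovals == nil {
		d.inflightRemovals = make(map[string]bool)
	}
	if d.inflightRemovals[id] {
		d.dl.Unlock()
		return nil // a removal for this node is already in flight
	}
	d.inflightRemovals[id] = true
	d.dl.Unlock()

	defer func() {
		d.dl.Lock()
		delete(d.inflightRemovals, id)
		d.dl.Unlock()
	}()

	if err := remove(id); err != nil {
		// Leave the node in follower states; autopilot will retry later.
		return err
	}
	// Only now is it safe to drop the node from follower states.
	return nil
}
```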
* raft: initial work on raft ha storage support
* add note on join
* add todo note
* raft: add support for bootstrapping and joining existing nodes
* raft: gate bootstrap join by reading leader api address from storage
* raft: properly check for raft-only in certain conditionals
* raft: add bootstrap to api and cli
* raft: fix bootstrap cli command
* raft: add test for setting up new cluster with raft HA
* raft: extend TestRaft_HA_NewCluster to include inmem and consul backends
* raft: add test for updating an existing cluster to use raft HA
* raft: remove debug log lines, clean up verifyRaftPeers
* raft: minor cleanup
* raft: minor cleanup
* Update physical/raft/raft.go
Co-authored-by: Brian Kassouf <briankassouf@users.noreply.github.com>
* Update vault/ha.go
Co-authored-by: Brian Kassouf <briankassouf@users.noreply.github.com>
* Update vault/ha.go
Co-authored-by: Brian Kassouf <briankassouf@users.noreply.github.com>
* Update vault/logical_system_raft.go
Co-authored-by: Brian Kassouf <briankassouf@users.noreply.github.com>
* Update vault/raft.go
Co-authored-by: Brian Kassouf <briankassouf@users.noreply.github.com>
* Update vault/raft.go
Co-authored-by: Brian Kassouf <briankassouf@users.noreply.github.com>
* address feedback comments
* address feedback comments
* raft: refactor tls keyring logic
* address feedback comments
* Update vault/raft.go
Co-authored-by: Alexander Bezobchuk <alexanderbez@users.noreply.github.com>
* Update vault/raft.go
Co-authored-by: Alexander Bezobchuk <alexanderbez@users.noreply.github.com>
* address feedback comments
* testing: fix import ordering
* raft: rename var, cleanup comment line
* docs: remove ha_storage restriction note on raft
* docs: more raft HA interaction updates with migration and recovery mode
* docs: update the raft join command
* raft: update comments
* raft: add missing isRaftHAOnly check for clearing out state set earlier
* raft: update a few ha_storage config checks
* Update command/operator_raft_bootstrap.go
Co-authored-by: Vishal Nayak <vishalnayak@users.noreply.github.com>
* raft: address feedback comments
* raft: fix panic when checking for config.HAStorage.Type
* Update vault/raft.go
Co-authored-by: Alexander Bezobchuk <alexanderbez@users.noreply.github.com>
* Update website/pages/docs/commands/operator/raft.mdx
Co-authored-by: Alexander Bezobchuk <alexanderbez@users.noreply.github.com>
* raft: remove bootstrap cli command
* Update vault/raft.go
Co-authored-by: Brian Kassouf <briankassouf@users.noreply.github.com>
* Update vault/raft.go
Co-authored-by: Brian Kassouf <briankassouf@users.noreply.github.com>
* raft: address review feedback
* raft: revert vendored sdk
* raft: don't send applied index and node ID info if we're HA-only
Co-authored-by: Brian Kassouf <briankassouf@users.noreply.github.com>
Co-authored-by: Alexander Bezobchuk <alexanderbez@users.noreply.github.com>
Co-authored-by: Vishal Nayak <vishalnayak@users.noreply.github.com>
* storage/raft: Advertise the configured cluster address
* Don't allow raft to start with unspecified IP
* Fix concurrent map write panic
* Add test file
* changelog++
* changelog++
* changelog++
* Update tcp_layer.go
* Update tcp_layer.go
* Only set the advertise addr if set
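The unspecified-IP guard could amount to the following sketch; the function name and error message are illustrative.

```go
package vault

import (
	"fmt"
	"net"
)

// validateClusterAddr rejects advertise addresses like 0.0.0.0:8201 or
// [::]:8201, which peers cannot dial.
func validateClusterAddr(addr string) error {
	host, _, err := net.SplitHostPort(addr)
	if err != nil {
		host = addr // no port present
	}
	if ip := net.ParseIP(host); ip != nil && ip.IsUnspecified() {
		return fmt.Errorf("cluster address %q is an unspecified IP; configure a routable address", addr)
	}
	return nil
}
```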
* storage/raft: Add committed and applied indexes to the status output
* Update api vendor
* changelog++
* Update http/sys_leader.go
Co-authored-by: Jim Kalafut <jkalafut@hashicorp.com>
* Raft retry join
* update
* Make retry join work with shamir seal
* Return upon context completion
* Update vault/raft.go
Co-Authored-By: Brian Kassouf <briankassouf@users.noreply.github.com>
* Address some review comments
* send leader information slice as a parameter
* Make retry join work properly with Shamir case. This commit has a blocking issue
* Fix join goroutine exiting before the job is done
* Polishing changes
* Don't return after a successful join during unseal
* Added config parsing test
* Add test and fix bugs
* minor changes
* Address review comments
* Fix build error
Co-authored-by: Brian Kassouf <briankassouf@users.noreply.github.com>
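Shape of the retry-join loop these commits converge on, under stated assumptions (`join`, `leaderAddrs`, and the interval are stand-ins for Vault's internals): keep cycling through candidate leaders until one join succeeds or the context is cancelled, so the goroutine neither exits before the job is done nor returns prematurely after unseal.

```go
package vault

import (
	"context"
	"time"
)

func retryJoin(ctx context.Context, leaderAddrs []string, join func(addr string) error) error {
	ticker := time.NewTicker(2 * time.Second)
	defer ticker.Stop()
	for {
		for _, addr := range leaderAddrs {
			if err := join(addr); err == nil {
				return nil // joined successfully
			}
		}
		select {
		case <-ctx.Done():
			return ctx.Err() // return upon context completion
		case <-ticker.C:
			// retry the full leader list
		}
	}
}
```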
* storage/raft: When restoring a snapshot preseal first
* best-effort: allow standbys to apply the restoreOp before sealing the active node
* Don't cache the raft tls key
* Update physical/raft/raft.go
* Move pending raft peers to core
* Fix race on close bool
* Extend the leaderlease time for tests
* Update raft deps
* Fix audit hashing
* Fix race with auditing
* Work on raft backend
* Add logstore locally
* Add encryptor and unsealable interfaces
* Add clustering support to raft
* Remove client and handler
* Bootstrap raft on init
* Cleanup raft logic a bit
* More raft work
* Work on TLS config
* More work on bootstrapping
* Fix build
* More work on bootstrapping
* More bootstrapping work
* fix build
* Remove consul dep
* Fix build
* merged oss/master into raft-storage
* Work on bootstrapping
* Get bootstrapping to work
* Clean up FSM and node-id
* Update local node ID logic
* Cleanup node-id change
* Work on snapshotting
* Raft: Add remove peer API (#906)
* Add remove peer API
* Add some comments
* Fix existing snapshotting (#909)
* Raft get peers API (#912)
* Read raft configuration
* address review feedback
* Use the Leadership Transfer API to step-down the active node (#918)
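hashicorp/raft exposes this API directly; a minimal sketch of the step-down (the wrapper is illustrative):

```go
package vault

import "github.com/hashicorp/raft"

// stepDown hands leadership to another voter rather than simply dropping
// it, avoiding a leaderless window while the active node steps down.
func stepDown(r *raft.Raft) error {
	future := r.LeadershipTransfer()
	return future.Error() // blocks until the transfer resolves
}
```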
* Raft join and unseal using Shamir keys (#917)
* Raft join using shamir
* Store AEAD instead of master key
* Split the raft join process to answer the challenge after a successful unseal
* get the follower to standby state
* Make unseal work
* minor changes
* Some input checks
* reuse the shamir seal access instead of new default seal access
* refactor joinRaftSendAnswer function
* Synchronously send answer in auto-unseal case
* Address review feedback
* Raft snapshots (#910)
* Fix existing snapshotting
* implement the noop snapshotting
* Add comments and switch log libraries
* add some snapshot tests
* add snapshot test file
* add TODO
* More work on raft snapshotting
* progress on the ConfigStore strategy
* Don't use two buckets
* Update the snapshot store logic to hide the file logic
* Add more backend tests
* Cleanup code a bit
* [WIP] Raft recovery (#938)
* Add recovery functionality
* remove fmt.Printfs
* Fix a few fsm bugs
* Add max size value for raft backend (#942)
* Add max size value for raft backend
* Include physical.ErrValueTooLarge in the message
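The size cap presumably guards Put before an entry reaches the raft log. A sketch: `RaftBackend` and `maxEntrySize` are illustrative names, while `physical.ErrValueTooLarge` is the sentinel the commit cites.

```go
package raftbackend

import (
	"context"
	"fmt"

	"github.com/hashicorp/vault/sdk/physical"
)

type RaftBackend struct {
	maxEntrySize uint64
}

// Put rejects oversized entries up front so they never hit the raft log,
// surfacing physical.ErrValueTooLarge in the returned error.
func (b *RaftBackend) Put(ctx context.Context, entry *physical.Entry) error {
	if uint64(len(entry.Key)+len(entry.Value)) > b.maxEntrySize {
		return fmt.Errorf("%s: got %d bytes, max is %d bytes",
			physical.ErrValueTooLarge, len(entry.Value), b.maxEntrySize)
	}
	// ... encode and apply the entry to raft ...
	return nil
}
```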
* Raft snapshot Take/Restore API (#926)
* Initial work on raft snapshot APIs
* Always redirect snapshot install/download requests
* More work on the snapshot APIs
* Cleanup code a bit
* On restore handle special cases
* Use the seal to encrypt the sha sum file
* Add sealer mechanism and fix some bugs
* Call restore while state lock is held
* Send restore cb trigger through raft log
* Make error messages nicer
* Add test helpers
* Add snapshot test
* Add shamir unseal test
* Add more raft snapshot API tests
* Fix locking
* Change wording to initialize
* Add underlying raw object to test cluster core
* Move leaderUUID to core
* Add raft TLS rotation logic (#950)
* Add TLS rotation logic
* Cleanup logic a bit
* Add/Remove from follower state on add/remove peer
* add comments
* Update more comments
* Update request_forwarding_service.proto
* Make sure we populate all nodes in the followerstate obj
* Update times
* Apply review feedback
* Add more raft config setting (#947)
* Add performance config setting
* Add more config options and fix tests
* Test Raft Recovery (#944)
* Test raft recovery
* Leave out a node during recovery
* remove unused struct
* Update physical/raft/snapshot_test.go
* Update physical/raft/snapshot_test.go
* fix vendoring
* Switch to new raft interface
* Remove unused files
* Switch a gogo -> proto instance
* Remove unneeded vault dep in go.sum
* Update helper/testhelpers/testhelpers.go
Co-Authored-By: Calvin Leung Huang <cleung2010@gmail.com>
* Update vault/cluster/cluster.go
* track active key within the keyring itself (#6915)
* track active key within the keyring itself
* lookup and store using the active key ID
* update docstring
* minor refactor
* Small text fixes (#6912)
* Update physical/raft/raft.go
Co-Authored-By: Calvin Leung Huang <cleung2010@gmail.com>
* review feedback
* Move raft logical system into separate file
* Update help text a bit
* Enforce cluster addr is set and use it for raft bootstrapping
* Fix tests
* fix http test panic
* Pull in latest raft-snapshot library
* Add comment