open-vault

Author	SHA1	Message	Date
Hridoy Roy	049f2513e6	Initial Diagnose Command for TLS and Listener [VAULT-1896, VAULT-1899] (#11249 ) * sanity checks for tls config in diagnose * backup * backup * backup * added necessary tests * remove comment * remove parallels causing test flakiness * comments * small fix * separate out config hcl test case into new hcl file * newline * addressed comments * addressed comments * addressed comments * addressed comments * addressed comments * reload funcs should be allowed to be nil	2021-04-06 16:40:43 -07:00
Vishal Nayak	3e55e79a3f	Autopilot: Server Stabilization, State and Dead Server Cleanup (#10856 ) * k8s doc: update for 0.9.1 and 0.8.0 releases (#10825) * k8s doc: update for 0.9.1 and 0.8.0 releases * Update website/content/docs/platform/k8s/helm/configuration.mdx Co-authored-by: Theron Voran <tvoran@users.noreply.github.com> Co-authored-by: Theron Voran <tvoran@users.noreply.github.com> * Autopilot initial commit * Move autopilot related backend implementations to its own file * Abstract promoter creation * Add nil check for health * Add server state oss no-ops * Config ext stub for oss * Make way for non-voters * s/health/state * s/ReadReplica/NonVoter * Add synopsis and description * Remove struct tags from AutopilotConfig * Use var for config storage path * Handle nin-config when reading * Enable testing autopilot by using inmem cluster * First passing test * Only report the server as known if it is present in raft config * Autopilot defaults to on for all existing and new clusters * Add locking to some functions * Persist initial config * Clarify the command usage doc * Add health metric for each node * Fix audit logging issue * Don't set DisablePerformanceStandby to true in test * Use node id label for health metric * Log updates to autopilot config * Less aggressively consume config loading failures * Return a mutable config * Return early from known servers if raft config is unable to be pulled * Update metrics name * Reduce log level for potentially noisy log * Add knob to disable autopilot * Don't persist if default config is in use * Autopilot: Dead server cleanup (#10857) * Dead server cleanup * Initialize channel in any case * Fix a bunch of tests * Fix panic * Add follower locking in heartbeat tracker * Add LastContactFailureThreshold to config * Add log when marking node as dead * Update follower state locking in heartbeat tracker * Avoid follower states being nil * Pull test to its own file * Add execution status to state response * Optionally enable autopilot in some tests * Updates * Added API function to fetch autopilot configuration * Add test for default autopilot configuration * Configuration tests * Add State API test * Update test * Added TestClusterOptions.PhysicalFactoryConfig * Update locking * Adjust locking in heartbeat tracker * s/last_contact_failure_threshold/left_server_last_contact_threshold * Add disabling autopilot as a core config option * Disable autopilot in some tests * s/left_server_last_contact_threshold/dead_server_last_contact_threshold * Set the lastheartbeat of followers to now when setting up active node * Don't use config defaults from CLI command * Remove config file support * Remove HCL test as well * Persist only supplied config; merge supplied config with default to operate * Use pointer to structs for storing follower information * Test update * Retrieve non voter status from configbucket and set it up when a node comes up * Manage desired suffrage * Consider bucket being created already * Move desired suffrage to its own entry * s/DesiredSuffrageKey/LocalNodeConfigKey * s/witnessSuffrage/recordSuffrage * Fix test compilation * Handle local node config post a snapshot install * Commit to storage first; then record suffrage in fsm * No need of local node config being nili case, post snapshot restore * Reconcile autopilot config when a new leader takes over duty * Grab fsm lock when recording suffrage * s/Suffrage/DesiredSuffrage in FollowerState * Instantiate autopilot only in leader * Default to old ways in more scenarios * Make API gracefully handle 404 * Address some feedback * Make IsDead an atomic.Value * Simplify follower hearbeat tracking * Use uber.atomic * Don't have multiple causes for having autopilot disabled * Don't remove node from follower states if we fail to remove the dead server * Autopilot server removals map (#11019) * Don't remove node from follower states if we fail to remove the dead server * Use map to track dead server removals * Use lock and map * Use delegate lock * Adjust when to remove entry from map * Only hold the lock while accessing map * Fix race * Don't set default min_quorum * Fix test * Ensure follower states is not nil before starting autopilot * Fix race Co-authored-by: Jason O'Donnell <2160810+jasonodonnell@users.noreply.github.com> Co-authored-by: Theron Voran <tvoran@users.noreply.github.com>	2021-03-03 13:59:50 -05:00
Scott Miller	08d8f65e01	Take the state lock in checkBarrierRotate, and don't save on seal (#11028 ) * Use the state lock, and don't bother a last minute check on seal * defer	2021-03-01 16:32:17 -06:00
Scott Miller	b13b27f37e	OSS side barrier encryption tracking and automatic rotation (#11007 ) * Automatic barrier key rotation, OSS portion * Fix build issues * Vendored version * Add missing encs field, not sure where this got lost.	2021-02-25 14:27:25 -06:00
Nick Cabatoff	c1ddfbb538	OSS parts of the new client controlled consistency feature (#10974 )	2021-02-24 06:58:10 -05:00
swayne275	e4119a6a8a	Vault-1403 Switch Expiration Manager to use Fairsharing Backpressure (#1709 ) (#10932 ) * basic pool and start testing * refactor a bit for testing * workFunc, start/stop safety, testing * cleanup function for worker quit, more tests * redo public/private members * improve tests, export types, switch uuid package * fix loop capture bug, cleanup * cleanup tests * update worker pool file name, other improvements * add job manager prototype * remove remnants * add functions to wait for job manager and worker pool to stop, other fixes * test job manager functionality, fix bugs * encapsulate how jobs are distributed to workers * make worker job channel read only * add job interface, more testing, fixes * set name for dispatcher * fix test races * wire up expiration manager most of the way * dispatcher and job manager constructors don't return errors * logger now dependency injected * make some members private, test fcn to get worker pool size * make GetNumWorkers public * Update helper/fairshare/jobmanager_test.go Co-authored-by: Brian Kassouf <briankassouf@users.noreply.github.com> * update fairsharing usage, add tests * make workerpool private * remove custom worker names * concurrency improvements * remove worker pool cleanup function * remove cleanup func from job manager, remove non blocking stop from fairshare * update job manager for new constructor * stop job manager when expiration manager stopped * unset env var after test * stop fairshare when started in tests * stop leaking job manager goroutine * prototype channel for waking up to assign work * fix typo/bug and add tests * improve job manager wake up, fix test typo * put channel drain back * better start/pause test for job manager * comment cleanup * degrade possible noisy log * remove closure, clean up context * improve revocation context timer * test: reduce number of revocation workers during many tests * Update vault/expiration.go Co-authored-by: Brian Kassouf <briankassouf@users.noreply.github.com> * feedback tweaks Co-authored-by: Brian Kassouf <briankassouf@users.noreply.github.com> Co-authored-by: Brian Kassouf <briankassouf@users.noreply.github.com>	2021-02-17 14:30:27 -08:00
Vishal Nayak	53cb1deb38	Revert "Read-replica instead of non-voter (#10875 )" (#10890 ) This reverts commit fc745670cf34821f5834357d9caebc3351dbc1e7.	2021-02-10 16:41:58 -05:00
Vishal Nayak	a2394e7353	Read-replica instead of non-voter (#10875 )	2021-02-10 09:58:18 -05:00
Brian Kassouf	275ca323e8	core: Record the time a node became active (#10489 ) * core: Record the time a node became active * Update vault/core.go Co-authored-by: Nick Cabatoff <ncabatoff@hashicorp.com> * Add omitempty field * Update vendor * Added CL entry and fixed test * Fix test * Fix command package tests Co-authored-by: Nick Cabatoff <ncabatoff@hashicorp.com>	2020-12-11 16:50:19 -08:00
Nick Cabatoff	b425be1a93	Fix race with test that mutates KeyRotateGracePeriod: make the global be a Core field instead. (#10512 )	2020-12-08 13:57:44 -05:00
swayne275	88eaf5f4c3	Fix Racy Activity Log Tests (#10484 ) * fix racy activity log tests and move testing utilities elsewhere * remove TODO * move SetEnable out of activity log * clarify not waiting on waitgroup * remove todo	2020-12-02 13:48:13 -07:00
Brian Kassouf	81a86f48e8	Backport some OSS changes (#10267 ) * Backport some OSS changes * go mod vendor	2020-10-29 16:47:34 -07:00
Nick Cabatoff	0d6a929a4c	Same seal migration oss (#10224 ) * Refactoring and test improvements. * Support migrating from a given type of autoseal to that same type but with different parameters.	2020-10-23 14:16:04 -04:00
Aleksandr Bezobchuk	0d6a0ec589	Merge PR #10010 : Rate Limit Quotas: Allow Exempt Paths to be Configurable	2020-10-16 14:58:19 -04:00
Nick Cabatoff	66274607b7	OSS changes for enterprise automated snapshots (#10160 )	2020-10-16 14:57:11 -04:00
Brian Kassouf	84dbca38a1	Revert "Migrate internalshared out (#9727 )" (#10141 ) This reverts commit ee6391b691ac12ab6ca13c3912404f1d3a842bd6.	2020-10-13 16:38:21 -07:00
Jeff Mitchell	e6881c8147	Migrate internalshared out (#9727 ) * Migrate internalshared out * fix merge issue * fix merge issue * go mod vendor Co-authored-by: Brian Kassouf <bkassouf@hashicorp.com>	2020-10-12 11:56:24 -07:00
Mark Gritter	587ed7d499	Disable usage metrics on performance standby nodes. (#9966 )	2020-09-15 17:12:28 -05:00
Mark Gritter	1b2c20e07c	Merge activity log work to date on enterprise back into oss. (#9900 ) * Added stub class for activity logging. (#1435) * Define activity fragments and starter methods for manipulating them. (#1441)	2020-09-08 14:22:09 -05:00
ncabatoff	4134ef2e98	Ensure that perf standbys can perform seal migrations. (#9690 )	2020-08-10 08:35:57 -04:00
Rodrigo D. L	d0df8bfa21	adding new config flag disable_sentinel_trace (#9696 )	2020-08-10 06:23:44 -04:00
ncabatoff	b6fd378ee8	Make manualStepDownCh a 1-buffered channel to ensure StepDown actually steps down in tests. (#9622 )	2020-07-31 10:01:51 -04:00
ncabatoff	1154b36b56	Log sanitized config at startup and when it changes. (#9637 ) Co-authored-by: Aleksandr Bezobchuk <aleks.bezobchuk@gmail.com>	2020-07-30 13:15:00 -04:00
Alexander Bezobchuk	1e262e5648	Merge PR #9581 : Rate Limit Quota Headers	2020-07-29 15:15:05 -04:00
ncabatoff	003bccd16e	Eliminate global that caused race tests to fail in ent with an internal config setting. (#9604 )	2020-07-27 16:10:26 -04:00
ncabatoff	d2436a9c56	Make standbyStopCh atomic to avoid data races (#9539 )	2020-07-21 08:34:07 -04:00
Mike Jarmy	93ff4c098c	Add a lock to seal migration (#9485 ) * add a lock to seal migration * switch to CompareAndSwapInt32 * switch to uber go-atomic	2020-07-16 15:14:29 -04:00
Brian Kassouf	f8df68b673	seal: Fix issue migrating from Auto->Shamir and improve tests (#9430 ) * Fix issue migrating from Auto->Shamir and improve tests * Undo newline * fix panic in test * Fix test panic	2020-07-09 12:28:17 -07:00
Alexander Bezobchuk	f1534a0ed0	Add nil check for quota manager (#9379 ) * Add nil check for quota manager * Add missing nil checks	2020-07-01 18:14:33 -07:00
Scott Miller	a6f62359a9	Don't setup plugin reload on perf standbys (#9352 )	2020-06-30 17:32:06 -05:00
Mike Jarmy	4b2cdfee72	re-enable seal migration (#9351 ) Co-authored-by: Vishal Nayak <vishalnayak@users.noreply.github.com>	2020-06-30 18:21:18 -04:00
Scott Miller	ad292bec73	Fix wrong err return value in plugin reload status command (#9348 ) * Fix wrong return value (discovered when merging to ENT) * go.mod * go mod vendor * Add setup plugin reload hook * All reloads return something now	2020-06-30 13:33:30 -05:00
ncabatoff	d42ee4f7ef	Ensure "initialized" service registration tag is also present whenever Vault is unsealed, on both Consul and K8s (#8990 ) * Add the initialized tag to Consul registration for parity with k8s (and for easy automated testing). Ensure that whenever we flag Vault as unsealed, we also flag it as initialized. * Update API docs. Co-authored-by: Jason O'Donnell <2160810+jasonodonnell@users.noreply.github.com>	2020-06-29 16:02:49 -04:00
Vishal Nayak	6bd5674345	Reset quota manager during shutdown (#9331 )	2020-06-29 13:23:10 -04:00
Vishal Nayak	c6876fe00f	Resource Quotas: Rate Limiting (#9330 )	2020-06-26 17:13:16 -04:00
Mark Gritter	97d415d024	Token gauge metrics implementation. (#9239 ) * Token gauge metrics implementation. * Enable gauges only when interval is nonzero. * Added count by TTL * Yandle "in restore mode" error specifically. * Refactored initialization code for gauge collection processes. * Fixed for multiple namespaces. * Ability to disable individual gauges with environment variable. * changelog++	2020-06-23 18:36:24 -05:00
Calvin Leung Huang	c45bdca0b3	raft: add support for using backend for ha_storage (#9193 ) * raft: initial work on raft ha storage support * add note on join * add todo note * raft: add support for bootstrapping and joining existing nodes * raft: gate bootstrap join by reading leader api address from storage * raft: properly check for raft-only for certain conditionals * raft: add bootstrap to api and cli * raft: fix bootstrap cli command * raft: add test for setting up new cluster with raft HA * raft: extend TestRaft_HA_NewCluster to include inmem and consul backends * raft: add test for updating an existing cluster to use raft HA * raft: remove debug log lines, clean up verifyRaftPeers * raft: minor cleanup * raft: minor cleanup * Update physical/raft/raft.go Co-authored-by: Brian Kassouf <briankassouf@users.noreply.github.com> * Update vault/ha.go Co-authored-by: Brian Kassouf <briankassouf@users.noreply.github.com> * Update vault/ha.go Co-authored-by: Brian Kassouf <briankassouf@users.noreply.github.com> * Update vault/logical_system_raft.go Co-authored-by: Brian Kassouf <briankassouf@users.noreply.github.com> * Update vault/raft.go Co-authored-by: Brian Kassouf <briankassouf@users.noreply.github.com> * Update vault/raft.go Co-authored-by: Brian Kassouf <briankassouf@users.noreply.github.com> * address feedback comments * address feedback comments * raft: refactor tls keyring logic * address feedback comments * Update vault/raft.go Co-authored-by: Alexander Bezobchuk <alexanderbez@users.noreply.github.com> * Update vault/raft.go Co-authored-by: Alexander Bezobchuk <alexanderbez@users.noreply.github.com> * address feedback comments * testing: fix import ordering * raft: rename var, cleanup comment line * docs: remove ha_storage restriction note on raft * docs: more raft HA interaction updates with migration and recovery mode * docs: update the raft join command * raft: update comments * raft: add missing isRaftHAOnly check for clearing out state set earlier * raft: update a few ha_storage config checks * Update command/operator_raft_bootstrap.go Co-authored-by: Vishal Nayak <vishalnayak@users.noreply.github.com> * raft: address feedback comments * raft: fix panic when checking for config.HAStorage.Type * Update vault/raft.go Co-authored-by: Alexander Bezobchuk <alexanderbez@users.noreply.github.com> * Update website/pages/docs/commands/operator/raft.mdx Co-authored-by: Alexander Bezobchuk <alexanderbez@users.noreply.github.com> * raft: remove bootstrap cli command * Update vault/raft.go Co-authored-by: Brian Kassouf <briankassouf@users.noreply.github.com> * Update vault/raft.go Co-authored-by: Brian Kassouf <briankassouf@users.noreply.github.com> * raft: address review feedback * raft: revert vendored sdk * raft: don't send applied index and node ID info if we're HA-only Co-authored-by: Brian Kassouf <briankassouf@users.noreply.github.com> Co-authored-by: Alexander Bezobchuk <alexanderbez@users.noreply.github.com> Co-authored-by: Vishal Nayak <vishalnayak@users.noreply.github.com>	2020-06-23 12:04:13 -07:00
Scott Miller	0b9a40a64e	Add a simple sealed gauge, updated when seal status changes (#9177 ) * Add a simple unsealed gauge, updated when seal status changes	2020-06-17 09:50:28 -05:00
Mike Jarmy	4303790aae	Test pre-1.4 seal migration (#9085 ) * enable seal wrap in all seal migration tests * move adjustForSealMigration to vault package * fix adjustForSealMigration * begin working on new seal migration test * create shamir seal migration test * refactor testhelpers * add VerifyRaftConfiguration to testhelpers * stub out TestTransit * Revert "refactor testhelpers" This reverts commit 39593defd0d4c6fd79aedfd37df6298391abb9db. * get shamir test working again * stub out transit join * work on transit join * Revert "move resuable storage test to avoid creating import cycle" This reverts commit b3ff2317381a5af12a53117f87d1c6fbb093af6b. * remove debug code * initTransit now works with raft join * runTransit works with inmem * work on runTransit with raft * runTransit works with raft * get rid of dis-used test * cleanup tests * TestSealMigration_TransitToShamir_Pre14 * TestSealMigration_ShamirToTransit_Pre14 * split for pre-1.4 testing * add simple tests for transit and shamir * fix typo in test suite * debug wrapper type * test debug * test-debug * refactor core migration * Revert "refactor core migration" This reverts commit a776452d32a9dca7a51e3df4a76b9234d8c0c7ce. * begin refactor of adjustForSealMigration * fix bug in adjustForSealMigration * clean up tests * clean up core refactoring * fix bug in shamir->transit migration * remove unnecessary lock from setSealsForMigration() * rename sealmigration test package * use ephemeral ports below 30000 * simplify use of numTestCores	2020-06-11 15:07:59 -04:00
ncabatoff	fdba917b66	Fix feature flag persistence: we shouldn't have excluded dr primaries, they too must write feature flags. DR secondaries might not need depend on feature flags being there, but a DR primary could also be (or become) a perf primary. (#9148 )	2020-06-04 13:00:33 -04:00
Josh Black	6e92c8cbd2	Add a new "vault monitor" command (#8477 ) Add a new "vault monitor" command Co-authored-by: ncabatoff <ncabatoff@hashicorp.com> Co-authored-by: Calvin Leung Huang <cleung2010@gmail.com> Co-authored-by: Jeff Mitchell <jeffrey.mitchell@gmail.com>	2020-05-21 13:07:50 -07:00
Brian Kassouf	c8dde052f2	storage/raft: Advertise the configured cluster address (#9008 ) * storage/raft: Advertise the configured cluster address * Don't allow raft to start with unspecified IP * Fix concurrent map write panic * Add test file * changelog++ * changelog++ * changelog++ * Update tcp_layer.go * Update tcp_layer.go * Only set the adverise addr if set	2020-05-18 18:22:25 -07:00
Calvin Leung Huang	8cefbca1c9	Refactor service registration (#8976 ) * serivceregistration: refactor service registration logic to run later * move state check to the internal func * sr/kubernetes: update setInitialStateInternal godoc * sr/kubernetes: remove return in setInitialState * core/test: fix mockServiceRegistration * address review feedback	2020-05-15 11:06:58 -07:00
Jeff Mitchell	1d3d89e2aa	Create configutil and move some common config and setup functions there (#8362 )	2020-05-14 09:19:27 -04:00
Mark Gritter	bd766d7bae	Metrics wrapper that adds the cluster name as a label. (#8961 )	2020-05-12 21:00:59 -05:00
Dustin Decker	08571a0ac3	Add identity num_entities gauge metric (#8816 ) Signed-off-by: Dustin Decker <dustindecker@protonmail.com>	2020-04-23 19:29:42 -05:00
Calvin Leung Huang	df23b481a6	core: change rawConfig to be atomic.Value (#8755 ) This avoids SetConfig from having to grab a write lock which is called on a SIGHUP, and may block, along with a long-running requests that has a read lock held, any other operation that requires a state lock.	2020-04-16 16:34:46 -07:00
ncabatoff	5fe1ab766b	Add option to detect deadlocks in Core.stateLock using build tag `deadlock` (#8524 )	2020-03-10 16:01:20 -04:00
ncabatoff	e5721310ac	Add persistent feature flags to be used on enterprise non-primaries. (#8391 )	2020-02-19 18:06:53 -05:00
Jeff Mitchell	844b2c3a5d	Bump API/SDK and adapt to move from SDK stuff	2020-02-15 14:58:05 -05:00

1 2 3 4 5 ...

445 commits