James Phillips
36a0abe10f
Merge pull request #2801 from hashicorp/spoken-hub-oss
...
Adds support for WAN soft fail and join flooding.
2017-03-20 16:24:07 -07:00
James Phillips
d68bd9cca0
Cleans up based on review feedback.
2017-03-20 16:23:40 -07:00
James Phillips
898b26524a
Adds a warning if the context is canceled.
2017-03-20 09:27:28 -07:00
James Phillips
5ee1256137
Converts the stats fetch from serial to parallel and snaps the last index.
2017-03-19 20:48:42 -07:00
James Phillips
cfc01419c8
Adds a stats fetcher to make sure we don't block the autopilot loop.
2017-03-17 18:42:28 -07:00
James Phillips
e49d5456aa
Lowers log spam from the join flooder.
2017-03-16 23:38:15 -07:00
James Phillips
44d3e207ed
Makes the flood goroutine more reusable.
2017-03-16 16:42:19 -07:00
James Phillips
9176e77111
Shuts down flooder when either Serf is shut down.
2017-03-16 16:42:19 -07:00
James Phillips
0cf9cf22ad
Fixes a unit test that needed to account for WAN flooding.
2017-03-16 16:42:19 -07:00
James Phillips
91a861337b
Adds LAN -> WAN join flooding.
2017-03-16 16:42:19 -07:00
James Phillips
1e5a442420
Walks back the changes to change pool address interface into strings.
2017-03-16 16:42:19 -07:00
James Phillips
dec485145e
Adds a hook for the route lookup function.
2017-03-16 16:42:19 -07:00
James Phillips
ad39f8a0f6
Cleans up some small things based on review feedback.
2017-03-16 16:42:19 -07:00
James Phillips
543389ad0a
Adds offline detection.
2017-03-16 16:42:19 -07:00
James Phillips
8cc06ec10d
Adds missing unit tests and cleans up some router bugs.
2017-03-16 16:42:19 -07:00
James Phillips
850ac50e99
Makes the raft_vsn optional in Serf tags to support older servers.
2017-03-16 16:42:19 -07:00
James Phillips
1cd3a8a107
Simplifies median calculation.
2017-03-16 16:42:19 -07:00
James Phillips
28f8aa5559
Removes remoteConsuls in favor of the new router.
...
This has the next wave of RTT integration with the router and also
factors some common RTT-related helpers out to lib. While we were
in here we also got rid of the coordinate disable config so we don't
need to deal with the complexity in the router (there was never a
user-visible way to disable coordinates).
2017-03-16 16:42:19 -07:00
James Phillips
746e735dce
Fixes snapshot endpoint after merge.
2017-03-16 16:42:18 -07:00
James Phillips
0269bd0e41
Cleans up after merge.
2017-03-16 16:42:18 -07:00
James Phillips
82b6fbd844
Adds router into RPC paths with work in progress on coordinates.
2017-03-16 16:42:18 -07:00
James Phillips
ef642c21b3
Adds an area request type.
2017-03-16 16:42:18 -07:00
James Phillips
838d85b7ae
Changes pool's dial address to a string and adds a timeout.
2017-03-16 16:42:18 -07:00
Kyle Havlovitz
f50ce2084a
Only count healthy voters for FailureTolerance
2017-03-16 12:19:16 -07:00
Kyle Havlovitz
9c7bba7903
Tweak last_contact health logic for leader
2017-03-15 19:57:54 -07:00
Kyle Havlovitz
bc0494e396
Reorganized cluster health check loop and logic
2017-03-15 18:27:17 -07:00
Kyle Havlovitz
bb98e39dd4
Add tests for servers changing address/ID
2017-03-15 16:50:42 -07:00
Kyle Havlovitz
c40279e012
Fix an issue with changing server IDs and add a few UX enhancements around autopilot features
2017-03-15 16:09:55 -07:00
Kyle Havlovitz
3c6bece859
Add autopilot guide to the docs
2017-03-10 14:55:18 -08:00
Kyle Havlovitz
4807215fa1
Merge pull request #2788 from hashicorp/f-autopilot-2
...
Autopilot server health monitoring
2017-03-10 12:29:45 -08:00
Kyle Havlovitz
b15d67bfac
Use defers for WaitGroup and Ticker stop
2017-03-10 12:29:03 -08:00
Kyle Havlovitz
8130f9b1c1
Cleaned up and reorganized some autopilot-related code
2017-03-09 18:21:40 -08:00
James Phillips
1e313e339c
Adds token to deregister request when reconciling.
...
Fixes #2792 .
2017-03-09 09:25:42 -08:00
Kyle Havlovitz
a5cbee0e99
Add AutopilotPolicy interface and BasicAutopilot
2017-03-08 12:26:58 -08:00
Kyle Havlovitz
a7de1e2a3b
Move RaftStats to Status endpoint
2017-03-07 13:58:06 -08:00
Kyle Havlovitz
8bcab6c6d7
Add autopilot server health tracking
...
This adds two goroutines to perform autopilot tasks on the leader - one
to monitor the health of servers and another to periodically clean up
dead servers with a limit on removal count. Also adds a new http endpoint,
`/v1/operator/autopilot/health`, for querying this information through an
operator RPC endpoint.
2017-03-06 16:00:10 -08:00
Kyle Havlovitz
23c492a74e
Rename DeadServerCleanup and make wording adjustments
2017-02-28 14:45:21 -08:00
Kyle Havlovitz
f7b6d776f4
Add cli docs and minor test/comment tweaks
2017-02-24 16:55:44 -08:00
Kyle Havlovitz
459e72b011
Merge branch 'master' into f-autopilot
2017-02-24 15:55:18 -08:00
Kyle Havlovitz
c9ddee1a79
Add CAS capability to autopilot config endpoint
2017-02-24 13:08:49 -08:00
James Phillips
3887c52cb2
Reserves an RPC selector byte for Consul Enterprise.
2017-02-24 09:54:33 -08:00
Kyle Havlovitz
56e22a719f
Add state store table and endpoints for autopilot
2017-02-23 20:32:13 -08:00
Kyle Havlovitz
ae9fce0ae0
Move raft_protocol out of autopilot config
2017-02-23 13:08:40 -08:00
Kyle Havlovitz
f9588b8d7f
Add raft version 2/3 compatibility
2017-02-22 12:53:32 -08:00
Kyle Havlovitz
2c9001a389
Add configurable cleanup of dead servers when a new server joins
2017-02-17 10:49:16 -08:00
Sean Chittenden
cea8a2f5d5
Round the node lookup prefix down to the nearest modulo two size before
...
performing the lookup.
Hat tip: @dadgar
2017-02-02 12:13:58 -08:00
Sean Chittenden
06db5bc353
Reduce the size of the UUID Lookup Length restriction from `8` to `2`.
...
I'm torn on this. It's useful from a UX perspective for an operator to
be able to type in something that's short. At the same time, by
enforcing an `8` character length, we reduced the probability of a user
depending on the behavior and having it suddenly stop working in the
future when a duplicate prefix is injected into the environment.
2017-02-02 12:12:18 -08:00
Kyle Havlovitz
fbcd43e95c
Merge pull request #2704 from hashicorp/f-relay-query-responses
...
Add relay-factor arg to keyring operations
2017-02-02 12:15:19 -05:00
Kyle Havlovitz
4fc3bd3abf
Added -relay-factor param to keyring operations
2017-02-01 21:53:29 -05:00
James Phillips
40454c999a
Adds a test for node UUID or name lookups.
2017-02-01 16:41:44 -08:00
Sean Chittenden
1c5d6eff76
Merge pull request #2702 from hashicorp/f-dns-nodeid
...
DNS lookup by Consul node ID
2017-02-01 16:23:18 -08:00
Sean Chittenden
f8b64ec5f8
Treat a uuid prefix lookup error as a soft error, as if a node name
...
lookup returned nil.
Add a TODO to note where a future point of logging should occur once a
logger is present and a few additional comments to explain the program
flow.
2017-02-01 16:09:25 -08:00
Sean Chittenden
6844ebb43b
Treat a uuid prefix lookup error as a soft error, as if a node name lookup returned nil.
...
Add a TODO to note where a future point of logging should occur once a
logger is present.
2017-02-01 15:51:25 -08:00
Sean Chittenden
1d9c5a3efb
Run a test of `NodeServices()` with a NodeID as an argument.
2017-02-01 15:41:10 -08:00
Sean Chittenden
b241cfc7fd
Whoops. Return an empty set in the event that there are multiple matches.
2017-02-01 15:18:00 -08:00
Sean Chittenden
324215c842
Rename `nodeName` to `nodeNameOrID`.
2017-02-01 14:59:24 -08:00
Sean Chittenden
19c2cd106a
Toggle `AllowMissing` to false to accommodate old clients without Node IDs.
2017-02-01 14:58:34 -08:00
Sean Chittenden
c9eea45b1c
Enable looking up consul nodes by their node ID.
...
Assuming the following output from a consul agent:
```
==> Consul agent running!
Version: 'v0.7.3-43-gc5e140c-dev (c5e140c+CHANGES)'
Node ID: '40e4a748-2192-161a-0510-9bf59fe950b5'
Node name: 'myhost'
```
it is now possible to lookup nodes by their Node Name or Node ID, or a
prefix match of the Node ID, with the following caveats re: the prefix
match:
1) first eight digits of the Node ID are a required minimum (eight was
chosen as an arbitrary number)
2) the length of the Node ID must be an even number or no result will be
returned.
```
% dig @127.0.0.1 -p 8600 myhost.node.dc1.consul.
myhost.node.dc1.consul. 0 IN A 127.0.0.1
% dig @127.0.0.1 -p 8600 40e4a748-2192-161a-0510-9bf59fe950b5.node.dc1.consul.
40e4a748-2192-161a-0510-9bf59fe950b5.node.dc1.consul. 0 IN A 127.0.0.1
% dig @127.0.0.1 -p 8600 40e4a748.node.dc1.consul.
40e4a748.node.dc1.consul. 0 IN A 127.0.0.1
% dig @127.0.0.1 -p 8600 40e4a74821.node.dc1.consul.
40e4a74821.node.dc1.consul. 0 IN A 127.0.0.1
% dig @127.0.0.1 -p 8600 40e4a748-21.node.dc1.consul.
40e4a748-21.node.dc1.consul. 0 IN A 127.0.0.1
```
2017-02-01 14:46:25 -08:00
Kyle Havlovitz
086975ac0a
Add TLSMinVersion to config options
2017-02-01 16:20:33 -05:00
Sean Chittenden
e39dd09bfa
Small premature optimization in `isUUID()`.
...
If the length isn't `36`, return `false` immediately before firing up
the regexp engine.
2017-02-01 11:00:06 -08:00
James Phillips
84a487526b
Tweaks leader test now that we have new wait timing.
2017-01-25 22:12:22 -08:00
James Phillips
d780c49bac
Keeps the old state store state if a restore fails.
2017-01-25 19:42:34 -08:00
James Phillips
418b627f4e
Bails out of blocking queries when a state restore occurs.
2017-01-25 19:00:32 -08:00
James Phillips
f286716899
Tweaks a few comments.
2017-01-25 09:58:23 -08:00
James Phillips
8fcf695fa1
Pass state store pointer into the blocking query work function.
...
Previously the blocking functions all closed over the state store from
their first query, with would not have worked properly when a restore
occurred. This makes sure they get a frest state store pointer each time,
and that pointer is synchronized with the abandon watch.
2017-01-25 09:58:23 -08:00
James Phillips
20ae4b6139
Guts all the old blocking query code.
2017-01-25 09:58:23 -08:00
James Phillips
943ca1b2d9
Cuts KVS endpoints over to new fine-grained watch plumbing.
2017-01-25 09:58:22 -08:00
James Phillips
c3a1014fbf
Adds a facility to notify when restores occur.
2017-01-25 09:58:22 -08:00
James Phillips
e59f398d80
Adds fine-grained watches to session endpoints.
2017-01-25 09:58:22 -08:00
James Phillips
997934c94f
Adds fine-grained watches to prepared query endpoints.
2017-01-25 09:58:22 -08:00
James Phillips
557df50be6
Adds fine-grained watches to internal endpoints.
2017-01-25 09:58:22 -08:00
James Phillips
e1dd427cf4
Adds fine-grained watches to coordinate endpoints.
2017-01-25 09:58:22 -08:00
James Phillips
2cbf45301e
Adds fine-grained watch support to ACL endpoints.
2017-01-25 09:58:22 -08:00
James Phillips
635afd1924
Adds fine-grained watches to health endpoints.
2017-01-25 09:58:22 -08:00
James Phillips
a604c438a2
Fixes a race condition when updating the state store during a snapshot restore.
2017-01-25 09:58:22 -08:00
James Phillips
0891b6c74d
Adds fine-grained watches to catalog endpoints.
2017-01-25 09:58:22 -08:00
James Phillips
5afa233d28
Adds diff check for node and service parts of register requests.
...
We always did an update before which caused excessive watch churn, even
with our new fine-grained queries. This does a diff any only updates the
node and service records if something actually changed.
2017-01-25 09:58:22 -08:00
James Phillips
40ef50c738
Don't do any watch tracking for non-blocking queries.
2017-01-25 09:58:22 -08:00
James Phillips
034d1960b6
Removes some incorrect comments.
...
We can't actually return a fine-grained index from these tables unless
support is added for tombstones. Otherwise, the index could slip backwards
as things are deleted.
2017-01-25 09:58:22 -08:00
James Phillips
085672a126
Adds new variant of blocking query wrapper with WatchSet support.
2017-01-25 09:58:22 -08:00
Kyle Havlovitz
a09b6ea8b4
Fix test import
2017-01-23 21:34:01 -05:00
Kyle Havlovitz
cc271aa436
Merge branch 'master' into f-prepared-query-nodemeta
2017-01-23 20:17:48 -05:00
Kyle Havlovitz
3cd054e81f
Add tests for node meta in prepared queries and update docs
2017-01-23 19:17:30 -05:00
James Phillips
9e8952c126
Merge pull request #2661 from hashicorp/f-node-id
...
Adds basic support for node IDs.
2017-01-18 16:02:38 -08:00
James Phillips
60c135dd67
Uses clean replies each time so they are safe to receive map changes.
2017-01-18 15:40:19 -08:00
James Phillips
abe7b33986
Fixes a startup ordering issue between Raft and Serf.
...
This fixes #2663 and fixes #1899 . It's not super related to this PR,
but the startup time changes that this PR brings made this a lot worse
so I was able to track it down.
2017-01-18 15:06:15 -08:00
James Phillips
55cd1d975c
Adds catalog support for node IDs.
2017-01-18 14:26:42 -08:00
Kyle Havlovitz
fdf2a10835
First pass at adding node meta filter to prepared queries
2017-01-18 16:23:33 -05:00
James Phillips
96bff003b7
Adds basic support for node IDs.
2017-01-17 22:47:59 -08:00
Kyle Havlovitz
b3fdfb09d8
Minor formatting tweaks as a follow-up to #2654
2017-01-17 19:20:29 -05:00
Kyle Havlovitz
9080006a37
Add support for multiple metadata filters to remaining endpoints
...
Enabled multiple meta filters for /v1/catalog/nodes and /v1/catalog/services
2017-01-13 20:49:13 -05:00
Kyle Havlovitz
e37f25dc02
Add node metadata filtering to remaining health/catalog endpoints
2017-01-13 20:08:43 -05:00
James Phillips
c837d055cd
Breaks up the state store into several files.
2017-01-13 11:47:16 -08:00
Kyle Havlovitz
3620e5e199
Fix inconsistency in TestStateStore_ServicesByNodeMeta
2017-01-12 19:46:58 -05:00
Kyle Havlovitz
51a1834667
Add -node-meta to agent command line options
2017-01-11 16:09:04 -05:00
Kyle Havlovitz
d77890a011
Validate metadata config earlier and handle multiple filters
2017-01-11 15:12:03 -05:00
Kyle Havlovitz
6b5cf20b1c
Fix formatting
2017-01-09 13:49:33 -08:00
Kyle Havlovitz
aee766baba
Add meta key validations and more tests
2017-01-09 11:21:49 -08:00
Kyle Havlovitz
b25f4c7d35
Add tests for node metadata functionality
2017-01-05 17:21:56 -08:00
Kyle Havlovitz
2a423c6e2c
Add support for setting node metadata fields
2017-01-05 14:10:26 -08:00