open-consul

Commit Graph

Author	SHA1	Message	Date
Daniel Nephin	96c4a35de7	Remove t.Name() from TestAgent.Name And re-add the name to the logger so that log messages from different agents in a single can be identified.	2020-03-30 16:47:24 -04:00
Matt Keeler	83426d7a9f	OSS Changes for agent local state namespace testing (#7250 )	2020-02-10 11:25:12 -05:00
Hans Hasselberg	107e8523a8	agent: ensure node info sync and full sync. (#7189 ) This fixes #7020. There are two problems this PR solves: * if the node info changes it is highly likely to get service and check registration permission errors unless those service tokens have node:write. Hopefully services you register don’t have this permission. * the timer for a full sync gets reset for every partial sync which means that many partial syncs are preventing a full sync from happening Instead of syncing node info last, after services and checks, and possibly saving one RPC because it is included in every service sync, I am syncing node info first. It is only ever going to be a single RPC that we are only doing when node info has changed. This way we are guaranteed to sync node info even when something goes wrong with services or checks which is more likely because there are more syncs happening for them.	2020-02-06 15:30:58 +01:00
R.B. Boyer	01ebdff2a9	various tweaks on top of the hclog work (#7165 )	2020-01-29 11:16:08 -06:00
Chris Piraino	3dd0b59793	Allow users to configure either unstructured or JSON logging (#7130 ) * hclog Allow users to choose between unstructured and JSON logging	2020-01-28 17:50:41 -06:00
Kit Patella	49e9bbbdf9	Add accessorID of token when ops are denied by ACL system (#7117 ) * agent: add and edit doc comments * agent: add ACL token accessorID to debugging traces * agent: polish acl debugging * agent: minor fix + string fmt over value interp * agent: undo export & fix logging field names * agent: remove note and migrate up to code review * Update agent/consul/acl.go Co-Authored-By: Matt Keeler <mkeeler@users.noreply.github.com> * agent: incorporate review feedback * Update agent/acl.go Co-Authored-By: R.B. Boyer <public@richardboyer.net> Co-authored-by: Matt Keeler <mkeeler@users.noreply.github.com> Co-authored-by: R.B. Boyer <public@richardboyer.net>	2020-01-27 11:54:32 -08:00
Chris Piraino	db36928faa	Fix segfault when removing both a service and associated check (#7108 ) * Fix segfault when removing both a service and associated check updateSyncState creates entries in the services and checks maps for remote services/checks that are not found locally, so that we can then make sure to delete them in our reconciliation process. However, the values added to the map are missing key fields that the rest of the code expects to not be nil. * Add comment stating Check field can be nil	2020-01-23 10:38:32 -06:00
Aestek	c35af89dfd	agent: do not deregister service checks twice (#6168 ) Deregistering a service from the catalog automatically deregisters its checks, however the agent still performs a deregister call for each service checks even after the service has been deregistered. With ACLs enabled this results in logs like: "message:consul: "Catalog.Deregister" RPC failed to server server_ip:8300: rpc error making call: rpc error making call: Unknown check 'check_id'" This change removes associated checks from the agent state when deregistering a service, which results in less calls to the servers and supresses the error logs.	2020-01-17 14:26:53 +01:00
Matt Keeler	442924c35a	Sync of OSS changes to support namespaces (#6909 )	2019-12-09 21:26:41 -05:00
Freddy	5eace88ce2	Expose HTTP-based paths through Connect proxy (#6446 ) Fixes: #5396 This PR adds a proxy configuration stanza called expose. These flags register listeners in Connect sidecar proxies to allow requests to specific HTTP paths from outside of the node. This allows services to protect themselves by only listening on the loopback interface, while still accepting traffic from non Connect-enabled services. Under expose there is a boolean checks flag that would automatically expose all registered HTTP and gRPC check paths. This stanza also accepts a paths list to expose individual paths. The primary use case for this functionality would be to expose paths for third parties like Prometheus or the kubelet. Listeners for requests to exposed paths are be configured dynamically at run time. Any time a proxy, or check can be registered, a listener can also be created. In this initial implementation requests to these paths are not authenticated/encrypted.	2019-09-25 20:55:52 -06:00
Sarah Adams	8e673371df	test: ensure all TestAgent constructions use a constructor (#6443 ) ensure all TestAgent constructions use a constructor to get start retries + test logs going to the right place Fixes #6435	2019-09-05 10:24:36 -07:00
Sarah Adams	f8fa10fecb	refactor & add better retry logic to NewTestAgent (#6363 ) Fixes #6361	2019-09-03 15:05:51 -07:00
Matt Keeler	31d9d2e557	Store primaries root in secondary after intermediate signature (#6333 ) * Store primaries root in secondary after intermediate signature This ensures that the intermediate exists within the CA root stored in raft and not just in the CA provider state. This has the very nice benefit of actually outputting the intermediate cert within the ca roots HTTP/RPC endpoints. This change means that if signing the intermediate fails it will not set the root within raft. So far I have not come up with a reason why that is bad. The secondary CA roots watch will pull the root again and go through all the motions. So as soon as getting an intermediate CA works the root will get set. * Make TestAgentAntiEntropy_Check_DeferSync less flaky I am not sure this is the full fix but it seems to help for me.	2019-08-30 11:38:46 -04:00
Mike Morris	88df658243	connect: remove managed proxies (#6220 ) * connect: remove managed proxies implementation and all supporting config options and structs * connect: remove deprecated ProxyDestination * command: remove CONNECT_PROXY_TOKEN env var * agent: remove entire proxyprocess proxy manager * test: remove all managed proxy tests * test: remove irrelevant managed proxy note from TestService_ServerTLSConfig * test: update ContentHash to reflect managed proxy removal * test: remove deprecated ProxyDestination test * telemetry: remove managed proxy note * http: remove /v1/agent/connect/proxy endpoint * ci: remove deprecated test exclusion * website: update managed proxies deprecation page to note removal * website: remove managed proxy configuration API docs * website: remove managed proxy note from built-in proxy config * website: add note on removing proxy subdirectory of data_dir	2019-08-09 15:19:30 -04:00
Freddy	a295d9e5db	Flaky test overhaul (#6100 )	2019-07-12 09:52:26 -06:00
Aestek	97bb907b69	ae: use stale requests when performing full sync (#5873 ) Read requests performed during anti antropy full sync currently target the leader only. This generates a non-negligible load on the leader when the DC is large enough and can be offloaded to the followers following the "eventually consistent" policy for the agent state. We switch the AE read calls to use stale requests with a small (2s) MaxStaleDuration value and make sure we do not read too fast after a write.	2019-06-17 18:05:47 +02:00
R.B. Boyer	9b41199585	agent: fix several data races and bugs related to node-local alias checks (#5876 ) The observed bug was that a full restart of a consul datacenter (servers and clients) in conjunction with a restart of a connect-flavored application with bring-your-own-service-registration logic would very frequently cause the envoy sidecar service check to never reflect the aliased service. Over the course of investigation several bugs and unfortunate interactions were corrected: (1) local.CheckState objects were only shallow copied, but the key piece of data that gets read and updated is one of the things not copied (the underlying Check with a Status field). When the stock code was run with the race detector enabled this highly-relevant-to-the-test-scenario field was found to be racy. Changes: a) update the existing Clone method to include the Check field b) copy-on-write when those fields need to change rather than incrementally updating them in place. This made the observed behavior occur slightly less often. (2) If anything about how the runLocal method for node-local alias check logic was ever flawed, there was no fallback option. Those checks are purely edge-triggered and failure to properly notice a single edge transition would leave the alias check incorrect until the next flap of the aliased check. The change was to introduce a fallback timer to act as a control loop to double check the alias check matches the aliased check every minute (borrowing the duration from the non-local alias check logic body). This made the observed behavior eventually go away when it did occur. (3) Originally I thought there were two main actions involved in the data race: A. The act of adding the original check (from disk recovery) and its first health evaluation. B. The act of the HTTP API requests coming in and resetting the local state when re-registering the same services and checks. It took awhile for me to realize that there's a third action at work: C. The goroutines associated with the original check and the later checks. The actual sequence of actions that was causing the bad behavior was that the API actions result in the original check to be removed and re-added _without waiting for the original goroutine to terminate_. This means for brief windows of time during check definition edits there are two goroutines that can be sending updates for the alias check status. In extremely unlikely scenarios the original goroutine sees the aliased check start up in `critical` before being removed but does not get the notification about the nearly immediate update of that check to `passing`. This is interlaced wit the new goroutine coming up, initializing its base case to `passing` from the current state and then listening for new notifications of edge triggers. If the original goroutine "finishes" its update, it then commits one more write into the local state of `critical` and exits leaving the alias check no longer reflecting the underlying check. The correction here is to enforce that the old goroutines must terminate before spawning the new one for alias checks.	2019-05-24 13:36:56 -05:00
Freddy	1538a738f2	Update alias checks on local add and remove	2019-04-24 12:17:06 -06:00
Alvin Huang	aacb81a566	Merge pull request #5376 from hashicorp/fix-tests Fix tests in prep for CircleCI Migration	2019-04-04 17:09:32 -04:00
Jeff Mitchell	d3c7d57209	Move internal/ to sdk/ (#5568 ) * Move internal/ to sdk/ * Add a readme to the SDK folder	2019-03-27 08:54:56 -04:00
Jeff Mitchell	a41c865059	Convert to Go Modules (#5517 ) * First conversion * Use serf 0.8.2 tag and associated updated deps * * Move freeport and testutil into internal/ * Make internal/ its own module * Update imports * Add replace statements so API and normal Consul code are self-referencing for ease of development * Adapt to newer goe/values * Bump to new cleanhttp * Fix ban nonprintable chars test * Update lock bad args test The error message when the duration cannot be parsed changed in Go 1.12 (ae0c435877d3aacb9af5e706c40f9dddde5d3e67). This updates that test. * Update another test as well * Bump travis * Bump circleci * Bump go-discover and godo to get rid of launchpad dep * Bump dockerfile go version * fix tar command * Bump go-cleanhttp	2019-03-26 17:04:58 -04:00
R.B. Boyer	91e78e00c7	fix typos reported by golangci-lint:misspell (#5434 )	2019-03-06 11:13:28 -06:00
Aestek	2ce7240abc	Register and deregisters services and their checks atomically in the local state (#5012 ) Prevent race between register and deregister requests by saving them together in the local state on registration. Also adds more cleaning in case of failure when registering services / checks.	2019-03-04 09:34:05 -05:00
Matt Keeler	0c76a4389f	ACL Token Persistence and Reloading (#5328 ) This PR adds two features which will be useful for operators when ACLs are in use. 1. Tokens set in configuration files are now reloadable. 2. If `acl.enable_token_persistence` is set to `true` in the configuration, tokens set via the `v1/agent/token` endpoint are now persisted to disk and loaded when the agent starts (or during configuration reload) Note that token persistence is opt-in so our users who do not want tokens on the local disk will see no change. Some other secondary changes: * Refactored a bunch of places where the replication token is retrieved from the token store. This token isn't just for replicating ACLs and now it is named accordingly. * Allowed better paths in the `v1/agent/token/` API. Instead of paths like: `v1/agent/token/acl_replication_token` the path can now be just `v1/agent/token/replication`. The old paths remain to be valid. * Added a couple new API functions to set tokens via the new paths. Deprecated the old ones and pointed to the new names. The names are also generally better and don't imply that what you are setting is for ACLs but rather are setting ACL tokens. There is a minor semantic difference there especially for the replication token as again, its no longer used only for ACL token/policy replication. The new functions will detect 404s and fallback to using the older token paths when talking to pre-1.4.3 agents. * Docs updated to reflect the API additions and to show using the new endpoints. * Updated the ACL CLI set-agent-tokens command to use the non-deprecated APIs.	2019-02-27 14:28:31 -05:00
Alvin Huang	9543ab6a7c	fix TestAgent_CheckCriticalTime and better error output	2019-02-22 17:34:45 -05:00
Matt Keeler	a34f8c751e	Pass a testing.T into NewTestAgent and TestAgent.Start (#5342 ) This way we can avoid unnecessary panics which cause other tests not to run. This doesn't remove all the possibilities for panics causing other tests not to run, it just fixes the TestAgent	2019-02-14 10:59:14 -05:00
Aestek	5647ca2bbb	[Fix] Services sometimes not being synced with acl_enforce_version_8 = false (#4771 ) Fixes: https://github.com/hashicorp/consul/issues/3676 This fixes a bug were registering an agent with a non-existent ACL token can prevent other services registered with a good token from being synced to the server when using `acl_enforce_version_8 = false`. ## Background When `acl_enforce_version_8` is off the agent does not check the ACL token validity before storing the service in its state. When syncing a service registered with a missing ACL token we fall into the default error handling case (https://github.com/hashicorp/consul/blob/master/agent/local/state.go#L1255) and stop the sync (https://github.com/hashicorp/consul/blob/master/agent/local/state.go#L1082) without setting its Synced property to true like in the permission denied case. This means that the sync will always stop at the faulty service(s). The order in which the services are synced is random since we iterate on a map. So eventually all services with good ACL tokens will be synced, this can however take some time and is influenced by the cluster size, the bigger the slower because retries are less frequent. Having a service in this state also prevent all further sync of checks as they are done after the services. ## Changes This change modify the sync process to continue even if there is an error. This fixes the issue described above as well as making the sync more error tolerant: if the server repeatedly refuses a service (the ACL token could have been deleted by the time the service is synced, the servers were upgraded to a newer version that has more strict checks on the service definition...). Then all services and check that can be synced will, and those that don't will be marked as errors in the logs instead of blocking the whole process.	2019-01-04 10:01:50 -05:00
Matt Keeler	99e0a124cb	New ACLs (#4791 ) This PR is almost a complete rewrite of the ACL system within Consul. It brings the features more in line with other HashiCorp products. Obviously there is quite a bit left to do here but most of it is related docs, testing and finishing the last few commands in the CLI. I will update the PR description and check off the todos as I finish them over the next few days/week. Description At a high level this PR is mainly to split ACL tokens from Policies and to split the concepts of Authorization from Identities. A lot of this PR is mostly just to support CRUD operations on ACLTokens and ACLPolicies. These in and of themselves are not particularly interesting. The bigger conceptual changes are in how tokens get resolved, how backwards compatibility is handled and the separation of policy from identity which could lead the way to allowing for alternative identity providers. On the surface and with a new cluster the ACL system will look very similar to that of Nomads. Both have tokens and policies. Both have local tokens. The ACL management APIs for both are very similar. I even ripped off Nomad's ACL bootstrap resetting procedure. There are a few key differences though. Nomad requires token and policy replication where Consul only requires policy replication with token replication being opt-in. In Consul local tokens only work with token replication being enabled though. All policies in Nomad are globally applicable. In Consul all policies are stored and replicated globally but can be scoped to a subset of the datacenters. This allows for more granular access management. Unlike Nomad, Consul has legacy baggage in the form of the original ACL system. The ramifications of this are: A server running the new system must still support other clients using the legacy system. A client running the new system must be able to use the legacy RPCs when the servers in its datacenter are running the legacy system. The primary ACL DC's servers running in legacy mode needs to be a gate that keeps everything else in the entire multi-DC cluster running in legacy mode. So not only does this PR implement the new ACL system but has a legacy mode built in for when the cluster isn't ready for new ACLs. Also detecting that new ACLs can be used is automatic and requires no configuration on the part of administrators. This process is detailed more in the "Transitioning from Legacy to New ACL Mode" section below.	2018-10-19 12:04:07 -04:00
Paul Banks	10af44006a	Proxy Config Manager (#4729 ) * Proxy Config Manager This component watches for local state changes on the agent and ensures that each service registered locally with Kind == connect-proxy has it's state being actively populated in the cache. This serves two purposes: 1. For the built-in proxy, it ensures that the state needed to accept connections is available in RAM shortly after registration and likely before the proxy actually starts accepting traffic. 2. For (future - next PR) xDS server and other possible future proxies that require _push_ based config discovery, this provides a mechanism to subscribe and be notified about updates to a proxy instance's config including upstream service discovery results. * Address review comments * Better comments; Better delivery of latest snapshot for slow watchers; Embed Config * Comment typos * Add upstream Stringer for funsies	2018-10-10 16:55:34 +01:00
Paul Banks	979e1c9c94	Add -sidecar-for and new /agent/service/:service_id endpoint (#4691 ) - A new endpoint `/v1/agent/service/:service_id` which is a generic way to look up the service for a single instance. The primary value here is that it: - supports hash-based blocking and so; - replaces `/agent/connect/proxy/:proxy_id` as the mechanism the built-in proxy uses to read its config. - It's not proxy specific and so works for any service. - It has a temporary shim to call through to the existing endpoint to preserve current managed proxy config defaulting behaviour until that is removed entirely (tested). - The built-in proxy now uses the new endpoint exclusively for it's config - The built-in proxy now has a `-sidecar-for` flag that allows the service ID of the _target_ service to be specified, on the condition that there is exactly one "sidecar" proxy (that is one that has `Proxy.DestinationServiceID` set) for the service registered. - Several fixes for edge cases for SidecarService - A fix for `Alias` checks - when running locally they didn't update their state until some external thing updated the target. If the target service has no checks registered as below, then the alias never made it past critical.	2018-10-10 16:55:34 +01:00
Paul Banks	92fe8c8e89	Add Proxy Upstreams to Service Definition (#4639 ) * Refactor Service Definition ProxyDestination. This includes: - Refactoring all internal structs used - Updated tests for both deprecated and new input for: - Agent Services endpoint response - Agent Service endpoint response - Agent Register endpoint - Unmanaged deprecated field - Unmanaged new fields - Managed deprecated upstreams - Managed new - Catalog Register - Unmanaged deprecated field - Unmanaged new fields - Managed deprecated upstreams - Managed new - Catalog Services endpoint response - Catalog Node endpoint response - Catalog Service endpoint response - Updated API tests for all of the above too (both deprecated and new forms of register) TODO: - config package changes for on-disk service definitions - proxy config endpoint - built-in proxy support for new fields * Agent proxy config endpoint updated with upstreams * Config file changes for upstreams. * Add upstream opaque config and update all tests to ensure it works everywhere. * Built in proxy working with new Upstreams config * Command fixes and deprecations * Fix key translation, upstream type defaults and a spate of other subtele bugs found with ned to end test scripts... TODO: tests still failing on one case that needs a fix. I think it's key translation for upstreams nested in Managed proxy struct. * Fix translated keys in API registration. ≈ * Fixes from docs - omit some empty undocumented fields in API - Bring back ServiceProxyDestination in Catalog responses to not break backwards compat - this was removed assuming it was only used internally. * Documentation updates for Upstreams in service definition * Fixes for tests broken by many refactors. * Enable travis on f-connect branch in this branch too. * Add consistent Deprecation comments to ProxyDestination uses * Update version number on deprecation notices, and correct upstream datacenter field with explanation in docs	2018-10-10 16:55:34 +01:00
Pierre Souchay	473e589d86	Implementation of Weights Data structures (#4468 ) * Implementation of Weights Data structures Adding this datastructure will allow us to resolve the issues #1088 and #4198 This new structure defaults to values: ``` { Passing: 1, Warning: 0 } ``` Which means, use weight of 0 for a Service in Warning State while use Weight 1 for a Healthy Service. Thus it remains compatible with previous Consul versions. * Implemented weights for DNS SRV Records * DNS properly support agents with weight support while server does not (backwards compatibility) * Use Warning value of Weights of 1 by default When using DNS interface with only_passing = false, all nodes with non-Critical healthcheck used to have a weight value of 1. While having weight.Warning = 0 as default value, this is probably a bad idea as it breaks ascending compatibility. Thus, we put a default value of 1 to be consistent with existing behaviour. * Added documentation for new weight field in service description * Better documentation about weights as suggested by @banks * Return weight = 1 for unknown Check states as suggested by @banks * Fixed typo (of -> or) in error message as requested by @mkeeler * Fixed unstable unit test TestRetryJoin * Fixed unstable tests * Fixed wrong Fatalf format in `testrpc/wait.go` * Added notes regarding DNS SRV lookup limitations regarding number of instances * Documentation fixes and clarification regarding SRV records with weights as requested by @banks * Rephrase docs	2018-09-07 15:30:47 +01:00
Martin	6af4501a68	Use target service name instead of ID as connect proxy service name (#4620 )	2018-09-05 20:33:17 +01:00
Siva Prasad	5fe9053416	TestAgentAntiEntropy: Wait until Consul service is up on the agent. (#4591 ) * Anti-Entropy test wait for Consul service added * Reverted some tests back to using WaitForLeader	2018-08-28 09:52:11 -04:00
Pierre Souchay	fd927ea110	BUGFIX: Unit test relying on WaitForLeader() did not work due to wrong test (#4472 ) - Improve resilience of testrpc.WaitForLeader() - Add additionall retry to CI - Increase "go test" timeout to 8m - Add wait for cluster leader to several tests in the agent package - Add retry to some tests in the api and command packages	2018-08-06 19:46:09 -04:00
Mitchell Hashimoto	dedc5ad69f	agent/local: silly spacing on select statements	2018-07-19 14:21:30 -05:00
Mitchell Hashimoto	e42ca78c5d	agent/local: address remaining test feedback	2018-07-19 14:20:50 -05:00
Mitchell Hashimoto	81f6486fb5	agent/local: don't use time.After in test since notify is instant	2018-07-18 16:16:28 -05:00
Mitchell Hashimoto	5889a3b6ff	agent: address some basic feedback	2018-07-12 09:36:11 -07:00
Mitchell Hashimoto	3177d1719d	agent/local: support local alias checks	2018-07-12 09:36:10 -07:00
Pierre Souchay	9128de5b11	Merge remote-tracking branch 'origin/master' into ACL_additional_info	2018-07-07 14:09:18 +02:00
Paul Banks	1e5a2561b6	Make tests pass and clean proxy persistence. No detached child changes yet. This is a good state for persistence stuff to re-start the detached child work that got mixed up last time.	2018-06-25 12:24:10 -07:00
Paul Banks	3bac52480e	Abandon daemonize for simpler solution (preserving history): Reverts: - bdb274852ae469c89092d6050697c0ff97178465 - 2c689179c4f61c11f0016214c0fc127a0b813bfe - d62e25c4a7ab753914b6baccd66f88ffd10949a3 - c727ffbcc98e3e0bf41e1a7bdd40169bd2d22191 - 31b4d18933fd0acbe157e28d03ad59c2abf9a1fb - 85c3f8df3eabc00f490cd392213c3b928a85aa44	2018-06-25 12:24:10 -07:00
Paul Banks	e1aca748c4	Make daemoinze an option on test binary without hacks. Misc fixes for racey or broken tests. Still failing on several though.	2018-06-25 12:24:09 -07:00
Paul Banks	3a00574a13	Persist proxy state through agent restart	2018-06-25 12:24:08 -07:00
Mitchell Hashimoto	ed14e9edf8	agent: resolve some conflicts and fix tests	2018-06-14 09:42:10 -07:00
Mitchell Hashimoto	657c09133a	agent/local: clarify the non-risk of a full buffer	2018-06-14 09:42:10 -07:00
Mitchell Hashimoto	31b09c0674	agent/local: remove outdated comment	2018-06-14 09:42:10 -07:00
Mitchell Hashimoto	a2167a7fd1	agent/proxy: manager and basic tests, not great coverage yet coming soon	2018-06-14 09:42:08 -07:00
Mitchell Hashimoto	fae8dc8951	agent/local: add Notify mechanism for proxy changes	2018-06-14 09:42:08 -07:00
Mitchell Hashimoto	f64a002f68	agent: start/stop proxies	2018-06-14 09:42:08 -07:00
Mitchell Hashimoto	76c6849ffe	agent/local: store proxy on local state, wip, not working yet	2018-06-14 09:42:08 -07:00
Paul Banks	02ab461dae	TLS watching integrated into Service with some basic tests. There are also a lot of small bug fixes found when testing lots of things end-to-end for the first time and some cleanup now it's integrated with real CA code.	2018-06-14 09:42:07 -07:00
Paul Banks	9d11cd9bf4	Fix various test failures and vet warnings. Intention de-duplication in previously merged PR actualy failed some tests that were not caught be me or CI. I ran the test files for state changes but they happened not to trigger this case so I made sure they did first and then fixed. That fixed some upstream intention endpoint tests that I'd not run as part of testing the previous fix.	2018-06-14 09:41:58 -07:00
Paul Banks	44afb5c699	Agent Connect Proxy config endpoint with hash-based blocking	2018-06-14 09:41:57 -07:00
Paul Banks	78e48fd547	Added connect proxy config and local agent state setup on boot.	2018-06-14 09:41:57 -07:00
Mitchell Hashimoto	c43ccd024a	agent/local: anti-entropy for connect proxy services	2018-06-14 09:41:48 -07:00
Pierre Souchay	6c7f01ae73	Removed labels from new ACL denied metrics	2018-06-08 11:56:46 +02:00
Pierre Souchay	2113071ae7	Removed consul prefix from metrics as requested by @kyhavlov	2018-06-08 11:51:50 +02:00
Pierre Souchay	bebf03e26e	Fixed import	2018-04-18 17:09:25 +02:00
Pierre Souchay	4739b05d12	Added labels to improve new metric	2018-04-18 16:51:22 +02:00
Pierre Souchay	12f81c60ac	Track calls blocked by ACLs using metrics	2018-04-17 10:17:16 +02:00
Guido Iaquinti	244fc72b05	Add package name to log output	2018-03-21 15:56:14 +00:00
Josh Soref	1dd8c378b9	Spelling (#3958 ) * spelling: another * spelling: autopilot * spelling: beginning * spelling: circonus * spelling: default * spelling: definition * spelling: distance * spelling: encountered * spelling: enterprise * spelling: expands * spelling: exits * spelling: formatting * spelling: health * spelling: hierarchy * spelling: imposed * spelling: independence * spelling: inspect * spelling: last * spelling: latest * spelling: client * spelling: message * spelling: minimum * spelling: notify * spelling: nonexistent * spelling: operator * spelling: payload * spelling: preceded * spelling: prepared * spelling: programmatically * spelling: required * spelling: reconcile * spelling: responses * spelling: request * spelling: response * spelling: results * spelling: retrieve * spelling: service * spelling: significantly * spelling: specifies * spelling: supported * spelling: synchronization * spelling: synchronous * spelling: themselves * spelling: unexpected * spelling: validations * spelling: value	2018-03-19 16:56:00 +00:00
James Phillips	c52824bab7	Adds a longer retry period for the AE deferred output test. There's some justification in the comments about this and a TODO to improve this later. Fixes #3668	2017-11-08 18:10:13 -08:00
Frank Schroeder	1d2ae14719	local state: fix go vet issue	2017-10-23 10:56:05 +02:00
Frank Schroeder	a818414bb6	local state: remove stale comment	2017-10-23 10:56:05 +02:00
Frank Schroeder	329fdc40a8	local state: make test more robust	2017-10-23 10:56:05 +02:00
Frank Schroeder	f5a3d73b27	local state: clone check to avoid side effect	2017-10-23 10:56:05 +02:00
Frank Schroeder	b36613e7ff	local state: use synchronized access to internal maps	2017-10-23 10:56:05 +02:00
Frank Schroeder	f187c37c27	local state: rename Add{Check,Service}State to Set{Check,Service}State	2017-10-23 10:56:04 +02:00
Frank Schroeder	209e67b2f9	local state: move Metadata methods together	2017-10-23 10:56:04 +02:00
Frank Schroeder	9513a042be	local state: update documentation of updateSyncState	2017-10-23 10:56:04 +02:00
Frank Schroeder	2e3b72d2c3	local state: update comments	2017-10-23 10:56:04 +02:00
Frank Schroeder	da604495a0	local state: address review comments * move non-blocking notification mechanism into ae.Trigger * move Pause/Resume into separate type	2017-10-23 10:56:04 +02:00
Frank Schroeder	c39bc770b3	local state: refactor TestAgentAntiEntropy_EnableTagOverride Make intent clearer by being more explicit and adding some comments. Use verify.Values to compare service entries.	2017-10-23 10:56:04 +02:00
Frank Schroeder	e1358a541d	local state: fix TestAgentAntiEntropy_EnableTagOverride The test had a race condition where it relied on the first service to be synced to the remote catalog which sometimes failed.	2017-10-23 10:56:04 +02:00
Frank Schroeder	b3195006b1	local state: rename tests	2017-10-23 10:56:04 +02:00
Frank Schroeder	d9a4b440a8	local state: drop retry loops from tests Since the tests are now using synchronous calls for state syncing we no longer need to use retry loops to wait for the changes to propagate.	2017-10-23 10:56:04 +02:00
Frank Schroeder	32c2d1b217	local state: fix anti-entropy state tests The anti-entropy tests relied on the side-effect of the StartSync() method to perform a full sync instead of a partial sync. This lead to multiple anti-entropy go routines being started unnecessary retry loops. This change changes the behavior to perform synchronous full syncs when necessary removing the need for all of the time.Sleep and most of the retry loops.	2017-10-23 10:56:04 +02:00
Frank Schroeder	6b966e48ce	local state: fix test with updated error message	2017-10-23 10:56:04 +02:00
Frank Schroeder	ea92ee308a	local state: tests compile	2017-10-23 10:56:03 +02:00
Frank Schroeder	7289576988	local state: replace multi-map state with structs The state of the service and health check records was spread out over multiple maps guarded by a single lock. Access to the maps has to happen in a coordinated effort and the tests often violated this which made them brittle and racy. This patch replaces the multiple maps with a single one for both checks and services to make the code less fragile. This is also necessary since moving the local state into its own package creates circular dependencies for the tests. To avoid this the tests can no longer access internal data structures which they should not be doing in the first place. The tests still don't compile but this is a ncessary step in that direction.	2017-10-23 10:56:03 +02:00
Frank Schroeder	bc7571cccf	local state: move to separate package This patch moves the local state to a separate package to further decouple it from the agent code. The code compiles but the tests do not yet.	2017-10-23 10:56:03 +02:00
Frank Schroeder	443fe8e4db	Revert "local state: move to separate package" This reverts commit d447e823c63720c74bb02459a985724f035f023e.	2017-10-23 10:08:34 +02:00
Frank Schroeder	435b442c8b	Revert "local state: replace multi-map state with structs" This reverts commit ccbae7da5bceeb2328ab7993a8badbf2e72a4597.	2017-10-23 10:08:34 +02:00
Frank Schroeder	138aa25280	Revert "local state: tests compile" This reverts commit 1af52bf7be02d952e16e14209899a9715451f7ba.	2017-10-23 10:08:34 +02:00
Frank Schroeder	80d9df69e4	Revert "local state: fix test with updated error message" This reverts commit e9149f64d9afb38246f9432edd806321c1eefb83.	2017-10-23 10:08:34 +02:00
Frank Schroeder	ded6f79b6a	Revert "local state: fix anti-entropy state tests" This reverts commit f8e20cd9960e19bbe61e258c445250723870816f.	2017-10-23 10:08:34 +02:00
Frank Schroeder	5aa77fb9e4	Revert "local state: drop retry loops from tests" This reverts commit 2bdba8ab06d1c9dd99d5e7cf8370c94b4f7adfaa.	2017-10-23 10:08:34 +02:00
Frank Schroeder	d7bb81a940	Revert "local state: rename tests" This reverts commit ff62eaf0634a4c09377c53d4623685437f217b49.	2017-10-23 10:08:34 +02:00
Frank Schroeder	3d67ce9000	Revert "local state: fix TestAgentAntiEntropy_EnableTagOverride" This reverts commit 86f7ea601342d6f3ceb9d0dc74bd5b33dae0b8d8.	2017-10-23 10:08:34 +02:00
Frank Schroeder	1bd73d2a6e	Revert "local state: refactor TestAgentAntiEntropy_EnableTagOverride" This reverts commit c28e23eac8ada7a668b13e9a4a3d8066457488ef.	2017-10-23 10:08:33 +02:00
Frank Schroeder	c72d21813b	Revert "local state: address review comments" This reverts commit 1d315075b15647db7fcd42986c9c5673cbb77a77.	2017-10-23 10:08:33 +02:00
Frank Schroeder	4177bad4f3	Revert "local state: update comments" This reverts commit 42188164f885188e3bc8cff70ea5aeb47d633b83.	2017-10-23 10:08:33 +02:00
Frank Schroeder	d1e514cedc	Revert "local state: update documentation of updateSyncState" This reverts commit e86521e637d742bce1e460b6b960037cef3578ed.	2017-10-23 10:08:33 +02:00
Frank Schroeder	133b23fb77	Revert "local state: move Metadata methods together" This reverts commit 9bc8127728a62beb94b28849070b6ac35c181404.	2017-10-23 10:08:33 +02:00
Frank Schroeder	67135cc33e	Revert "local state: rename Add{Check,Service}State to Set{Check,Service}State" This reverts commit 9280841a80d98b253a8f23967875e45e5e37e093.	2017-10-23 10:08:33 +02:00
Frank Schroeder	655a24e383	Revert "local state: use synchronized access to internal maps" This reverts commit 39a2d8d25e629823e183e384e8414171edcf4164.	2017-10-23 10:08:32 +02:00
Frank Schroeder	fe0f7c961d	Revert "local state: clone check to avoid side effect" This reverts commit af1243c7251fe6291145bbe4f4dacd374779c425.	2017-10-23 10:08:32 +02:00

1 2 3 4

172 Commits