open-consul

Commit Graph

Author	SHA1	Message	Date
R.B. Boyer	22ee60d1ba	agent: blocking central config RPCs iterations should not interfere with each other (#6316 )	2019-08-14 09:08:46 -05:00
hashicorp-ci	29767157ed	Merge Consul OSS branch 'master' at commit 8f7586b339dbb518eff3a2eec27d7b8eae7a3fbb	2019-08-13 02:00:43 +00:00
Sarah Adams	2f7a90bc52	add flag to allow /operator/keyring requests to only hit local servers (#6279 ) Add parameter local-only to operator keyring list requests to force queries to only hit local servers (no WAN traffic). HTTP API: GET /operator/keyring?local-only=true CLI: consul keyring -list --local-only Sending the local-only flag with any non-GET/list request will result in an error.	2019-08-12 11:11:11 -07:00
Mike Morris	88df658243	connect: remove managed proxies (#6220 ) * connect: remove managed proxies implementation and all supporting config options and structs * connect: remove deprecated ProxyDestination * command: remove CONNECT_PROXY_TOKEN env var * agent: remove entire proxyprocess proxy manager * test: remove all managed proxy tests * test: remove irrelevant managed proxy note from TestService_ServerTLSConfig * test: update ContentHash to reflect managed proxy removal * test: remove deprecated ProxyDestination test * telemetry: remove managed proxy note * http: remove /v1/agent/connect/proxy endpoint * ci: remove deprecated test exclusion * website: update managed proxies deprecation page to note removal * website: remove managed proxy configuration API docs * website: remove managed proxy note from built-in proxy config * website: add note on removing proxy subdirectory of data_dir	2019-08-09 15:19:30 -04:00
R.B. Boyer	357ca39868	connect: ensure intention replication continues to work when the replication ACL token changes (#6288 )	2019-08-07 11:34:09 -05:00
hashicorp-ci	3ac803da5e	Merge Consul OSS branch 'master' at commit d84863799deca45ccf4bec5ab9f645ccae6b3aeb	2019-08-06 02:00:30 +00:00
Sarah Adams	9ed3e64510	fallback to proxy config global protocol when upstream services' protocol is unset (#6277 ) fallback to proxy config global protocol when upstream services' protocol is unset Fixes #5857	2019-08-05 12:52:35 -07:00
R.B. Boyer	64fc002e03	connect: fix failover through a mesh gateway to a remote datacenter (#6259 ) Failover is pushed entirely down to the data plane by creating envoy clusters and putting each successive destination in a different load assignment priority band. For example this shows that normally requests go to 1.2.3.4:8080 but when that fails they go to 6.7.8.9:8080: - name: foo load_assignment: cluster_name: foo policy: overprovisioning_factor: 100000 endpoints: - priority: 0 lb_endpoints: - endpoint: address: socket_address: address: 1.2.3.4 port_value: 8080 - priority: 1 lb_endpoints: - endpoint: address: socket_address: address: 6.7.8.9 port_value: 8080 Mesh gateways route requests based solely on the SNI header tacked onto the TLS layer. Envoy currently only lets you configure the outbound SNI header at the cluster layer. If you try to failover through a mesh gateway you ideally would configure the SNI value per endpoint, but that's not possible in envoy today. This PR introduces a simpler way around the problem for now: 1. We identify any target of failover that will use mesh gateway mode local or remote and then further isolate any resolver node in the compiled discovery chain that has a failover destination set to one of those targets. 2. For each of these resolvers we will perform a small measurement of comparative healths of the endpoints that come back from the health API for the set of primary target and serial failover targets. We walk the list of targets in order and if any endpoint is healthy we return that target, otherwise we move on to the next target. 3. The CDS and EDS endpoints both perform the measurements in (2) for the affected resolver nodes. 4. For CDS this measurement selects which TLS SNI field to use for the cluster (note the cluster is always going to be named for the primary target) 5. For EDS this measurement selects which set of endpoints will populate the cluster. Priority tiered failover is ignored. One of the big downsides to this approach to failover is that the failover detection and correction is going to be controlled by consul rather than deferring that entirely to the data plane as with the prior version. This also means that we are bound to only failover using official health signals and cannot make use of data plane signals like outlier detection to affect failover. In this specific scenario the lack of data plane signals is ok because the effectiveness is already muted by the fact that the ultimate destination endpoints will have their data plane signals scrambled when they pass through the mesh gateway wrapper anyway so we're not losing much. Another related fix is that we now use the endpoint health from the underlying service, not the health of the gateway (regardless of failover mode).	2019-08-05 13:30:35 -05:00
R.B. Boyer	0165e93517	connect: expose an API endpoint to compile the discovery chain (#6248 ) In addition to exposing compilation over the API cleaned up the structures that would be exchanged to be cleaner and easier to support and understand. Also removed ability to configure the envoy OverprovisioningFactor.	2019-08-02 15:34:54 -05:00
Todd Radel	295abd82c3	connect: generate intermediate at same time as root (#6272 ) Generate intermediate at same time as root Co-Authored-By: Freddy <freddygv@users.noreply.github.com>	2019-08-02 15:36:03 -04:00
R.B. Boyer	4e2fb5730c	connect: detect and prevent circular discovery chain references (#6246 )	2019-08-02 09:18:45 -05:00
R.B. Boyer	6c9edb17c2	server: if inserting bootstrap config entries fails don't silence the errors (#6256 )	2019-08-01 23:07:11 -05:00
R.B. Boyer	782c647bf4	connect: simplify the compiled discovery chain data structures (#6242 ) This should make them better for sending over RPC or the API. Instead of a chain implemented explicitly like a linked list (nodes holding pointers to other nodes) instead switch to a flat map of named nodes with nodes linking other other nodes by name. The shipped structure is just a map and a string to indicate which key to start from. Other changes: * inline the compiler option InferDefaults as true * introduce compiled target config to avoid needing to send back additional maps of Resolvers; future target-specific compiled state can go here * move compiled MeshGateway out of the Resolver and into the TargetConfig where it makes more sense.	2019-08-01 22:44:05 -05:00
R.B. Boyer	4666599e18	connect: reconcile how upstream configuration works with discovery chains (#6225 ) * connect: reconcile how upstream configuration works with discovery chains The following upstream config fields for connect sidecars sanely integrate into discovery chain resolution: - Destination Namespace/Datacenter: Compilation occurs locally but using different default values for namespaces and datacenters. The xDS clusters that are created are named as they normally would be. - Mesh Gateway Mode (single upstream): If set this value overrides any value computed for any resolver for the entire discovery chain. The xDS clusters that are created may be named differently (see below). - Mesh Gateway Mode (whole sidecar): If set this value overrides any value computed for any resolver for the entire discovery chain. If this is specifically overridden for a single upstream this value is ignored in that case. The xDS clusters that are created may be named differently (see below). - Protocol (in opaque config): If set this value overrides the value computed when evaluating the entire discovery chain. If the normal chain would be TCP or if this override is set to TCP then the result is that we explicitly disable L7 Routing and Splitting. The xDS clusters that are created may be named differently (see below). - Connect Timeout (in opaque config): If set this value overrides the value for any resolver in the entire discovery chain. The xDS clusters that are created may be named differently (see below). If any of the above overrides affect the actual result of compiling the discovery chain (i.e. "tcp" becomes "grpc" instead of being a no-op override to "tcp") then the relevant parameters are hashed and provided to the xDS layer as a prefix for use in naming the Clusters. This is to ensure that if one Upstream discovery chain has no overrides and tangentially needs a cluster named "api.default.XXX", and another Upstream does have overrides for "api.default.XXX" that they won't cross-pollinate against the operator's wishes. Fixes #6159	2019-08-01 22:03:34 -05:00
Paul Banks	a5c70d79d0	Revert "connect: support AWS PCA as a CA provider" (#6251 ) This reverts commit 3497b7c00d49c4acbbf951d84f2bba93f3da7510.	2019-07-31 09:08:10 -04:00
Todd Radel	d3b7fd83fe	connect: support AWS PCA as a CA provider (#6189 ) Port AWS PCA provider from consul-ent	2019-07-30 22:57:51 -04:00
Todd Radel	1b14d6595e	connect: Support RSA keys in addition to ECDSA (#6055 ) Support RSA keys in addition to ECDSA	2019-07-30 17:47:39 -04:00
Matt Keeler	a7c4b7af7c	Fix CA Replication when ACLs are enabled (#6201 ) Secondary CA initialization steps are: • Wait until the primary will be capable of signing intermediate certs. We use serf metadata to check the versions of servers in the primary which avoids needing a token like the previous implementation that used RPCs. We require at least one alive server in the primary and the all alive servers meet the version requirement. • Initialize the secondary CA by getting the primary to sign an intermediate When a primary dc is configured, if no existing CA is initialized and for whatever reason we cannot initialize a secondary CA the secondary DC will remain without a CA. As soon as it can it will initialize the secondary CA by pulling the primaries roots and getting the primary to sign an intermediate. This also fixes a segfault that can happen during leadership revocation. There was a spot in the secondaryCARootsWatch that was getting the CA Provider and executing methods on it without nil checking. Under normal circumstances it wont be nil but during leadership revocation it gets nil'ed out. Therefore there is a period of time between closing the stop chan and when the go routine is actually stopped where it could read a nil provider and cause a segfault.	2019-07-26 15:57:57 -04:00
R.B. Boyer	1b95d2e5e3	Merge Consul OSS branch master at commit b3541c4f34d43ab92fe52256420759f17ea0ed73	2019-07-26 10:34:24 -05:00
Matt Keeler	c4a34602b6	Allow forwarding of some status RPCs (#6198 ) * Allow forwarding of some status RPCs * Update docs * add comments about not using the regular forward	2019-07-25 14:26:22 -04:00
Jeff Mitchell	e266b038cc	Make the chunking test multidimensional (#6212 ) This ensures that it's not just a single operation we restores successfully, but many. It's the same foundation, just with multiple going on at once.	2019-07-25 11:40:09 +01:00
Freddy	7dbbe7e55a	auto-encrypt: Fix port resolution and fallback to default port (#6205 ) Auto-encrypt meant to fallback to the default port when it wasn't provided, but it hadn't been because of an issue with the error handling. We were checking against an incomplete error value: "missing port in address" vs "address $HOST: missing port in address" Additionally, all RPCs to AutoEncrypt.Sign were using a.config.ServerPort, so those were updated to use ports resolved by resolveAddrs, if they are available.	2019-07-24 16:49:37 -07:00
Jeff Mitchell	e0068431f5	Chunking support (#6172 ) * Initial chunk support This uses the go-raft-middleware library to allow for chunked commits to the KV	2019-07-24 17:06:39 -04:00
Freddy	1b97d65873	Make new config when retrying testServer creation (#6204 )	2019-07-24 08:41:00 -06:00
Alvin Huang	5b6fa58453	resolve circleci config conflicts	2019-07-23 20:18:36 -04:00
Freddy	c19f46639b	Restore NotifyListen to avoid panic in newServer retry (#6200 )	2019-07-23 14:33:00 -06:00
Christian Muehlhaeuser	2602f6907e	Simplified code in various places (#6176 ) All these changes should have no side-effects or change behavior: - Use bytes.Buffer's String() instead of a conversion - Use time.Since and time.Until where fitting - Drop unnecessary returns and assignment	2019-07-20 09:37:19 -04:00
hashicorp-ci	8b109e5f9f	Merge Consul OSS branch 'master' at commit ef257b084d2e2a474889518440515e360d0cd990	2019-07-20 02:00:29 +00:00
Christian Muehlhaeuser	26f9368567	Fixed typos in comments (#6175 ) Just a few nitpicky typo fixes.	2019-07-19 07:54:53 -04:00
Christian Muehlhaeuser	877bfd280b	Fixed a few tautological condition mistakes (#6177 ) None of these changes should have any side-effects. They're merely fixing tautological mistakes.	2019-07-19 07:53:42 -04:00
Christian Muehlhaeuser	d1426767f6	Fixed nil check for token (#6179 ) I can only assume we want to check for the retrieved `updatedToken` to not be nil, before accessing it below. `token` can't possibly be nil at this point, as we accessed `token.AccessorID` just before.	2019-07-19 07:48:11 -04:00
Alvin Huang	17654c6292	Merge branch 'master' into release/1-6	2019-07-17 15:43:30 -04:00
Freddy	f59e6db9b1	Reduce number of servers in TestServer_Expect_NonVoters (#6155 )	2019-07-17 11:35:33 -06:00
Freddy	476a4b95a5	More flaky test fixes (#6151 ) * Add retry to TestAPI_ClientTxn * Add retry to TestLeader_RegisterMember * Account for empty watch result in ConnectRootsWatch	2019-07-17 09:33:38 -06:00
hashicorp-ci	022483aff0	Merge Consul OSS branch 'master' at commit 95dbb7f2f1b9fc3528a16335201e2324f1b388bd	2019-07-17 02:00:21 +00:00
Freddy	99601aa3a7	Update retries that weren't using retry.R (#6146 )	2019-07-16 14:47:45 -06:00
R.B. Boyer	1cc6d07d0f	add test for discovery chain agent cache-type (#6130 )	2019-07-15 10:09:52 -05:00
Jack Pearkes	fa15914813	Merge branch 'master' into release/1-6	2019-07-12 14:51:25 -07:00
Matt Keeler	3914ec5c62	Various Gateway Fixes (#6093 ) * Ensure the mesh gateway configuration comes back in the api within each upstream * Add a test for the MeshGatewayConfig in the ToAPI functions * Ensure we don’t use gateways for dc local connections * Update the svc kind index for deletions * Replace the proxycfg.state cache with an interface for testing Also start implementing proxycfg state testing. * Update the state tests to verify some gateway watches for upstream-targets of a discovery chain.	2019-07-12 17:19:37 -04:00
Sarah Adams	4afa034d6a	fix flaky test TestACLEndpoint_SecureIntroEndpoints_OnlyCreateLocalData (#6116 ) * fix test to write only to dc2 (typo) * fix retry behavior in existing test (was being used incorrectly)	2019-07-12 14:14:42 -07:00
R.B. Boyer	72a8195839	implement some missing service-router features and add more xDS testing (#6065 ) - also implement OnlyPassing filters for non-gateway clusters	2019-07-12 14:16:21 -05:00
R.B. Boyer	9e1e9aad2e	Fix bug in service-resolver redirects if the destination uses a default resolver. (#6122 ) Also: - add back an internal http endpoint to dump a compiled discovery chain for debugging purposes Before the CompiledDiscoveryChain.IsDefault() method would test: - is this chain just one resolver step? - is that resolver step just the default? But what I forgot to test: - is that resolver step for the same service that the chain represents? This last point is important because if you configured just one config entry: kind = "service-resolver" name = "web" redirect { service = "other" } and requested the chain for "web" you'd get back a default resolver for "other". In the xDS code the IsDefault() method is used to determine if this chain is "empty". If it is then we use the pre-discovery-chain logic that just uses data embedded in the Upstream object (and still lets the escape hatches function). In the example above that means certain parts of the xDS code were going to try referencing a cluster named "web..." despite the other parts of the xDS code maintaining clusters named "other...".	2019-07-12 12:21:25 -05:00
Freddy	a295d9e5db	Flaky test overhaul (#6100 )	2019-07-12 09:52:26 -06:00
Freddy	b6b6dbadb0	Remove dummy config (#6121 )	2019-07-12 09:50:14 -06:00
Freddy	74b7bcb612	Update TestServer creation in sdk/testutil (#6084 ) * Retry the creation of the test server three times. * Reduce the retry timeout for the API wait to 2 seconds, opting to fail faster and start over. * Remove wait for leader from server creation. This wait can be added on a test by test basis now that the function is being exported. * Remove wait for anti-entropy sync. This is built into the existing WaitForSerfCheck func, so that can be used if the anti-entropy wait is needed	2019-07-12 09:37:29 -06:00
Freddy	f5634a24e8	Clean up StatsFetcher work when context is exceeded (#6086 )	2019-07-12 08:23:28 -06:00
Matt Keeler	6cc936d64b	Move ctx and cancel func setup into the Replicator.Start (#6115 ) Previously a sequence of events like: Start Stop Start Stop would segfault on the second stop because the original ctx and cancel func were only initialized during the constructor and not during Start.	2019-07-12 10:10:48 -04:00
Jack Pearkes	2b1761bab3	Make cluster names SNI always (#6081 ) * Make cluster names SNI always * Update some tests * Ensure we check for prepared query types * Use sni for route cluster names * Proper mesh gateway mode defaulting when the discovery chain is used * Ignore service splits from PatchSliceOfMaps * Update some xds golden files for proper test output * Allow for grpc/http listeners/cluster configs with the disco chain * Update stats expectation	2019-07-08 12:48:48 +01:00
Matt Keeler	35a839952b	Fix Internal.ServiceDump blocking (#6076 ) maxIndexWatchTxn was only watching the IndexEntry of the max index of all the entries. It needed to watch all of them regardless of which was the max. Also plumbed the query source through in the proxy config to help better track requests.	2019-07-04 16:17:49 +01:00
R.B. Boyer	a1900754db	digest the proxy-defaults protocol into the graph (#6050 )	2019-07-02 11:01:17 -05:00

1 2 3 4 5 ...

612 Commits