open-consul

Commit Graph

Author	SHA1	Message	Date
Derek Menteer	065e538de3	Add tests.	2022-10-31 08:45:00 -05:00
Derek Menteer	59a385bc9a	Fix peered service protocols using proxy-defaults.	2022-10-31 08:45:00 -05:00
Eric Haberkorn	57fb729547	Fix peering metrics bug (#15178 ) This bug was caused by the peering health metric being set to NaN.	2022-10-28 10:51:12 -04:00
Chris S. Kim	a0ac76ecf5	Allow consul debug on non-ACL consul servers (#15155 )	2022-10-27 09:25:18 -04:00
cskh	57380ea752	fix(peering): nil pointer in calling handleUpdateService (#15160 ) * fix(peering): nil pointer in calling handleUpdateService * changelog	2022-10-26 11:50:34 -04:00
Eric Haberkorn	74baaf910c	fix bug that resulted in generating Envoy configs that use CDS with an EDS configuration (#15140 )	2022-10-25 14:49:57 -04:00
Luke Kysow	4956b81333	ingress-gateways: don't log error when registering gateway (#15001 ) * ingress-gateways: don't log error when registering gateway Previously, when an ingress gateway was registered without a corresponding ingress gateway config entry, an error was logged because the watch on the config entry returned a nil result. This is expected so don't log an error.	2022-10-25 10:55:44 -07:00
Luke Kysow	6b1ec05470	autoencrypt: helpful error for clients with wrong dc (#14832 ) * autoencrypt: helpful error for clients with wrong dc If clients have set a different datacenter than the servers they're connecting with for autoencrypt, give a helpful error message.	2022-10-25 10:13:41 -07:00
R.B. Boyer	a01936442c	cache: refactor agent cache fetching to prevent unnecessary fetches on error (#14956 ) This continues the work done in #14908 where a crude solution to prevent a goroutine leak was implemented. The former code would launch a perpetual goroutine family every iteration (+1 +1) and the fixed code simply caused a new goroutine family to first cancel the prior one to prevent the leak (-1 +1 == 0). This PR refactors this code completely to: - make it more understandable - remove the recursion-via-goroutine strangeness - prevent unnecessary RPC fetches when the prior one has errored. The core issue arose from a conflation of the entry.Fetching field to mean: - there is an RPC (blocking query) in flight right now - there is a goroutine running to manage the RPC fetch retry loop The problem is that the goroutine-leak-avoidance check would treat Fetching like (2), but within the body of a goroutine it would flip that boolean back to false before the retry sleep. This would cause a new chain of goroutines to launch which #14908 would correct crudely. The refactored code uses a plain for-loop and changes the semantics to track state for "is there a goroutine associated with this cache entry" instead of the former. We use a uint64 unique identity per goroutine instead of a boolean so that any orphaned goroutines can tell when they've been replaced when the expiry loop deletes a cache entry while the goroutine is still running and is later replaced.	2022-10-25 10:27:26 -05:00
R.B. Boyer	bcbe7b225f	test: ensure that all dependencies in a test agent use the test logger (#14996 )	2022-10-24 17:02:38 -05:00
Chris S. Kim	5e901bfa01	Remove invalid 1xx HTTP codes These tests started failing in go1.19, presumably due to support for valid 1xx responses being added. https://github.com/golang/go/issues/56346	2022-10-24 16:12:08 -04:00
Chris S. Kim	ae1646706f	Regenerate files according to 1.19.2 formatter	2022-10-24 16:12:08 -04:00
cskh	a5acb987fa	fix(peering): replicating wan address (#15108 ) * fix(peering): replicating wan address * add changelog * unit test	2022-10-24 15:44:57 -04:00
Iryna Shustava	a3a6743e0a	proxycfg: watch service-defaults config entries (#15025 ) To support Destinations on the service-defaults (for tproxy with terminating gateway), we need to now also make servers watch service-defaults config entries.	2022-10-24 12:50:28 -06:00
Chris S. Kim	06f583a7c2	Move oss-only test to its own file	2022-10-24 14:17:43 -04:00
R.B. Boyer	bf05547080	test: fix flaky TestHealthServiceNodes_NodeMetaFilter by waiting until the streaming subsystem has a valid grpc connection (#15019 ) Also potentially unflakes TestHealthIngressServiceNodes for similar reasons.	2022-10-24 13:09:53 -05:00
R.B. Boyer	87432a8dd4	chore: update golangci-lint to v1.50.1 (#15022 )	2022-10-24 11:48:02 -05:00
Venu Yanamandra	3dd12a2960	Update error message when restoring ENT snapshot in OSS (#15066 )	2022-10-24 11:40:26 -04:00
freddygv	483720a443	Return forbidden on permission denied This commit updates the establish endpoint to bubble up a 403 status code to callers when the establishment secret from the token is invalid. This is a signal that a new peering token must be generated.	2022-10-20 17:11:49 -06:00
Chris S. Kim	569c3bce88	Update expected encoding in test go-memdb was updated in v1.3.3 to make integers in indexes sortable, which changed how integers were encoded.	2022-10-20 14:32:42 -04:00
freddygv	f3548167fc	Use plain TaggedAddressWAN	2022-10-19 16:32:44 -06:00
freddygv	1b589ba964	Add unit test	2022-10-19 16:26:15 -06:00
cskh	c0dc93e5b8	fix: wan address isn't used by peering token	2022-10-19 16:33:25 -04:00
Nitya Dhanushkodi	598670e376	Remove ability to specify external addresses in GenerateToken endpoint (#14930 ) * Reverts "update generate token endpoint to take external addresses (#13844)" This reverts commit f47319b7c6b6e7c7dd720a5af927ad2d33fa536d.	2022-10-19 09:31:36 -07:00
Kyle Havlovitz	3c13cf0994	Merge pull request #15035 from hashicorp/vault-ttl-update-warn Warn instead of returning error when missing intermediate mount tune permissions	2022-10-18 15:41:52 -07:00
cskh	e18434bcb1	peering: skip registering duplicate node and check from the peer (#14994 ) * peering: skip register duplicate node and check from the peer * Prebuilt the nodes map and checks map to avoid repeated for loop * use key type to struct: node id, service id, and check id	2022-10-18 16:19:24 -04:00
Chris S. Kim	e4c20ec190	Refactor client RPC timeouts (#14965 ) Fix an issue where rpc_hold_timeout was being used as the timeout for non-blocking queries. Users should be able to tune read timeouts without fiddling with rpc_hold_timeout. A new configuration `rpc_read_timeout` is created. Refactor some implementation from the original PR 11500 to remove the misleading linkage between RPCInfo's timeout (used to retry in case of certain modes of failures) and the client RPC timeouts.	2022-10-18 15:05:09 -04:00
Kyle Havlovitz	0a968e53b5	Warn instead of returning an error when intermediate mount tune permission is missing	2022-10-18 12:01:25 -07:00
R.B. Boyer	0712e1a456	test: possibly fix flake in TestIntentionGetExact (#15021 ) Restructure test setup to be similar to TestAgent_ServerCertificate and see if that's enough to avoid flaking after join.	2022-10-18 10:51:20 -05:00
R.B. Boyer	9f41cc4a25	cache: prevent goroutine leak in agent cache (#14908 ) There is a bug in the error handling code for the Agent cache subsystem discovered: 1. NotifyCallback calls notifyBlockingQuery which calls getWithIndex in a loop (which backs off on-error up to 1 minute) 2. getWithIndex calls fetch if there’s no valid entry in the cache 3. fetch starts a goroutine which calls Fetch on the cache-type, waits for a while (again with backoff up to 1 minute for errors) and then calls fetch to trigger a refresh The end result being that every 1 minute notifyBlockingQuery spawns an ancestry of goroutines that essentially lives forever. This PR ensures that the goroutine started by `fetch` cancels any prior goroutine spawned by the same line for the same key. In isolated testing where a cache type was tweaked to indefinitely error, this patch prevented goroutine counts from skyrocketing.	2022-10-17 14:38:10 -05:00
R.B. Boyer	ca916eec32	ca: fix a masked bug in leaf cert generation that would not be notified of root cert rotation after the first one (#15005 ) In practice this was masked by #14956 and was only uncovered fixing the other bug. go test ./agent -run TestAgentConnectCALeafCert_goodNotLocal would fail when only #14956 was fixed.	2022-10-17 13:24:27 -05:00
Chris S. Kim	58c041eb6e	Merge pull request #13388 from deblasis/feature/health-checks_windows_service Feature: Health checks windows service	2022-10-17 09:26:19 -04:00
Dan Upton	90129919a8	proxycfg: fix goroutine leak when service is re-registered (#14988 ) Fixes a bug where we'd leak a goroutine in state.run when the given context was canceled while there was a pending update.	2022-10-17 11:31:10 +01:00
Kyle Havlovitz	096ca5e4b0	Extend tcp keepalive settings to work for terminating gateways as well	2022-10-14 17:05:46 -07:00
Kyle Havlovitz	f8e745315f	Update docs and add tcp_keepalive_probes setting	2022-10-14 17:05:46 -07:00
Kyle Havlovitz	526d49c6ff	Add TCP keepalive settings to proxy config for mesh gateways	2022-10-14 17:05:46 -07:00
Derek Menteer	25d3d244f0	Fix issue with incorrect method signature on test.	2022-10-14 11:04:57 -05:00
Freddy	bbf6b17e44	Merge pull request #14981 from hashicorp/peering/dial-through-gateways	2022-10-14 09:44:56 -06:00
Dan Upton	3b9297f95a	proxycfg: rate-limit delivery of config snapshots (#14960 ) Adds a user-configurable rate limiter to proxycfg snapshot delivery, with a default limit of 250 updates per second. This addresses a problem observed in our load testing of Consul Dataplane where updating a "global" resource such as a wildcard intention or the proxy-defaults config entry could starve the Raft or Memberlist goroutines of CPU time, causing general cluster instability.	2022-10-14 15:52:00 +01:00
Derek Menteer	6c355134e8	Add tests for peering state snapshots / restores.	2022-10-14 09:48:04 -05:00
Derek Menteer	27bbdced8d	Add test for ExportedServicesForAllPeersByName	2022-10-14 09:48:04 -05:00
Dan Upton	0a0534a094	perf: remove expensive reflection from xDS hot path (#14934 ) Replaces the reflection-based implementation of proxycfg's ConfigSnapshot.Clone with code generated by deep-copy. While load testing server-based xDS (for consul-dataplane) we discovered this method is extremely expensive. The ConfigSnapshot struct, directly or indirectly, contains a copy of many of the structs in the agent/structs package, which creates a large graph for copystructure.Copy to traverse at runtime, on every proxy reconfiguration.	2022-10-14 10:26:42 +01:00
freddygv	89596f13c4	Use split var in tests	2022-10-13 17:12:47 -06:00
freddygv	b4e48f0a70	Use split wildcard partition name This way OSS avoids passing a non-empty label, which will be rejected in OSS consul.	2022-10-13 16:55:28 -06:00
Freddy	909fc33271	Merge pull request #14935 from hashicorp/fix/alias-leak	2022-10-13 16:31:15 -06:00
freddygv	452dc2867c	Lint	2022-10-13 15:55:55 -06:00
Derek Menteer	092e5fd074	Reset wait on ensureServerAddrSubscription	2022-10-13 15:58:26 -05:00
freddygv	437a513d9b	Fix CA init error code	2022-10-13 14:58:11 -06:00
freddygv	37a765f8df	Update leader routine to maybe use gateways	2022-10-13 14:58:00 -06:00
freddygv	239f0e3084	Update peering establishment to maybe use gateways When peering through mesh gateways we expect outbound dials to peer servers to flow through the local mesh gateway addresses. Now when establishing a peering we get a list of dial addresses as a ring buffer that includes local mesh gateway addresses if the local DC is configured to peer through mesh gateways. The ring buffer includes the mesh gateway addresses first, but also includes the remote server addresses as a fallback. This fallback is present because it's possible that direct egress from the servers may be allowed. If not allowed then the leader will cycle back to a mesh gateway address through the ring. When attempting to dial the remote servers we retry up to a fixed timeout. If using mesh gateways we also have an initial wait in order to allow for the mesh gateways to configure themselves. Note that if we encounter a permission denied error we do not retry since that error indicates that the secret in the peering token is invalid.	2022-10-13 14:57:55 -06:00
malizz	27d0181806	increase protobuf size limit for cluster peering (#14976 )	2022-10-13 13:46:51 -07:00
Derek Menteer	ff01c11672	Address PR comments.	2022-10-13 14:11:02 -05:00
Derek Menteer	cc0a05ffa0	Disallow peering to the same cluster.	2022-10-13 14:11:02 -05:00
Derek Menteer	d47c9b446c	Prevent consul peer-exports by discovery chain.	2022-10-13 12:45:09 -05:00
Derek Menteer	ee49db9a2f	Prevent the "consul" service from being exported.	2022-10-13 12:45:09 -05:00
Derek Menteer	bfa4adbfce	Add remote peer partition and datacenter info.	2022-10-13 10:37:41 -05:00
Dan Upton	de7f380385	xds: properly merge central config for "agentless" services (#14962 )	2022-10-13 12:04:59 +01:00
Dan Upton	36a3d00f0d	bug: fix goroutine leaks caused by incorrect usage of `WatchCh` (#14916 ) memdb's `WatchCh` method creates a goroutine that will publish to the returned channel when the watchset is triggered or the given context is canceled. Although this is called out in its godoc comment, it's not obvious that this method creates a goroutine whose lifecycle you need to manage. In the xDS capacity controller, we were calling `WatchCh` on each iteration of the control loop, meaning the number of goroutines would grow on each autopilot event until there was catalog churn. In the catalog config source, we were calling `WatchCh` with the background context, meaning that the goroutine would keep running after the sync loop had terminated.	2022-10-13 12:04:27 +01:00
Hans Hasselberg	56580d6fa6	adding configuration option cloud.scada_address (#14936 ) * adding scada_address * config tests * add changelog entry	2022-10-13 11:31:28 +02:00
Paul Glass	be1a4438a9	Add consul.xds.server.streamStart metric (#14957 ) This adds a new consul.xds.server.streamStart metric to measure the time taken to first generate xDS resources after an xDS stream is opened.	2022-10-12 14:17:58 -05:00
Riddhi Shah	474d9cfcdc	Service http checks data source for agentless proxies (#14924 ) Adds another datasource for proxycfg.HTTPChecks, for use on server agents. Typically these checks are performed by local client agents and there is no equivalent of this in agentless (where servers configure consul-dataplane proxies). Hence, the data source is mostly a no-op on servers but in the case where the service is present within the local state, it delegates to the cache data source.	2022-10-12 07:49:56 -07:00
Freddy	4cf0bf4865	Merge pull request #14958 from hashicorp/peering/nonce	2022-10-12 08:18:15 -06:00
freddygv	4d1e7c4cbb	Actually track nonce in test	2022-10-12 07:50:17 -06:00
Derek Menteer	00312bcf57	Fix incorrect backoff-wait logic.	2022-10-12 08:01:10 -05:00
freddygv	c9d171c031	Add basic nonce management This commit adds a monotonically increasing nonce to include in peering replication response messages. Every ack/nack from the peer handling a response will include this nonce, allowing to correlate the ack/nack with a specific resource. At the moment nothing is done with the nonce when it is received. In the future we may want to add functionality such as retries on NACKs, depending on the class of error.	2022-10-11 19:02:04 -06:00
Paul Glass	8cf430140a	gRPC server metrics (#14922 ) * Move stats.go from grpc-internal to grpc-middleware * Update grpc server metrics with server type label * Add stats test to grpc-external * Remove global metrics instance from grpc server tests	2022-10-11 17:00:32 -05:00
cskh	45278cb69e	fix(peering): add missing grpc_tls_port for server address reconciliation (#14944 )	2022-10-11 10:56:29 -04:00
freddygv	9f0ab69aef	Fix alias check leak Previously when alias check was removed it would not be stopped nor cleaned up from the associated aliasChecks map. This means that any time an alias check was deregistered we would leak a goroutine for CheckAlias.run() because the stopCh would never be closed. This issue mostly affects service mesh deployments on platforms where the client agent is mostly static but proxy services come and go regularly, since by default sidecars are registered with an alias check.	2022-10-10 16:42:29 -06:00
James Oulman	a8695c88d4	Configure Envoy alpn_protocols based on service protocol (#14356 ) * Configure Envoy alpn_protocols based on service protocol * define alpnProtocols in a more standard way * http2 protocol should be h2 only * formatting * add test for getAlpnProtocol() * create changelog entry * change scope is connect-proxy * ignore errors on ParseProxyConfig; fixes linter * add tests for grpc and http2 public listeners * remove newlines from PR * Add alpn_protocol configuration for ingress gateway * Guard against nil tlsContext * add ingress gateway w/ TLS tests for gRPC and HTTP2 * getAlpnProtocols: add TCP protocol test * add tests for ingress gateway with grpc/http2 and per-listener TLS config * add tests for ingress gateway with grpc/http2 and per-listener TLS config * add Gateway level TLS config with mixed protocol listeners to validate ALPN * update changelog to include ingress-gateway * add http/1.1 to http2 ALPN * go fmt * fix test on custom-trace-listener	2022-10-10 13:13:56 -07:00
freddygv	55b5c1a073	Fixup test	2022-10-10 13:20:14 -06:00
Chris S. Kim	7f48033d0b	Fix nil pointer	2022-10-10 13:20:14 -06:00
Chris S. Kim	9d4fb0445a	Include stream-related information in peering endpoints	2022-10-10 13:20:14 -06:00
Paul Glass	a3fccf5e5b	Merge central config for GetEnvoyBootstrapParams (#14869 ) This fixes GetEnvoyBootstrapParams to merge in proxy-defaults and service-defaults. Co-authored-by: Dan Upton <daniel@floppy.co>	2022-10-10 12:40:27 -05:00
Freddy	8d93f120ea	Merge pull request #14796 from hashicorp/peering/use-connect-ca	2022-10-07 10:37:37 -06:00
freddygv	ae9b3eb662	Fixup test	2022-10-07 09:34:16 -06:00
freddygv	6ef8d329d2	Require Connect and TLS to generate peering tokens By requiring Connect and a gRPC TLS listener we can automatically configure TLS for all peering control-plane traffic.	2022-10-07 09:06:29 -06:00
freddygv	a21e5799f7	Use internal server certificate for peering TLS A previous commit introduced an internally-managed server certificate to use for peering-related purposes. Now the peering token has been updated to match that behavior: - The server name matches the structure of the server cert - The CA PEMs correspond to the Connect CA Note that if Connect is disabled, and by extension the Connect CA, we fall back to the previous behavior of returning the manually configured certs and local server SNI. Several tests were updated to use the gRPC TLS port since they enable Connect by default. This means that the peering token will embed the Connect CA, and the dialer will expect a TLS listener.	2022-10-07 09:05:32 -06:00
freddygv	1c696922fe	Simplify mgw watch mgmt	2022-10-07 08:54:37 -06:00
freddygv	b67d001b2c	Use existing query options to build ctx	2022-10-07 08:46:53 -06:00
DanStough	df94470e76	feat: xDS updates for peerings control plane through mesh gw	2022-10-07 08:46:42 -06:00
Eric Haberkorn	2f08fab317	Make the mesh gateway changes to allow `local` mode for cluster peering data plane traffic (#14817 ) Make the mesh gateway changes to allow `local` mode for cluster peering data plane traffic	2022-10-06 09:54:14 -04:00
cskh	53ff317b01	fix: missing UDP field in checkType (#14885 ) * fix: missing UDP field in checkType * Add changelog * Update doc	2022-10-05 15:57:21 -04:00
Derek Menteer	fbee1272e7	Fix explicit tproxy listeners with discovery chains. (#14751 ) Fix explicit tproxy listeners with discovery chains.	2022-10-05 14:38:25 -05:00
Alex Oskotsky	4d9309327f	Add the ability to retry on reset connection to service-routers (#12890 )	2022-10-05 13:06:44 -04:00
John Murret	08203ace4a	Upgrade serf to v0.10.1 and memberlist to v0.5.0 to get memberlist size metrics and broadcast queue depth metric (#14873 ) * updating to serf v0.10.1 and memberlist v0.5.0 to get memberlist size metrics and memberlist broadcast queue depth metric * update changelog * update changelog * correcting changelog * adding "QueueCheckInterval" for memberlist to test * updating integration test containers to grab latest api	2022-10-04 17:51:37 -06:00
Evan Culver	42423ffce2	connect: Bump Envoy 1.20 to 1.20.7, 1.21 to 1.21.5 and 1.22 to 1.22.5 (#14831 )	2022-10-04 13:15:01 -07:00
Eric Haberkorn	2178e38204	Rename `PeerName` to `Peer` on prepared queries and exported services (#14854 )	2022-10-04 14:46:15 -04:00
Freddy	89141256c7	Merge pull request #14734 from hashicorp/NET-643-update-mesh-gateway-envoy-config-for-inbound-peering-control-plane-traffic	2022-10-03 12:54:11 -06:00
freddygv	0d61aa5d37	Update xds generation for peering over mesh gws This commit adds the xDS resources needed for INBOUND traffic from peer clusters: - 1 filter chain for all inbound peering requests. - 1 cluster for all inbound peering requests. - 1 endpoint per voting server with the gRPC TLS port configured. There is one filter chain and cluster because unlike with WAN federation, peer clusters will not attempt to dial individual servers. Peer clusters will only dial the local mesh gateway addresses.	2022-10-03 12:42:27 -06:00
freddygv	2c5caec97c	Share mgw addrs in peering stream if needed This commit adds handling so that the replication stream considers whether the user intends to peer through mesh gateways. The subscription will return server or mesh gateway addresses depending on the mesh configuration setting. These watches can be updated at runtime by modifying the mesh config entry.	2022-10-03 11:42:20 -06:00
freddygv	17463472b7	Return mesh gateway addrs if peering through mgw	2022-10-03 11:35:10 -06:00
chappie	f49332a151	Merge pull request #14811 from hashicorp/chappie/dns Add DNS gRPC proxying support	2022-10-03 08:02:48 -07:00
Chris Chapman	1b24aafb23	Making suggested comments	2022-09-30 15:03:33 -07:00
Chris Chapman	399fafb679	Making suggested changes	2022-09-30 14:51:12 -07:00
Chris Chapman	8e44a8c644	Update comment	2022-09-30 09:35:01 -07:00
DanStough	16fe27c9b8	chore: fix flakey scada provider test	2022-09-30 11:56:40 -04:00
Chris Chapman	c4c5f900e0	Bind a dns mux handler to gRPC proxy	2022-09-29 21:44:45 -07:00
Chris Chapman	175e6e56f9	Adding grpc handler for dns proxy	2022-09-29 21:19:51 -07:00
Eric Haberkorn	5fd1e6daea	Add exported services event to cluster peering replication. (#14797 )	2022-09-29 15:37:19 -04:00
Ashwin Venkatesh	ddcd3e06e7	bug: watch local mesh gateways in non-default partitions with agentless (#14799 )	2022-09-29 13:19:04 -04:00
cskh	4ece020bf1	feat(ingress gateway: support configuring limits in ingress-gateway c… (#14749 ) * feat(ingress gateway: support configuring limits in ingress-gateway config entry - a new Defaults field with max_connections, max_pending_connections, max_requests is added to ingress gateway config entry - new field max_connections, max_pending_connections, max_requests in individual services to overwrite the value in Default - added unit test and integration test - updated doc Co-authored-by: Chris S. Kim <ckim@hashicorp.com> Co-authored-by: Jeff Boruszak <104028618+boruszak@users.noreply.github.com> Co-authored-by: Dan Stough <dan.stough@hashicorp.com>	2022-09-28 14:56:46 -04:00
malizz	5c470b28dd	Support Stale Queries for Trust Bundle Lookups (#14724 ) * initial commit * add tags, add conversations * add test for query options utility functions * update previous tests * fix test * don't error out on empty context * add changelog * update decode config	2022-09-28 09:56:59 -07:00
Eric Haberkorn	e80b7068a6	Enable outbound peered requests to go through local mesh gateway (#14763 )	2022-09-27 09:49:28 -04:00
Nick Ethier	5e4b3ef5d4	add HCP integration component (#14723 ) * add HCP integration * lint: use non-deprecated logging interface	2022-09-26 14:58:15 -04:00
Derek Menteer	d9e42b0f1c	Add envoy connection balancing. (#14616 ) Add envoy connection balancing config.	2022-09-26 11:29:06 -05:00
Chris S. Kim	7ec8a0667a	Add new internal endpoint to list exported services to a peer	2022-09-23 09:43:56 -04:00
freddygv	520507232f	Manage local server watches depending on mesh cfg Routing peering control plane traffic through mesh gateways can be enabled or disabled at runtime with the mesh config entry. This commit updates proxycfg to add or cancel watches for local servers depending on this central config. Note that WAN federation over mesh gateways is determined by a service metadata flag, and any updates to the gateway service registration will force the creation of a new snapshot. If enabled, WAN-fed over mesh gateways will trigger a local server watch on initialize(). Because of this we will only add/remove server watches if WAN federation over mesh gateways is disabled.	2022-09-22 19:32:10 -06:00
Alessandro De Blasis	6e99434215	fix(check): added missing OSService props	2022-09-21 13:10:21 +01:00
Alessandro De Blasis	6471184754	fix(checks): os_service OK message in output	2022-09-21 09:27:33 +01:00
Alessandro De Blasis	7f7c320746	fix(checks): os_service lifecycle bugfix	2022-09-21 09:26:47 +01:00
Alessandro De Blasis	3b9061ab5b	fix(agent): uninitialized map panic error	2022-09-21 09:25:54 +01:00
malizz	a3fc665eef	increase the size of txn to support vault (#14599 ) * increase the size of txn to support vault * add test, revert change to acl endpoint * add changelog * update test, add passing test case * Update .changelog/14599.txt Co-authored-by: Freddy <freddygv@users.noreply.github.com> Co-authored-by: Freddy <freddygv@users.noreply.github.com>	2022-09-19 09:07:19 -07:00
freddygv	8166a870b6	Add awareness of server mode to TLS configurator Previously the TLS configurator would default to presenting auto TLS certificates as client certificates. Server agents should not have this behavior and should instead present the manually configured certs. The autoTLS certs for servers are exclusively used for peering and should not be used as the default for outbound communication.	2022-09-16 17:57:10 -06:00
freddygv	107e4d8494	Test fixes - Pulls in CLI test fix from main - Updates psutils to fix TestAgent_Host on M1 Mac	2022-09-16 17:57:10 -06:00
freddygv	0c3853a2d0	Add server certificate manager This certificate manager will request a leaf certificate for server agents and then keep them up to date.	2022-09-16 17:57:10 -06:00
freddygv	ef99b30cb8	Generate ACL token for server management This commit introduces a new ACL token used for internal server management purposes. It has a few key properties: - It has unlimited permissions. - It is persisted through Raft as System Metadata rather than in the ACL tokens table. This is to avoid users seeing or modifying it. - It is re-generated on leadership establishment.	2022-09-16 17:54:34 -06:00
freddygv	a33a014b9c	Add handling in agent cache for server leaf certs	2022-09-16 17:54:34 -06:00
Kyle Havlovitz	40da079f18	Merge pull request #14598 from hashicorp/root-removal-fix connect/ca: Don't discard old roots on primaryInitialize	2022-09-15 14:36:01 -07:00
Kyle Havlovitz	fe10009a12	connect/ca: don't discard old roots on primaryInitialize	2022-09-15 12:59:09 -07:00
Gabriel Santos	09c00ff39a	Middleware: `RequestRecorder` reports calls below 1ms as decimal value (#12905 ) * Typos * Test failing * Convert values <1ms to decimal * Fix test * Update docs and test error msg * Applied suggested changes to test case * Changelog file and suggested changes * Update .changelog/12905.txt Co-authored-by: Chris S. Kim <kisunji92@gmail.com> * suggested change - start duration with microseconds instead of nanoseconds * fix error * suggested change - floats Co-authored-by: alex <8968914+acpana@users.noreply.github.com> Co-authored-by: Chris S. Kim <kisunji92@gmail.com>	2022-09-15 13:04:37 -04:00
Daniel Graña	13ac6356a8	[BUGFIX] Do not use interval as timeout (#14619 ) Do not use interval as timeout	2022-09-15 12:39:48 -04:00
Evan Culver	aa40adf97e	connect: Bump latest Envoy to 1.23.1 in test matrix (#14573 )	2022-09-14 13:20:16 -07:00
DanStough	f9b4fa17f1	fix(peering): generate token metrics only for leader	2022-09-14 11:37:30 -04:00
DanStough	b37a2ba889	feat(peering): validate server name conflicts on establish	2022-09-14 11:37:30 -04:00
Kyle Havlovitz	ea4d95a5c6	Merge pull request #14516 from hashicorp/ca-ttl-fixes Fix inconsistent TTL behavior in CA providers	2022-09-13 16:07:36 -07:00
Kyle Havlovitz	33e616987c	Update intermediate pki mount/role when reconfiguring Vault provider	2022-09-13 15:42:26 -07:00
Kyle Havlovitz	1ded025400	connect/ca: Clarify behavior around IntermediateCertTTL in CA config	2022-09-13 15:42:26 -07:00
DanStough	fca4042bd9	feat: add PeerThroughMeshGateways to mesh config	2022-09-13 17:19:54 -04:00
Derek Menteer	5d1487e167	Add CSR check for number of URIs. (#14579 ) Add CSR check for number of URIs.	2022-09-13 14:21:47 -05:00
Derek Menteer	cfcd9f2a2c	Add input validation for auto-config JWT authorization checks.	2022-09-13 11:16:36 -05:00
cskh	6196be1f98	Config-entry: Support proxy config in service-defaults (#14395 ) * Config-entry: Support proxy config in service-defaults * Update website/content/docs/connect/config-entries/service-defaults.mdx Co-authored-by: Jeff Boruszak <104028618+boruszak@users.noreply.github.com>	2022-09-12 10:41:58 -04:00
Eric Haberkorn	1490eedfbc	Implement Cluster Peering Redirects (#14445 ) implement cluster peering redirects	2022-09-09 13:58:28 -04:00
skpratt	cf6c1d9388	add non-double-prefixed metrics (#14193 )	2022-09-09 12:13:43 -05:00
skpratt	1ae31a520a	PR #14057 follow up fix: service id parsing from sidecar id (#14541 ) * fix service id parsing from sidecar id * simplify suffix trimming	2022-09-09 09:47:10 -05:00
Dan Upton	9fe6c33c0d	xDS Load Balancing (#14397 ) Prior to #13244, connect proxies and gateways could only be configured by an xDS session served by the local client agent. In an upcoming release, it will be possible to deploy a Consul service mesh without client agents. In this model, xDS sessions will be handled by the servers themselves, which necessitates load-balancing to prevent a single server from receiving a disproportionate amount of load and becoming overwhelmed. This introduces a simple form of load-balancing where Consul will attempt to achieve an even spread of load (xDS sessions) between all healthy servers. It does so by implementing a concurrent session limiter (limiter.SessionLimiter) and adjusting the limit according to autopilot state and proxy service registrations in the catalog. If a server is already over capacity (i.e. the session limit is lowered), Consul will begin draining sessions to rebalance the load. This will result in the client receiving a `RESOURCE_EXHAUSTED` status code. It is the client's responsibility to observe this response and reconnect to a different server. Users of the gRPC client connection brokered by the consul-server-connection-manager library will get this for free. The rate at which Consul will drain sessions to rebalance load is scaled dynamically based on the number of proxies in the catalog.	2022-09-09 15:02:01 +01:00
Derek Menteer	8efe862b76	Merge branch 'main' of github.com:hashicorp/consul into derekm/split-grpc-ports	2022-09-08 14:53:08 -05:00
Derek Menteer	75dec4c31d	Remove rebuilding grpc server.	2022-09-08 13:45:44 -05:00
Derek Menteer	6aaf1c6035	Various cleanups.	2022-09-08 10:51:50 -05:00
Chris S. Kim	331c756471	Reuse http.DefaultTransport in UIMetricsProxy (#14521 ) http.Transport keeps a pool of connections and should be reused when possible. We instantiate a new http.DefaultTransport for every metrics request, making large numbers of concurrent requests inefficiently spin up new connections instead of reusing open ones.	2022-09-08 11:02:05 -04:00
Chris S. Kim	9b5c5c5062	Merge pull request #14285 from hashicorp/NET-638-push-server-address-updates-to-the-peer peering: Subscribe to server address changes and push updates to peers	2022-09-07 09:30:45 -04:00
skpratt	02559085ad	move port and default check logic to locked step (#14057 )	2022-09-06 19:35:31 -05:00
Freddy	a7f38384ae	Add SpiffeID for Consul server agents (#14485 ) Co-authored-by: Eric Haberkorn <erichaberkorn@gmail.com> By adding a SpiffeID for server agents, servers can now request a leaf certificate from the Connect CA. This new Spiffe ID has a key property: servers are identified by their datacenter name and trust domain. All servers that share these attributes will share a ServerURI. The aim is to use these certificates to verify the server name of ANY server in a Consul datacenter.	2022-09-06 17:58:13 -06:00
Daniel Upton	128055c44c	proxycfg-glue: server-local implementation of IntentionUpstreamsDestination This is the OSS portion of enterprise PR 2463. Generalises the serverIntentionUpstreams type to support matching on a service or destination.	2022-09-06 23:27:25 +01:00
Daniel Upton	4b76d8a8ff	proxycfg-glue: server-local implementation of InternalServiceDump This is the OSS portion of enterprise PR 2489. This PR introduces a server-local implementation of the proxycfg.InternalServiceDump interface that sources data from a blocking query against the server's state store. For simplicity, it only implements the subset of the Internal.ServiceDump RPC handler actually used by proxycfg - as such the result type has been changed to IndexedCheckServiceNodes to avoid confusion.	2022-09-06 23:27:25 +01:00
Daniel Upton	8cd6c9f95e	proxycfg-glue: server-local implementation of ResolvedServiceConfig This is the OSS portion of enterprise PR 2460. Introduces a server-local implementation of the proxycfg.ResolvedServiceConfig interface that sources data from a blocking query against the server's state store. It moves the service config resolution logic into the agent/configentry package so that it can be used in both the RPC handler and data source. I've also done a little re-arranging and adding comments to call out data sources for which there is to be no server-local equivalent.	2022-09-06 23:27:25 +01:00
Derek Menteer	b50bc443f3	Merge branch 'main' of github.com:hashicorp/consul into derekm/split-grpc-ports	2022-09-06 10:51:04 -05:00
Derek Menteer	d771725a14	Add kv txn get-not-exists operation.	2022-09-06 10:28:59 -05:00
Chris S. Kim	0148263780	PR feedback on terminated state checking	2022-09-06 10:28:20 -04:00
Chris S. Kim	9ad8bf67a5	Add testcase for parsing grpc_port	2022-09-06 10:17:44 -04:00
Kyle Havlovitz	a484a759c8	Merge pull request #14429 from hashicorp/ca-prune-intermediates Prune old expired intermediate certs when appending a new one	2022-09-02 15:34:33 -07:00

4831 Commits