open-consul

Commit Graph

Author	SHA1	Message	Date
freddygv	9e0958f1d2	Clean up chains separately from their watches	2021-12-13 18:56:14 -07:00
freddygv	ea26a7b7cf	Store intention upstreams in snapshot	2021-12-13 18:56:13 -07:00
R.B. Boyer	72a81cfc4a	proxycfg: ensure all of the watches are canceled if they are cancelable (#11824)	2021-12-13 15:56:17 -06:00
R.B. Boyer	3dccd14d31	proxycfg: use external addresses in tproxy when crossing partition boundaries (#11823)	2021-12-13 14:34:49 -06:00
R.B. Boyer	a0156785dd	various partition related todos (#11822)	2021-12-13 11:43:33 -06:00
R.B. Boyer	83bf7ab3ff	re-run gofmt on 1.17 (#11579) This should let freshly recompiled golangci-lint binaries using Go 1.17 pass 'make lint'	2021-11-16 12:04:01 -06:00
freddygv	f33eae6fe1	Update proxycfg for ingress service partitions	2021-11-12 14:33:31 -07:00
Freddy	eb2b40b22d	Update filter chain creation for sidecar/ingress listeners (#11245 ) The duo of `makeUpstreamFilterChainForDiscoveryChain` and `makeListenerForDiscoveryChain` were really hard to reason about, and led to concealing a bug in their branching logic. There were several issues here: - They tried to accomplish too much: determining filter name, cluster name, and whether RDS should be used. - They embedded logic to handle significantly different kinds of upstream listeners (passthrough, prepared query, typical services, and catch-all) - They needed to coalesce different data sources (Upstream and CompiledDiscoveryChain) Rather than handling all of those tasks inside of these functions, this PR pulls out the RDS/clusterName/filterName logic. This refactor also fixed a bug with the handling of [UpstreamDefaults](https://www.consul.io/docs/connect/config-entries/service-defaults#defaults). These defaults get stored as UpstreamConfig in the proxy snapshot with a DestinationName of "", since they apply to all upstreams. However, this wildcard destination name must not be used when creating the name of the associated upstream cluster. The coalescing logic in the original functions here was in some situations creating clusters with a `.` prefix, which is not a valid destination.	2021-11-09 14:43:51 -07:00
Daniel Upton	caa5b5a5a6	xds: prefer fed state gateway definitions if they're fresher (#11522 ) Fixes an issue described in #10132, where if two DCs are WAN federated over mesh gateways, and the gateway in the non-primary DC is terminated and receives a new IP address (as is commonly the case when running them on ephemeral compute instances) the primary DC is unable to re-establish its connection until the agent running on its own gateway is restarted. This was happening because we always preferred gateways discovered by the `Internal.ServiceDump` RPC (which would fail because there's no way to dial the remote DC) over those discovered in the federation state, which is replicated as long as the primary DC's gateway is reachable.	2021-11-09 16:45:36 +00:00
freddygv	ecccf22fd7	Exclude default partition from GatewayKey string This will behave the way we handle SNI and SPIFFE IDs, where the default partition is excluded. Excluding the default ensures that we don't attempt to compare default.dc2 to dc2 in OSS.	2021-11-01 14:45:52 -06:00
freddygv	d944e6ae3a	Update GatewayKeys deduplication Federation states data is only keyed on datacenter, so it cannot be directly compared against keys for gateway groups.	2021-11-01 13:58:53 -06:00
freddygv	ce43e8cf99	Store GatewayKey in proxycfg snapshot for re-use	2021-11-01 13:58:53 -06:00
freddygv	6657c88296	Update locality check in proxycfg	2021-11-01 13:58:53 -06:00
freddygv	40271beb38	Fixup partitions assertion	2021-10-27 11:15:25 -06:00
freddygv	9769b31641	Move the exportingpartitions constant to enterprise	2021-10-27 11:15:25 -06:00
freddygv	0391a65772	Replace default partition check	2021-10-27 11:15:25 -06:00
freddygv	ee45ac9dc5	PR comments	2021-10-27 11:15:25 -06:00
freddygv	8b5a9369eb	Account for partitions in xds gen for mesh gw This commit avoids skipping gateways in remote partitions of the local DC when generating listeners/clusters/endpoints.	2021-10-27 11:15:25 -06:00
freddygv	4f0432be5e	Update xds pkg to account for GatewayKey	2021-10-27 09:03:56 -06:00
freddygv	f3f15640a9	Update mesh gateway proxy watches for partitions This commit updates mesh gateway watches for cross-partitions communication. * Mesh gateways are keyed by partition and datacenter. * Mesh gateways will now watch gateways in partitions that export services to their partition. * Mesh gateways in non-default partitions will not have cross-datacenter watches. They are not involved in traditional WAN federation.	2021-10-27 09:03:56 -06:00
freddygv	1bade08f91	Replace Split with SplitN	2021-10-26 23:36:01 -06:00
freddygv	3966677aaf	Finish removing useInDatacenter	2021-10-26 23:36:01 -06:00
freddygv	ea311d2e47	Configure sidecars to watch gateways in partitions Previously the datacenter of the gateway was the key identifier, now it is the datacenter and partition. When dialing services in other partitions or datacenters we now watch the appropriate partition.	2021-10-26 23:35:37 -06:00
Paul Banks	5c8702b182	Add support for enabling connect-based ingress TLS per listener.	2021-10-19 20:58:28 +01:00
Daniel Nephin	1bc07c5166	structs: rename the last helper method. This one gets used a bunch, but we can rename it to make the behaviour more obvious.	2021-09-29 11:48:38 -04:00
Daniel Nephin	17652227f6	structs: remove two methods that were only used once each. These methods only called a single function. Wrappers like this end up making code harder to read because it adds extra ways of doing things. We already have many helper functions for constructing these types, we don't need additional methods.	2021-09-29 11:47:03 -04:00
Paul Banks	f4f0793a10	Minor PR typo and cleanup fixes	2021-09-23 10:13:19 +01:00
Paul Banks	4cc1ccf892	Revert abandoned changes to proxycfg for Ent test consistency	2021-09-23 10:13:19 +01:00
Paul Banks	9422e4ebc7	Handle namespaces in route names correctly; add tests for enterprise	2021-09-23 10:09:11 +01:00
Paul Banks	8548e15f1b	Update proxycfg to hold more ingress config state	2021-09-23 10:08:02 +01:00
Paul Banks	0e410a1b1f	Add ingress-gateway config for SDS	2021-09-23 10:08:02 +01:00
freddygv	661f520841	Fixup proxycfg tproxy case	2021-09-16 15:05:28 -06:00
freddygv	8a9bf3748c	Account for partitions in ixn match/decision	2021-09-16 14:39:01 -06:00
freddygv	7927a97c2f	Fixup manager tests	2021-09-15 17:24:05 -06:00
freddygv	0cdcbbb4c9	Pass partition to intention match query	2021-09-15 17:23:52 -06:00
Paul Banks	1dd1683ed9	Header manip for split legs plumbing	2021-09-10 21:09:24 +01:00
Paul Banks	f70f7b2389	Header manip for service-router plumbed through	2021-09-10 21:09:24 +01:00
Paul Banks	fc2ed4cdf4	Ingress gateway header manip plumbing	2021-09-10 21:09:24 +01:00
Dhia Ayachi	96d7842118	partition discovery chains (#10983) * partition discovery chains * fix default partition for OSS	2021-09-07 16:29:32 -04:00
Dhia Ayachi	eb19271fd7	add partition to SNI when partition is non default (#10917)	2021-09-01 10:35:39 -04:00
freddygv	ed79e38a36	Update comment for test function	2021-08-20 17:40:33 -06:00
freddygv	b1050e4229	Update prepared query cluster SAN validation Previously SAN validation for prepared queries was broken because we validated against the name, namespace, and datacenter for prepared queries. However, prepared queries can target: - Services with a name that isn't their own - Services in multiple datacenters This means that the SpiffeID to validate needs to be based on the prepared query endpoints, and not the prepared query's upstream definition. This commit updates prepared query clusters to account for that.	2021-08-20 17:40:33 -06:00
freddygv	1f192eb7d9	Fixup proxy config test fixtures - The TestNodeService helper created services with the fixed name "web", and now that name is overridable. - The discovery chain snapshot didn't have prepared query endpoints so the endpoints tests were missing data for prepared queries	2021-08-20 17:38:57 -06:00
Dhia Ayachi	f766b6dff7	oss portion of ent #1069 (#10883)	2021-08-20 12:57:45 -04:00
R.B. Boyer	61f1c01b83	agent: ensure that most agent behavior correctly respects partition configuration (#10880)	2021-08-19 15:09:42 -05:00
Daniel Nephin	7c865d03ac	proxycfg: Lookup the agent token as a default When no ACL token is provided with the service registration.	2021-08-12 15:51:34 -04:00
Daniel Nephin	d189524e71	proxycfg: Add a test to show the bug When a token is not provided at registration, the agent token is not being used.	2021-08-12 15:47:59 -04:00
Freddy	57ca0ed480	Log the correlation ID when blocking queries fire (#10689) Knowing that blocking queries are firing does not provide much information on its own. If we know the correlation IDs we can piece together which parts of the snapshot have been populated. Some of these responses might be empty from the blocking query timing out. But if they're returning quickly I think we can reasonably assume they contain data.	2021-07-23 16:36:17 -06:00
R.B. Boyer	62ac98b564	agent/structs: add a bunch more EnterpriseMeta helper functions to help with partitioning (#10669)	2021-07-22 13:20:45 -05:00
freddygv	b6b42c34dc	Add TODOs about partition handling	2021-07-14 22:21:55 -06:00
freddygv	a7de87e95b	Validate SANs for passthrough clusters and failovers	2021-07-14 22:21:55 -06:00
Daniel Nephin	6a61c5d772	proxycfg: remove unused method This method was accidentally re-introduced in an earlier rebase. It was removed in ed1082510dc80523b1f2a3a740fa5a13c77594f9 as part of the tproxy work.	2021-06-21 15:54:40 -04:00
Daniel Nephin	41bf0670a8	proxycfg: move each handler into a separate file There is no interaction between these handlers, so splitting them into separate files makes it easier to discover the full implementation of each kindHandler.	2021-06-21 15:48:40 -04:00
Daniel Nephin	96896409d6	Merge pull request #9489 from hashicorp/dnephin/proxycfg-state-2 proxycfg: split state into a handler for each kind	2021-06-18 13:57:28 -04:00
Nitya Dhanushkodi	ffbbe9e73f	proxycfg: reference to entry in map should not panic	2021-06-17 11:49:04 -07:00
Daniel Nephin	b7293242f1	Replace type conversion with embedded structs	2021-06-17 13:23:35 -04:00
Daniel Nephin	40ff895927	proxycfg: split state into kind-specific types This commit extracts all the kind-specific logic into handler types, and keeps the generic parts on the state struct. This change should make it easier to add new kinds, and see the implementation of each kind more clearly.	2021-06-16 14:04:01 -04:00
Daniel Nephin	b57f03feff	proxycfg: unmethod hostnameEndpoints the method receiver can be replaced by the first argument. This will allow us to extract more from the state struct in the future.	2021-06-16 14:03:30 -04:00
Daniel Nephin	f2ae6cb47c	Remove duplicate import because two PRs crossed paths.	2021-06-16 13:19:54 -04:00
Daniel Nephin	b40174ccf2	Merge pull request #9466 from hashicorp/dnephin/proxycfg-state proxycfg: prepare state for split by kind	2021-06-16 13:14:26 -04:00
Nitya Dhanushkodi	08ed3edf71	proxycfg: Ensure that endpoints for explicit upstreams in other datacenters are watched in transparent mode (#10391) Co-authored-by: Freddy Vallenilla <freddy@hashicorp.com>	2021-06-15 11:00:26 -07:00
Daniel Nephin	cbcc1a3a86	proxycfg: extract two types from state struct These two new struct types will allow us to make polymorphic handler for each kind, instead of having all the logic for each proxy kind on the state struct.	2021-06-10 17:42:17 -04:00
Daniel Nephin	b99da95e70	proxycfg: pass context around where it is needed context.Context should never be stored on a struct (as it says in the godoc) because it is easy to end up with the wrong context when it is stored. Also see https://blog.golang.org/context-and-structs This change is also in preparation for splitting state into kind-specific handlers so that the implementation of each kind is grouped together.	2021-06-10 17:34:50 -04:00
Freddy	61ae2995b7	Add flag for transparent proxies to dial individual instances (#10329)	2021-06-09 14:34:17 -06:00
freddygv	abcfb2aeda	Ensure entmeta is encoded in test correlationID	2021-05-05 12:31:23 -06:00
Daniel Nephin	55f620d636	Merge pull request #10155 from hashicorp/dnephin/config-entry-remove-fields config-entry: remove Kind and Name field from Mesh config entry	2021-05-04 17:27:56 -04:00
Mark Anderson	c3510e6d47	Add tests for xds/listeners Signed-off-by: Mark Anderson <manderson@hashicorp.com>	2021-05-04 12:41:43 -07:00
Mark Anderson	626b27a874	Continue working through proxy and agent Rework/listeners, rename makeListener Refactor, tests pass Signed-off-by: Mark Anderson <manderson@hashicorp.com>	2021-05-04 12:41:43 -07:00
Freddy	ec38cf3206	Fixup discovery chain handling in transparent mode (#10168 ) Co-authored-by: R.B. Boyer <4903+rboyer@users.noreply.github.com> Previously we would associate the address of a discovery chain target with the discovery chain's filter chain. This was broken for a few reasons: - If the upstream is a virtual service, the client proxy has no way of dialing it because virtual services are not targets of their discovery chains. The targets are distinct services. This is addressed by watching the endpoints of all upstream services, not just their discovery chain targets. - If multiple discovery chains resolve to the same target, that would lead to multiple filter chains attempting to match on the target's virtual IP. This is addressed by only matching on the upstream's virtual IP. NOTE: this implementation requires an intention to the redirecting virtual service and not just to the final destination. This is how we can know that the virtual service is an upstream to watch. A later PR will look into traversing discovery chains when computing upstreams so that intentions are only required to the discovery chain targets.	2021-05-04 08:45:19 -06:00
Daniel Nephin	bf4c289804	config-entry: remove Kind and Name field from Mesh config entry No config entry needs a Kind field. It is only used to determine the Go type to target. As we introduce new config entries (like this one) we can remove the kind field and have the GetKind method return the single supported value. In this case (similar to proxy-defaults) the Name field is also unnecessary. We always use the same value. So we can omit the name field entirely.	2021-04-29 17:11:21 -04:00
R.B. Boyer	91bee6246f	Support Incremental xDS mode (#9855 ) This adds support for the Incremental xDS protocol when using xDS v3. This is best reviewed commit-by-commit and will not be squashed when merged. Union of all commit messages follows to give an overarching summary: xds: exclusively support incremental xDS when using xDS v3 Attempts to use SoTW via v3 will fail, much like attempts to use incremental via v2 will fail. Work around a strange older envoy behavior involving empty CDS responses over incremental xDS. xds: various cleanups and refactors that don't strictly concern the addition of incremental xDS support Dissolve the connectionInfo struct in favor of per-connection ResourceGenerators instead. Do a better job of ensuring the xds code uses a well configured logger that accurately describes the connected client. xds: pull out checkStreamACLs method in advance of a later commit xds: rewrite SoTW xDS protocol tests to use protobufs rather than hand-rolled json strings In the test we very lightly reuse some of the more boring protobuf construction helper code that is also technically under test. The important thing of the protocol tests is testing the protocol. The actual inputs and outputs are largely already handled by the xds golden output tests now so these protocol tests don't have to do double-duty. This also updates the SoTW protocol test to exclusively use xDS v2 which is the only variant of SoTW that will be supported in Consul 1.10. xds: default xds.Server.AuthCheckFrequency at use-time instead of construction-time	2021-04-29 13:54:05 -05:00
Freddy	401f3010e0	Rename "cluster" config entry to "mesh" (#10127) This config entry is being renamed primarily because in k8s the name cluster could be confusing given that the config entry applies across federated datacenters. Additionally, this config entry will only apply to Consul as a service mesh, so the more generic "cluster" name is not needed.	2021-04-28 16:13:29 -06:00
Daniel Nephin	18c9e73832	connect: do not set QuerySource.Node Setting this field to a value is equivalent to using the 'near' query parameter. The intent is to sort the results by proximity to the node requesting them. However with connect we send the results to envoy, which doesn't care about the order, so setting this field is increasing the work performed for no gain. It is necessary to unset this field now because we would like connect to use streaming, but streaming does not support sorting by proximity.	2021-04-27 19:03:16 -04:00
Freddy	6d15569062	Split Upstream.Identifier() so non-empty namespace is always prepended in ent (#10031)	2021-04-15 13:54:40 -06:00
freddygv	36e9326dab	Fixup wildcard ent assertion	2021-04-12 17:04:33 -06:00
freddygv	eeccba945d	Replace TransparentProxy bool with ProxyMode This PR replaces the original boolean used to configure transparent proxy mode. It was replaced with a string mode that can be set to: - "": Empty string is the default for when the setting should be defaulted from other configuration like config entries. - "direct": Direct mode is how applications originally opted into the mesh. Proxy listeners need to be dialed directly. - "transparent": Transparent mode enables configuring Envoy as a transparent proxy. Traffic must be captured and redirected to the inbound and outbound listeners. This PR also adds a struct for transparent proxy specific configuration. Initially this is not stored as a pointer. Will revisit that decision before GA.	2021-04-12 09:35:14 -06:00
freddygv	0d0205e0dc	PR comments	2021-04-08 11:16:03 -06:00
freddygv	ddc6c9b7ca	Ensure mesh gateway mode override is set for upstreams for intentions	2021-04-07 09:32:48 -06:00
freddygv	619dc5ede4	Finish resolving upstream defaults in proxycfg	2021-04-07 09:32:48 -06:00
R.B. Boyer	82245585c6	connect: add toggle to globally disable wildcard outbound network access when transparent proxy is enabled (#9973) This adds a new config entry kind "cluster" with a single special name "cluster" where this can be controlled.	2021-04-06 13:19:59 -05:00
freddygv	b56bd690aa	Fixup enterprise tests from tproxy changes	2021-03-17 23:05:00 -06:00
freddygv	291d7562d1	Cancel watch on all errors	2021-03-17 21:44:14 -06:00
freddygv	6c43195e2a	Merge master and fix upstream config protocol defaulting	2021-03-17 21:13:40 -06:00
freddygv	3c7e5c3308	PR comments	2021-03-17 16:18:56 -06:00
freddygv	3c97e5a777	Update proxycfg for transparent proxy	2021-03-17 13:40:39 -06:00
Daniel Nephin	2a53b8293a	proxycfg: use rpcclient/health.Client instead of passing around cache name This should allow us to swap out the implementation with something other than `agent/cache` without making further code changes.	2021-03-12 11:46:04 -05:00
Daniel Nephin	410b1261c2	proxycfg: Use streaming in connect state	2021-03-12 11:35:42 -05:00
Freddy	5a50b26767	Avoid potential proxycfg/xDS deadlock using non-blocking send	2021-02-08 16:14:06 -07:00
freddygv	a417f88e44	Update comments on avoiding proxycfg deadlock	2021-02-08 09:45:45 -07:00
R.B. Boyer	77424e179a	xds: prevent LDS flaps in mesh gateways due to unstable datacenter lists (#9651) Also fix a similar issue in Terminating Gateways that was masked by an overzealous test.	2021-02-08 10:19:57 -06:00
freddygv	0a8f2f2105	Retry send after timer fires, in case no updates occur	2021-02-05 18:00:59 -07:00
freddygv	57c29aba5d	Update proxycfg logging, labels were already attached	2021-02-05 15:14:49 -07:00
freddygv	a0be7dcc1d	Add trace logs to proxycfg state runner and xds srv	2021-02-02 12:26:38 -07:00
freddygv	0fb96afe31	Avoid potential deadlock using non-blocking send Deadlock scenario: 1. Due to scheduling, the state runner sends one snapshot into snapCh and then attempts to send a second. The first send succeeds because the channel is buffered, but the second blocks. 2. Separately, Manager.Watch is called by the xDS server after getting a discovery request from Envoy. This function acquires the manager lock and then blocks on receiving the CurrentSnapshot from the state runner. 3. Separately, there is a Manager goroutine that reads the snapshots from the channel in step 1. These reads are done to notify proxy watchers, but they require holding the manager lock. This goroutine goes to acquire that lock, but can't because it is held by step 2. Now, the goroutine from step 3 is waiting on the one from step 2 to release the lock. The goroutine from step 2 won't release the lock until the goroutine in step 1 advances. But the goroutine in step 1 is waiting for the one in step 3. Deadlock. By making this send non-blocking step 1 above can proceed. The coalesce timer will be reset and a new valid snapshot will be delivered after it elapses or when one is requested by xDS.	2021-02-02 11:31:14 -07:00
Daniel Nephin	ef0999547a	testing: skip slow tests with -short Add a skip condition to all tests slower than 100ms. This change was made using `gotestsum tool slowest` with data from the last 3 CI runs of master. See https://github.com/gotestyourself/gotestsum#finding-and-skipping-slow-tests With this change: ``` $ time go test -count=1 -short ./agent ok github.com/hashicorp/consul/agent 0.743s real 0m4.791s $ time go test -count=1 -short ./agent/consul ok github.com/hashicorp/consul/agent/consul 4.229s real 0m8.769s ```	2020-12-07 13:42:55 -05:00
freddygv	e0db834148	Fix text type assertion	2020-09-14 16:28:40 -06:00
freddygv	43efb4809c	Merge master	2020-09-14 16:17:43 -06:00
freddygv	66e5c5989a	Fix type assertion	2020-09-14 16:12:21 -06:00
freddygv	60cb306524	Add session flag to cookie config	2020-09-11 18:34:03 -06:00
freddygv	5871b667a5	Revert EnvoyConfig nesting	2020-09-11 09:21:43 -06:00


241 Commits
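
Note on commit 0fb96afe31 (non-blocking send): the deadlock described in that message is broken by making the sender give up instead of waiting when the buffered channel is full. The snippet below is only a minimal sketch of that general Go idiom, not the actual proxycfg code. The `snapshot` type, the `trySend` helper, and the channel size are hypothetical names and values chosen for illustration.

```go
package main

import "fmt"

// snapshot is a hypothetical stand-in for a proxy config snapshot.
type snapshot struct{ version int }

// trySend attempts to send s on ch without blocking.
// It reports true if the value was delivered (or buffered) and
// false if the channel could not accept it right now.
func trySend(ch chan<- snapshot, s snapshot) bool {
	select {
	case ch <- s:
		return true
	default:
		// Buffer is full and nobody is receiving: drop instead of blocking.
		// A caller would typically retry later, e.g. when a timer fires.
		return false
	}
}

func main() {
	// Buffered channel with room for exactly one snapshot (assumed size).
	snapCh := make(chan snapshot, 1)

	fmt.Println(trySend(snapCh, snapshot{version: 1})) // buffer has room
	fmt.Println(trySend(snapCh, snapshot{version: 2})) // buffer full, send is skipped

	<-snapCh // a reader drains the channel

	fmt.Println(trySend(snapCh, snapshot{version: 3})) // room again
}
```

With a plain `snapCh <- s`, the second send would block until a reader arrived, which is the state step 1 of the commit message gets stuck in. With `select` and `default`, the sender continues and can deliver a fresh snapshot later.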