open-consul

Commit Graph

Author	SHA1	Message	Date
Chris S. Kim	829554c706	peering: Make Upstream peer-aware (#12900 ) Adds DestinationPeer field to Upstream. Adds Peer field to UpstreamID and its string conversion functions.	2022-04-29 18:12:51 -04:00
R.B. Boyer	cc4733e60d	proxycfg: change how various proxycfg test helpers for making ConfigSnapshot copies works to be more correct and less error prone (#12531 ) Prior to this PR for the envoy xDS golden tests in the agent/xds package we were hand-creating a proxycfg.ConfigSnapshot structure in the proper format for input to the xDS generator. Over time this intermediate structure has gotten trickier to build correctly for the various tests. This PR proposes to switch to using the existing mechanism for turning a structs.NodeService and a sequence of cache.UpdateEvent copies into a proxycfg.ConfigSnapshot, as that is less error prone to construct and aligns more with how the data arrives. NOTE: almost all of this is in test-related code. I tried super hard to craft correct event inputs to get the golden files to be the same, or similar enough after construction to feel ok that i recreated the spirit of the original test cases.	2022-03-07 11:47:14 -06:00
freddygv	7fba7456ec	Fix race of upstreams with same passthrough ip Due to timing, a transparent proxy could have two upstreams to dial directly with the same address. For example: - The orders service can dial upstreams shipping and payment directly. - An instance of shipping at address 10.0.0.1 is deregistered. - Payments is scaled up and scheduled to have address 10.0.0.1. - The orders service receives the event for the new payments instance before seeing the deregistration for the shipping instance. At this point two upstreams have the same passthrough address and Envoy will reject the listener configuration. To disambiguate this commit considers the Raft index when storing passthrough addresses. In the example above, 10.0.0.1 would only be associated with the newer payments service instance.	2022-02-10 17:01:57 -07:00
R.B. Boyer	baf886c6f3	proxycfg: introduce explicit UpstreamID in lieu of bare string (#12125 ) The gist here is that now we use a value-type struct proxycfg.UpstreamID as the map key in ConfigSnapshot maps where we used to use "upstream id-ish" strings. These are internal only and used just for bidirectional trips through the agent cache keyspace (like the discovery chain target struct). For the few places where the upstream id needs to be projected into xDS, that's what (proxycfg.UpstreamID).EnvoyID() is for. This lets us ALWAYS inject the partition and namespace into these things without making stuff like the golden testdata diverge.	2022-01-20 10:12:04 -06:00
Dhia Ayachi	5f6bf369af	reset `coalesceTimer` to nil as soon as the event is consumed (#11924 ) * reset `coalesceTimer` to nil as soon as the event is consumed * add change log * refactor to add relevant test. * fix linter * Apply suggestions from code review Co-authored-by: Freddy <freddygv@users.noreply.github.com> * remove non needed check Co-authored-by: Freddy <freddygv@users.noreply.github.com>	2022-01-05 12:17:47 -05:00
R.B. Boyer	a0156785dd	various partition related todos (#11822 )	2021-12-13 11:43:33 -06:00
Freddy	eb2b40b22d	Update filter chain creation for sidecar/ingress listeners (#11245 ) The duo of `makeUpstreamFilterChainForDiscoveryChain` and `makeListenerForDiscoveryChain` were really hard to reason about, and led to concealing a bug in their branching logic. There were several issues here: - They tried to accomplish too much: determining filter name, cluster name, and whether RDS should be used. - They embedded logic to handle significantly different kinds of upstream listeners (passthrough, prepared query, typical services, and catch-all) - They needed to coalesce different data sources (Upstream and CompiledDiscoveryChain) Rather than handling all of those tasks inside of these functions, this PR pulls out the RDS/clusterName/filterName logic. This refactor also fixed a bug with the handling of [UpstreamDefaults](https://www.consul.io/docs/connect/config-entries/service-defaults#defaults). These defaults get stored as UpstreamConfig in the proxy snapshot with a DestinationName of "", since they apply to all upstreams. However, this wildcard destination name must not be used when creating the name of the associated upstream cluster. The coalescing logic in the original functions here was in some situations creating clusters with a `.` prefix, which is not a valid destination.	2021-11-09 14:43:51 -07:00
freddygv	ce43e8cf99	Store GatewayKey in proxycfg snapshot for re-use	2021-11-01 13:58:53 -06:00
freddygv	6657c88296	Update locality check in proxycfg	2021-11-01 13:58:53 -06:00
freddygv	ea311d2e47	Configure sidecars to watch gateways in partitions Previously the datacenter of the gateway was the key identifier, now it is the datacenter and partition. When dialing services in other partitions or datacenters we now watch the appropriate partition.	2021-10-26 23:35:37 -06:00
Freddy	57ca0ed480	Log the correlation ID when blocking queries fire (#10689 ) Knowing that blocking queries are firing does not provide much information on its own. If we know the correlation IDs we can piece together which parts of the snapshot have been populated. Some of these responses might be empty from the blocking query timing out. But if they're returning quickly I think we can reasonably assume they contain data.	2021-07-23 16:36:17 -06:00
Daniel Nephin	41bf0670a8	proxycfg: move each handler into a seprate file There is no interaction between these handlers, so splitting them into separate files makes it easier to discover the full implementation of each kindHandler.	2021-06-21 15:48:40 -04:00
Daniel Nephin	96896409d6	Merge pull request #9489 from hashicorp/dnephin/proxycfg-state-2 proxycfg: split state into a handler for each kind	2021-06-18 13:57:28 -04:00
Nitya Dhanushkodi	ffbbe9e73f	proxycfg: reference to entry in map should not panic	2021-06-17 11:49:04 -07:00
Daniel Nephin	b7293242f1	Replace type conversion with embedded structs	2021-06-17 13:23:35 -04:00
Daniel Nephin	40ff895927	proxycfg: split state into kind-specific types This commit extracts all the kind-specific logic into handler types, and keeps the generic parts on the state struct. This change should make it easier to add new kinds, and see the implementation of each kind more clearly.	2021-06-16 14:04:01 -04:00
Daniel Nephin	b57f03feff	proxycfg: unmethod hostnameEndpoints the method receiver can be replaced by the first argument. This will allow us to extract more from the state struct in the future.	2021-06-16 14:03:30 -04:00
Daniel Nephin	b40174ccf2	Merge pull request #9466 from hashicorp/dnephin/proxycfg-state proxycfg: prepare state for split by kind	2021-06-16 13:14:26 -04:00
Nitya Dhanushkodi	08ed3edf71	proxycfg: Ensure that endpoints for explicit upstreams in other datacenters are watched in transparent mode (#10391 ) Co-authored-by: Freddy Vallenilla <freddy@hashicorp.com>	2021-06-15 11:00:26 -07:00
Daniel Nephin	cbcc1a3a86	proxycfg: extract two types from state struct These two new struct types will allow us to make polymorphic handler for each kind, instad of having all the logic for each proxy kind on the state struct.	2021-06-10 17:42:17 -04:00
Daniel Nephin	b99da95e70	proxycfg: pass context around where it is needed context.Context should never be stored on a struct (as it says in the godoc) because it is easy to to end up with the wrong context when it is stored. Also see https://blog.golang.org/context-and-structs This change is also in preparation for splitting state into kind-specific handlers so that the implementation of each kind is grouped together.	2021-06-10 17:34:50 -04:00
Freddy	61ae2995b7	Add flag for transparent proxies to dial individual instances (#10329 )	2021-06-09 14:34:17 -06:00
Mark Anderson	626b27a874	Continue working through proxy and agent Rework/listeners, rename makeListener Refactor, tests pass Signed-off-by: Mark Anderson <manderson@hashicorp.com>	2021-05-04 12:41:43 -07:00
Freddy	ec38cf3206	Fixup discovery chain handling in transparent mode (#10168 ) Co-authored-by: R.B. Boyer <4903+rboyer@users.noreply.github.com> Previously we would associate the address of a discovery chain target with the discovery chain's filter chain. This was broken for a few reasons: - If the upstream is a virtual service, the client proxy has no way of dialing it because virtual services are not targets of their discovery chains. The targets are distinct services. This is addressed by watching the endpoints of all upstream services, not just their discovery chain targets. - If multiple discovery chains resolve to the same target, that would lead to multiple filter chains attempting to match on the target's virtual IP. This is addressed by only matching on the upstream's virtual IP. NOTE: this implementation requires an intention to the redirecting virtual service and not just to the final destination. This is how we can know that the virtual service is an upstream to watch. A later PR will look into traversing discovery chains when computing upstreams so that intentions are only required to the discovery chain targets.	2021-05-04 08:45:19 -06:00
Freddy	401f3010e0	Rename "cluster" config entry to "mesh" (#10127 ) This config entry is being renamed primarily because in k8s the name cluster could be confusing given that the config entry applies across federated datacenters. Additionally, this config entry will only apply to Consul as a service mesh, so the more generic "cluster" name is not needed.	2021-04-28 16:13:29 -06:00
freddygv	eeccba945d	Replace TransparentProxy bool with ProxyMode This PR replaces the original boolean used to configure transparent proxy mode. It was replaced with a string mode that can be set to: - "": Empty string is the default for when the setting should be defaulted from other configuration like config entries. - "direct": Direct mode is how applications originally opted into the mesh. Proxy listeners need to be dialed directly. - "transparent": Transparent mode enables configuring Envoy as a transparent proxy. Traffic must be captured and redirected to the inbound and outbound listeners. This PR also adds a struct for transparent proxy specific configuration. Initially this is not stored as a pointer. Will revisit that decision before GA.	2021-04-12 09:35:14 -06:00
freddygv	0d0205e0dc	PR comments	2021-04-08 11:16:03 -06:00
freddygv	ddc6c9b7ca	Ensure mesh gateway mode override is set for upstreams for intentions	2021-04-07 09:32:48 -06:00
freddygv	619dc5ede4	Finish resolving upstream defaults in proxycfg	2021-04-07 09:32:48 -06:00
R.B. Boyer	82245585c6	connect: add toggle to globally disable wildcard outbound network access when transparent proxy is enabled (#9973 ) This adds a new config entry kind "cluster" with a single special name "cluster" where this can be controlled.	2021-04-06 13:19:59 -05:00
freddygv	291d7562d1	Cancel watch on all errors	2021-03-17 21:44:14 -06:00
freddygv	6c43195e2a	Merge master and fix upstream config protocol defaulting	2021-03-17 21:13:40 -06:00
freddygv	3c7e5c3308	PR comments	2021-03-17 16:18:56 -06:00
freddygv	3c97e5a777	Update proxycfg for transparent proxy	2021-03-17 13:40:39 -06:00
Daniel Nephin	2a53b8293a	proxycfg: use rpcclient/health.Client instead of passing around cache name This should allow us to swap out the implementation with something other than `agent/cache` without making further code changes.	2021-03-12 11:46:04 -05:00
Daniel Nephin	410b1261c2	proxycfg: Use streaming in connect state	2021-03-12 11:35:42 -05:00
freddygv	a417f88e44	Update comments on avoiding proxycfg deadlock	2021-02-08 09:45:45 -07:00
freddygv	0a8f2f2105	Retry send after timer fires, in case no updates occur	2021-02-05 18:00:59 -07:00
freddygv	57c29aba5d	Update proxycfg logging, labels were already attached	2021-02-05 15:14:49 -07:00
freddygv	a0be7dcc1d	Add trace logs to proxycfg state runner and xds srv	2021-02-02 12:26:38 -07:00
freddygv	0fb96afe31	Avoid potential deadlock using non-blocking send Deadlock scenario: 1. Due to scheduling, the state runner sends one snapshot into snapCh and then attempts to send a second. The first send succeeds because the channel is buffered, but the second blocks. 2. Separately, Manager.Watch is called by the xDS server after getting a discovery request from Envoy. This function acquires the manager lock and then blocks on receiving the CurrentSnapshot from the state runner. 3. Separately, there is a Manager goroutine that reads the snapshots from the channel in step 1. These reads are done to notify proxy watchers, but they require holding the manager lock. This goroutine goes to acquire that lock, but can't because it is held by step 2. Now, the goroutine from step 3 is waiting on the one from step 2 to release the lock. The goroutine from step 2 won't release the lock until the goroutine in step 1 advances. But the goroutine in step 1 is waiting for the one in step 3. Deadlock. By making this send non-blocking step 1 above can proceed. The coalesce timer will be reset and a new valid snapshot will be delivered after it elapses or when one is requested by xDS.	2021-02-02 11:31:14 -07:00
freddygv	43efb4809c	Merge master	2020-09-14 16:17:43 -06:00
freddygv	66e5c5989a	Fix type assertion	2020-09-14 16:12:21 -06:00
R.B. Boyer	f2b8bf109c	xds: use envoy's rbac filter to handle intentions entirely within envoy (#8569 )	2020-08-27 12:20:58 -05:00
Daniel Nephin	1ef8279ac9	Merge pull request #8034 from hashicorp/dnephin/add-linter-staticcheck-4 ci: enable SA4006 staticcheck check and add ineffassign	2020-06-17 12:16:02 -04:00
Daniel Nephin	89d95561df	Enable gofmt simplify Code changes done automatically with 'gofmt -s -w'	2020-06-16 13:21:11 -04:00
Daniel Nephin	5f24171f13	ci: enable SA4006 staticcheck check And fix the 'value not used' issues. Many of these are not bugs, but a few are tests not checking errors, and one appears to be a missed error in non-test code.	2020-06-16 13:10:11 -04:00
freddygv	1e7e716742	Move compound service names to use ServiceName type	2020-06-12 13:47:43 -06:00
Freddy	66e2def461	Only pass one hostname via EDS and prefer healthy ones (#8084 ) Co-authored-by: Matt Keeler <mkeeler@users.noreply.github.com> Currently when passing hostname clusters to Envoy, we set each service instance registered with Consul as an LbEndpoint for the cluster. However, Envoy can only handle one per cluster: [2020-06-04 18:32:34.094][1][warning][config] [source/common/config/grpc_subscription_impl.cc:87] gRPC config for type.googleapis.com/envoy.api.v2.Cluster rejected: Error adding/updating cluster(s) dc2.internal.ddd90499-9b47-91c5-4616-c0cbf0fc358a.consul: LOGICAL_DNS clusters must have a single locality_lb_endpoint and a single lb_endpoint, server.dc2.consul: LOGICAL_DNS clusters must have a single locality_lb_endpoint and a single lb_endpoint Envoy is currently handling this gracefully by only picking one of the endpoints. However, we should avoid passing multiple to avoid these warning logs. This PR: * Ensures we only pass one endpoint, which is tied to one service instance. * We prefer sending an endpoint which is marked as Healthy by Consul. * If no endpoints are healthy we emit a warning and skip the cluster. * If multiple unique hostnames are spread across service instances we emit a warning and let the user know which will be resolved.	2020-06-12 13:46:17 -06:00
Freddy	f759a48726	Enable gateways to resolve hostnames to IPv4 addresses (#7999 ) The DNS resolution will be handled by Envoy and defaults to LOGICAL_DNS. This discovery type can be overridden on a per-gateway basis with the envoy_dns_discovery_type Gateway Option. If a service contains an instance with a hostname as an address we set the Envoy cluster to use DNS as the discovery type rather than EDS. Since both mesh gateways and terminating gateways route to clusters using SNI, whenever there is a mix of hostnames and IP addresses associated with a service we use the hostname + CDS rather than the IPs + EDS. Note that we detect hostnames by attempting to parse the service instance's address as an IP. If it is not a valid IP we assume it is a hostname.	2020-06-03 15:28:45 -06:00

1 2

94 Commits