open-consul

Commit Graph

Author	SHA1	Message	Date
Chris S. Kim	7b338c8d00	Merge pull request #13998 from jorgemarey/f-new-tracing-envoy Add new envoy tracing configuration	2022-09-01 08:57:23 -04:00
Derek Menteer	ab9d421ba2	Change serf-tag references to field references.	2022-08-31 16:38:42 -05:00
malizz	ad30192499	validate args before deleting proxy defaults (#14290 ) * validate args before deleting proxy defaults * add changelog * validate name when normalizing proxy defaults * add test for proxyConfigEntry * add comments	2022-08-31 13:03:38 -07:00
Kyle Havlovitz	c5370d52e9	Prune old expired intermediate certs when appending a new one	2022-08-31 11:41:58 -07:00
Eric Haberkorn	06e7f3cadb	Finish up cluster peering failover (#14396 )	2022-08-30 11:46:34 -04:00
Chris S. Kim	9c157e40a3	Merge branch 'main' into NET-638-push-server-address-updates-to-the-peer # Conflicts: # agent/grpc-external/services/peerstream/stream_test.go	2022-08-30 11:09:25 -04:00
Jorge Marey	e3813586f3	Fix typos. Add test. Add documentation	2022-08-30 16:59:02 +02:00
Jorge Marey	4d8f5ab539	Add new tracing configuration	2022-08-30 16:59:02 +02:00
Freddy	f27a9effca	Merge pull request #13496 from maxb/fix-kv_entries-metric	2022-08-29 15:35:11 -06:00
Freddy	69d99aa8c0	Merge pull request #14364 from hashicorp/peering/term-delete	2022-08-29 15:33:18 -06:00
Max Bowsher	3aefc4123f	Merge branch 'main' into fix-kv_entries-metric	2022-08-29 22:22:10 +01:00
Chris S. Kim	7b267f5c01	Merge pull request #14371 from hashicorp/kisunji/peering-metrics-update Adjust metrics reporting for peering tracker	2022-08-29 17:16:19 -04:00
Chris S. Kim	e4a154c88e	Add heartbeat timeout grace period when accounting for peering health	2022-08-29 16:32:26 -04:00
Derek Menteer	b641dcf03d	Expose `grpc_tls` via serf for cluster peering.	2022-08-29 13:43:49 -05:00
Derek Menteer	4a01d75cf8	Add separate grpc_tls port. To ease the transition for users, the original gRPC port can still operate in a deprecated mode as either plain-text or TLS mode. This behavior should be removed in a future release whenever we no longer support this. The resulting behavior from this commit is: `ports.grpc > 0 && ports.grpc_tls > 0` spawns both plain-text and tls ports. `ports.grpc > 0 && grpc.tls == undefined` spawns a single plain-text port. `ports.grpc > 0 && grpc.tls != undefined` spawns a single tls port (backwards compat mode).	2022-08-29 13:43:43 -05:00
freddygv	f790d84c04	Add validation to prevent switching dialing mode This prevents unexpected changes to the output of ShouldDial, which should never change unless a peering is deleted and recreated.	2022-08-29 12:31:13 -06:00
Eric Haberkorn	13992d5dc8	Update max_ejection_percent on outlier detection for peered clusters to 100% (#14373 ) We can't trust health checks on peered services when service resolvers, splitters and routers are used.	2022-08-29 13:46:41 -04:00
Chris S. Kim	a58e943502	Rename test	2022-08-29 10:34:50 -04:00
Chris S. Kim	78bf8437d8	Fix test	2022-08-29 10:20:30 -04:00
Eric Haberkorn	2a370d456b	Update the structs and discovery chain for service resolver redirects to cluster peers. (#14366 )	2022-08-29 09:51:32 -04:00
Chris S. Kim	b1025f2dd9	Adjust metrics reporting for peering tracker	2022-08-26 17:34:17 -04:00
freddygv	19f25fc3a5	Allow terminated peerings to be deleted Peerings are terminated when a peer decides to delete the peering from their end. Deleting a peering sends a termination message to the peer and triggers them to mark the peering as terminated but does NOT delete the peering itself. This is to prevent peerings from disappearing from both sides just because one side deleted them. Previously the Delete endpoint was skipping the deletion if the peering was not marked as active. However, terminated peerings are also inactive. This PR makes some updates so that peerings marked as terminated can be deleted by users.	2022-08-26 10:52:47 -06:00
Chris S. Kim	7e95b35102	Fix casing	2022-08-26 11:56:26 -04:00
Chris S. Kim	516a6daefa	Merge branch 'main' into catalog-service-list-filter	2022-08-26 11:16:06 -04:00
Chris S. Kim	a2c857df40	Fix tests for enterprise	2022-08-26 11:14:02 -04:00
Chris S. Kim	a5e9ea6d96	Merge branch 'main' into NET-638-push-server-address-updates-to-the-peer # Conflicts: # agent/grpc-external/services/peerstream/stream_test.go	2022-08-26 10:43:56 -04:00
Chris S. Kim	a8090268d4	Replace ring buffer with async version (#14314 ) We need to watch for changes to peerings and update the server addresses which get served by the ring buffer. Also, if there is an active connection for a peer, we are getting up-to-date server addresses from the replication stream and can safely ignore the token's addresses which may be stale.	2022-08-26 10:27:13 -04:00
alex	f64af3be24	peering: add peer health metric (#14004 ) Signed-off-by: acpana <8968914+acpana@users.noreply.github.com>	2022-08-25 16:32:59 -07:00
Chris S. Kim	2e75833133	Exit loop when context is cancelled	2022-08-25 11:48:25 -04:00
cskh	7ee1c857c3	Fix: the inboundconnection limit filter should be placed in front of http co… (#14325 ) * fix: the inboundconnection limit should be placed in front of http connection manager Co-authored-by: Freddy <freddygv@users.noreply.github.com>	2022-08-24 14:13:10 -04:00
Chris S. Kim	eac63fea1f	Update test comment	2022-08-24 13:50:24 -04:00
Chris S. Kim	6f98c853b8	Add check for zero-length server addresses	2022-08-24 13:30:52 -04:00
skpratt	c039028401	no-op: refactor usagemetrics tests for clarity and DRY cases (#14313 )	2022-08-24 12:00:09 -05:00
Pablo Ruiz García	4188769c32	Added new auto_encrypt.grpc_server_tls config option to control AutoTLS enabling of GRPC Server's TLS usage Fix for #14253 Co-authored-by: trujillo-adam <47586768+trujillo-adam@users.noreply.github.com>	2022-08-24 12:31:38 -04:00
Dan Upton	20c87d235f	dataplane: update envoy bootstrap params for consul-dataplane (#14017 ) Contains 2 changes to the GetEnvoyBootstrapParams response to support consul-dataplane. Exposing node_name and node_id: consul-dataplane will support providing either the node_id or node_name in its configuration. Unfortunately, supporting both in the xDS meta adds a fair amount of complexity (partly because most tables are currently indexed on node_name) so for now we're going to return them both from the bootstrap params endpoint, allowing consul-dataplane to exchange a node_id for a node_name (which it will supply in the xDS meta). Properly setting service for gateways: To avoid the need to special case gateways in consul-dataplane, service will now either be the destination service name for connect proxies, or the gateway service name. This means it can be used as-is in Envoy configuration (i.e. as a cluster name or in metric tags).	2022-08-24 12:03:15 +01:00
Daniel Upton	1cd7ec0543	proxycfg: terminate stream on irrecoverable errors This is the OSS portion of enterprise PR 2339. It improves our handling of "irrecoverable" errors in proxycfg data sources. The canonical example of this is what happens when the ACL token presented by Envoy is deleted/revoked. Previously, the stream would get "stuck" until the xDS server re-checked the token (after 5 minutes) and terminated the stream. Materializers would also sit burning resources retrying something that could never succeed. Now, it is possible for data sources to mark errors as "terminal" which causes the xDS stream to be closed immediately. Similarly, the submatview.Store will evict materializers when it observes they have encountered such an error.	2022-08-23 20:17:49 +01:00
Chris S. Kim	1e7a3b8d8d	PR feedback to specify Node name in test mock	2022-08-23 11:51:04 -04:00
Eric Haberkorn	3d45306e1b	Cluster peering failover disco chain changes (#14296 )	2022-08-23 09:13:43 -04:00
Chris S. Kim	c14b166b80	Fix flakes	2022-08-22 14:45:31 -04:00
Chris S. Kim	587c57d3f4	Increase heartbeat rate to reduce test flakes	2022-08-22 14:24:05 -04:00
Chris S. Kim	c68e589f26	Remove check for ResponseNonce	2022-08-22 13:55:01 -04:00
Chris S. Kim	0ae3462e61	Add missing mock assertions	2022-08-22 13:55:01 -04:00
Chris S. Kim	575a56062f	Fix data race newMockSnapshotHandler has an assertion on t.Cleanup which gets called before the event publisher is cancelled. This commit reorders the context.WithCancel so it properly gets cancelled before the assertion is made.	2022-08-22 13:55:01 -04:00
cskh	e30d6bfc40	Fix: add missing ent meta for test (#14289 )	2022-08-22 13:51:04 -04:00
Chris S. Kim	98d102326f	Handle server addresses update as client	2022-08-22 13:42:12 -04:00
Chris S. Kim	205e873689	Send server addresses on update from server	2022-08-22 13:41:44 -04:00
Chris S. Kim	4cf54bef4e	Add new subscription for server addresses	2022-08-22 13:40:25 -04:00
Chris S. Kim	e1a7456a69	Cleanup unused logger	2022-08-22 13:40:23 -04:00
Chris S. Kim	9f96f98ab6	Expose external gRPC port in autopilot The grpc_port was added to a NodeService's meta in ea58f235f5da416224ba615405269661ba1f4d8d	2022-08-22 10:07:00 -04:00
cskh	a87d8f48be	fix: missing MaxInboundConnections field in service-defaults config entry (#14072 ) * fix: missing max_inbound_connections field in merge config	2022-08-19 14:11:21 -04:00
cskh	7f66dfc780	Fix: upgrade pkg imdario/merg to prevent merge config panic (#14237 ) * upgrade imdario/merg to prevent merge config panic * test: service definition takes precedence over service-defaults in merged results	2022-08-17 21:14:04 -04:00
James Hartig	a5a200e0e9	Use the maximum jitter when calculating the timeout The timeout should include the maximum possible jitter since the server will randomly add to it's timeout a jitter. If the server's timeout is less than the client's timeout then the client will return an i/o deadline reached error. Before: ``` time curl 'http://localhost:8500/v1/catalog/service/service?dc=other-dc&stale=&wait=600s&index=15820644' rpc error making call: i/o deadline reached real 10m11.469s user 0m0.018s sys 0m0.023s ``` After: ``` time curl 'http://localhost:8500/v1/catalog/service/service?dc=other-dc&stale=&wait=600s&index=15820644' [...] real 10m35.835s user 0m0.021s sys 0m0.021s ```	2022-08-17 10:24:09 -04:00
Eric Haberkorn	40ce1c8288	Add `Targets` field to service resolver failovers. (#14162 ) This field will be used for cluster peering failover.	2022-08-15 09:20:25 -04:00
cskh	c20d016f62	fix: missing segment and partition (#14194 )	2022-08-12 15:21:39 -04:00
Eric Haberkorn	11884bfb99	Refactor failover code to use Envoy's aggregate clusters (#14178 )	2022-08-12 14:30:46 -04:00
cskh	e7b5baa3cc	feat(telemetry): add labels to serf and memberlist metrics (#14161 ) * feat(telemetry): add labels to serf and memberlist metrics * changelog * doc update Co-authored-by: R.B. Boyer <4903+rboyer@users.noreply.github.com>	2022-08-11 22:09:56 -04:00
Chris S. Kim	182399255b	Handle breaking change for ServiceVirtualIP restore (#14149 ) Consul 1.13.0 changed ServiceVirtualIP to use PeeredServiceName instead of ServiceName which was a breaking change for those using service mesh and wanted to restore their snapshot after upgrading to 1.13.0. This commit handles existing data with older ServiceName and converts it during restore so that there are no issues when restoring from older snapshots.	2022-08-11 14:47:10 -04:00
Chris S. Kim	55945a8231	Add test to verify forwarding	2022-08-11 11:16:02 -04:00
Chris S. Kim	fbbb54fdc2	Register peerStreamServer internally to enable RPC forwarding	2022-08-11 11:16:02 -04:00
Chris S. Kim	534096a6ac	Handle wrapped errors in isFailedPreconditionErr	2022-08-11 11:16:02 -04:00
Daniel Kimsey	4243e1e05f	Add support for filtering the 'List Services' API 1. Create a bexpr filter for performing the filtering 2. Change the state store functions to return the raw (not aggregated) list of ServiceNodes. 3. Move the aggregate service tags by name logic out of the state store functions into a new function called from the RPC endpoint 4. Perform the filtering in the endpoint before aggregation.	2022-08-10 16:52:32 -05:00
cskh	647f9787f8	fix: shadowed err in retryJoin() (#14112 ) - err value will be used later to surface the error message if r.join() returns any err.	2022-08-10 10:53:57 -04:00
skpratt	070ed3738d	Merge pull request #14056 from hashicorp/proxy-register-port-race Refactor sidecar_service method to separate port assignment	2022-08-10 09:46:29 -05:00
skpratt	7f1f095b2f	Merge branch 'main' into proxy-register-port-race	2022-08-10 08:40:45 -05:00
Chris S. Kim	79d00f59cd	Close active listeners on error If startListeners successfully created listeners for some of its input addresses but eventually failed, the function would return an error and existing listeners would not be cleaned up.	2022-08-09 12:22:39 -04:00
Chris S. Kim	4de96a1f3c	Add retry in TestAgentConnectCALeafCert_good	2022-08-09 11:20:37 -04:00
Kyle Havlovitz	57afbb58ac	Merge pull request #13958 from hashicorp/gateway-wildcard-fix Fix wildcard picking up services it shouldn't for ingress/terminating gateways	2022-08-08 12:54:40 -07:00
Kyle Havlovitz	2a0ab31ca4	Add some extra handling for destination deletes	2022-08-08 11:38:13 -07:00
freddygv	1e48b4f665	Update snapshot test	2022-08-08 09:17:15 -06:00
freddygv	65bcd3d84f	Re-validate existing secrets at state store Previously establishment and pending secrets were only checked at the RPC layer. However, given that these are Check-and-Set transactions we should ensure that the given secrets are still valid when persisting a secret exchange or promotion. Otherwise it would be possible for concurrent requests to overwrite each other.	2022-08-08 09:06:07 -06:00
freddygv	67aa7ed15c	Test fixes	2022-08-08 08:31:47 -06:00
freddygv	01b0cbcbd7	Use proto message for each secrets write op Previously there was a field indicating the operation that triggered a secrets write. Now there is a message for each operation and it contains the secret ID being persisted.	2022-08-08 01:41:00 -06:00
Kyle Havlovitz	3f435f31ac	Update ingress/terminating wildcard logic and handle destinations	2022-08-05 07:56:10 -07:00
freddygv	3a623f2e9d	Inherit active secret when exchanging	2022-08-03 17:32:53 -05:00
freddygv	b089472a12	Pass explicit signal with op for secrets write Previously the updates to the peering secrets UUID table relied on inferring what action triggered the update based on a reconciliation against the existing secrets. Instead we now explicitly require the operation to be given so that the inference isn't necessary. This makes the UUID table logic easier to reason about and fixes some related bugs. There is also an update so that the peering secrets get handled on snapshots/restores.	2022-08-03 17:25:12 -05:00
freddygv	544b3603e9	Avoid deleting peering secret UUIDs at dialers Dialers do not keep track of peering secret UUIDs, so they should not attempt to clean up data from that table when their peering is deleted. We also now keep peer server addresses when marking peerings for deletion. Peer server addresses are used by the ShouldDial() helper when determining whether the peering is for a dialer or an acceptor. We need to keep this data so that peering secrets can be cleaned up accordingly.	2022-08-03 16:34:57 -05:00
skpratt	1ded7a7632	Merge pull request #13906 from skpratt/validate-port-agent-split Separate port and socket path validation for local agent	2022-08-02 16:58:41 -05:00
Dhia Ayachi	c1ca9afdf2	add token to the request when creating a cacheIntentions query (#14005 )	2022-08-02 14:27:34 -04:00
Kyle Havlovitz	fce49a1ec0	Fix wildcard picking up services it shouldn't for ingress/terminating gateways	2022-08-02 09:41:31 -07:00
Daniel Upton	8da6710958	proxycfg-sources: fix hot loop when service not found in catalog Fixes a bug where a service getting deleted from the catalog would cause the ConfigSource to spin in a hot loop attempting to look up the service. This is because we were returning a nil WatchSet which would always unblock the select. Kudos to @freddygv for discovering this!	2022-08-02 15:42:29 +01:00
Freddy	56144cf5f7	Various peering fixes (#13979 ) * Avoid logging StreamSecretID * Wrap additional errors in stream handler * Fix flakiness in leader test and rename servers for clarity. There was a race condition where the peering was being deleted in the test before the stream was active. Now the test waits for the stream to be connected on both sides before deleting the associated peering. * Run flaky test serially	2022-08-01 15:06:18 -06:00
DanStough	e46a4b3cc1	fix: ipv4 destination dns resolution	2022-08-01 16:45:57 -04:00
Luke Kysow	e9960dfdf3	peering: default to false (#13963 ) * defaulting to false because peering will be released as beta * Ignore peering disabled error in bundles cachetype Co-authored-by: Matt Keeler <mkeeler@users.noreply.github.com> Co-authored-by: freddygv <freddy@hashicorp.com> Co-authored-by: Matt Keeler <mjkeeler7@gmail.com>	2022-08-01 15:22:36 -04:00
Freddy	a54903b0f4	Merge branch 'main' into fix-kv_entries-metric	2022-08-01 13:19:27 -06:00
Freddy	593add2ec0	Merge pull request #13499 from maxb/delete-unused-metric Delete definition of metric `consul.acl.blocked.node.deregistration`	2022-08-01 12:31:05 -06:00
Dhia Ayachi	cf7e175eab	Tgtwy egress HTTP support (#13953 ) * add golden files * add support to http in tgateway egress destination * fix slice sorting to include both address and port when using server_names * fix listener loop for http destination * fix routes to generate a route per port and a virtualhost per port-address combination * sort virtual hosts list to have a stable order * extract redundant serviceNode	2022-08-01 14:12:43 -04:00
Matt Keeler	795e5830c6	Implement/Utilize secrets for Peering Replication Stream (#13977 )	2022-08-01 10:33:18 -04:00
alex	0f6354685b	block PeerName register requests (#13887 ) Signed-off-by: acpana <8968914+acpana@users.noreply.github.com>	2022-07-29 14:36:22 -07:00
Luke Kysow	17594a123e	peering: retry establishing connection more quickly on certain errors (#13938 ) When we receive a FailedPrecondition error, retry that more quickly because we expect it will resolve shortly. This is particularly important in the context of Consul servers behind a load balancer because when establishing a connection we have to retry until we randomly land on a leader node. The default retry backoff goes from 2s, 4s, 8s, etc. which can result in very long delays quite quickly. Instead, this backoff retries in 8ms five times, then goes exponentially from there: 16ms, 32ms, ... up to a max of 8152ms.	2022-07-29 13:04:32 -07:00
Sarah Pratt	11c7a465b7	Separate port and socket path requirement in case of local agent assignment	2022-07-29 13:28:21 -05:00
alex	74d79cc7e6	Merge pull request #13952 from hashicorp/sync-more-acl sync more acl enforcement	2022-07-28 12:31:02 -07:00
Dhia Ayachi	09340a846c	inject gateway addons to destination clusters (#13951 )	2022-07-28 15:17:35 -04:00
acpana	70e052f35f	sync more acl enforcement sync w ent at 32756f7 Signed-off-by: acpana <8968914+acpana@users.noreply.github.com>	2022-07-28 12:01:52 -07:00
alex	08b94640bc	Merge pull request #13929 from hashicorp/fix-validation [sync] fix empty partitions matching	2022-07-28 10:14:49 -07:00
Sarah Pratt	f01a4f91dc	refactor sidecare_service method into parts	2022-07-28 09:07:13 -05:00
Ashwin Venkatesh	e4aaf467eb	Add peer counts to emitted metrics. (#13930 )	2022-07-27 18:34:04 -04:00
Luke Kysow	a2290791b2	Merge pull request #13924 from hashicorp/lkysow/util-metric-peering peering: don't track imported services/nodes in usage	2022-07-27 14:49:55 -07:00
acpana	778c796ec9	use EqualPartitions Signed-off-by: acpana <8968914+acpana@users.noreply.github.com>	2022-07-27 14:48:30 -07:00
acpana	8042b3aeed	better fix Signed-off-by: acpana <8968914+acpana@users.noreply.github.com>	2022-07-27 14:28:08 -07:00
acpana	b03467e3bd	sync w ent Signed-off-by: acpana <8968914+acpana@users.noreply.github.com>	2022-07-27 11:41:39 -07:00

1 2 3 4 5 ...

4611 Commits