open-consul

Commit Graph

Author	SHA1	Message	Date
Chris S. Kim	58ffa0488d	Revert getPathSuffixUnescaped (#13256 )	2022-06-01 13:17:14 -04:00
Dan Upton	e6dc26e087	proxycfg: replace direct agent cache usage with interfaces (#13320 ) This is the OSS portion of enterprise PRs 1904, 1905, 1906, 1907, 1949, and 1971. It replaces the proxycfg manager's direct dependency on the agent cache with interfaces that will be implemented differently when serving xDS sessions from a Consul server.	2022-06-01 16:18:06 +01:00
Chris S. Kim	44a318ef73	Reimplement fs.FileInfo interface (#13315 ) Co-authored-by: R.B. Boyer <4903+rboyer@users.noreply.github.com>	2022-06-01 11:09:51 -04:00
Dhia Ayachi	d4a04457e1	update gateway-services table with endpoints (#13217 ) * update gateway-services table with endpoints * fix failing test * remove unneeded config in test * rename "endpoint" to "destination" * more endpoint renaming to destination in tests * update isDestination based on service-defaults config entry creation * use a 3 state kind to be able to set the kind to unknown (when neither a service or a destination exist) * set unknown state to empty to avoid modifying alot of tests * fix logic to set the kind correctly on CRUD * fix failing tests * add missing tests and fix service delete * fix failing test * Apply suggestions from code review Co-authored-by: Dan Stough <dan.stough@hashicorp.com> * fix a bug with kind and add relevant test * fix compile error * fix failing tests * add kind to clone * fix failing tests * fix failing tests in catalog endpoint * fix service dump test * Apply suggestions from code review Co-authored-by: Dan Stough <dan.stough@hashicorp.com> * remove duplicate tests * rename consts and fix kind when no destination is defined in the service-defaults. * rename Kind to ServiceKind and change switch to use .(type) Co-authored-by: Dan Stough <dan.stough@hashicorp.com>	2022-05-31 16:20:12 -04:00
Chris S. Kim	ea1e4aa52d	Update repo to use go:embed (#10996 ) Replace bindata packages with stdlib go:embed. Modernize some uiserver code with newer interfaces introduced in go 1.16 (mainly working with fs.File instead of http.File. Remove steps that are no longer used from our build files. Add Github Action to detect differences in agent/uiserver/dist and verify that the files are correct (by compiling UI assets and comparing contents).	2022-05-31 15:33:56 -04:00
Riddhi Shah	d558914a0f	[OSS] Fix merge central config tests (#13309 ) Setting the right enterprise meta to fix the merge central config tests. Re-added the tests that were failing on the OSS to ENT merge.	2022-05-31 12:04:19 -07:00
freddygv	14bff4fba6	Use embedded SpiffeID for peered upstreams	2022-05-31 09:55:37 -06:00
freddygv	4d3e09e8f8	Remove intermediate representation of SPIFFE IDs xDS only ever uses the string representation, so we can avoid passing around connect.SpiffeIDService objects around.	2022-05-31 09:55:37 -06:00
freddygv	5cd5108075	Return SPIFFE ID for connect proxies in PeerMeta Proxies dialing exporting services need to know the SPIFFE ID of services dialed so that the upstream's SANs can be validated. This commit attaches the SPIFFE ID to all connect proxies exported over the peering stream so that they are available to importing clusters. The data in the SPIFFE ID cannot be re-constructed in peer clusters because the partition of exported services is overwritten on imports.	2022-05-31 09:55:37 -06:00
Freddy	a75af9d94a	[OSS] Add grpc endpoint to fetch a specific trust bundle (#13292 ) Co-authored-by: R.B. Boyer <rb@hashicorp.com>	2022-05-31 09:54:40 -06:00
Matt Keeler	b9e8b5c692	Fix a flaky test (#13282 ) At the end of this test we were trying to ensure that updating a service in the local state causes it to re-register the service with the config manager. The config manager in the same method will also call RegisteredProxies to determine if any need to be removed. This portion of the test is not attempting to verify that behavior. Because the test is only blocked waiting for the Register event before it can end and assert all the mock expectations were met, we may not see the call to RegisteredProxies. This is especially apparent when tests are run with the race detector. As we don’t actually care if that method is executed before the end of the test we can simply transition from expecting it to be called exactly once to a 0 or 1 times assertion.	2022-05-27 13:25:08 -04:00
Dan Upton	a6a6d5a8ee	Enable servers to configure arbitrary proxies from the catalog (#13244 ) OSS port of enterprise PR 1822 Includes the necessary changes to the `proxycfg` and `xds` packages to enable Consul servers to configure arbitrary proxies using catalog data. Broadly, `proxycfg.Manager` now has public methods for registering, deregistering, and listing registered proxies — the existing local agent state-sync behavior has been moved into a separate component that makes use of these methods. When an xDS session is started for a proxy service in the catalog, a goroutine will be spawned to watch the service in the server's state store and re-register it with the `proxycfg.Manager` whenever it is updated (and clean it up when the client goes away).	2022-05-27 12:38:52 +01:00
alex	2d8664d384	monitor leadership in peering service (#13257 ) Signed-off-by: acpana <8968914+acpana@users.noreply.github.com> Co-authored-by: Chris S. Kim <ckim@hashicorp.com> Co-authored-by: Freddy <freddygv@users.noreply.github.com>	2022-05-26 17:55:16 -07:00
Riddhi Shah	8714ade534	Termporarily disable validation of merge central config response (#13266 ) Temporarily disabling the validation of merge central config response since it is breaking OSS to ENT merging. A follow up PR will patch the fixes.	2022-05-26 13:49:40 -07:00
Chris S. Kim	d73a9522cb	Add support for streaming CA roots to peers (#13260 ) Sender watches for changes to CA roots and sends them through the replication stream. Receiver saves CA roots to tablePeeringTrustBundle	2022-05-26 15:24:09 -04:00
Riddhi Shah	6f57acc1bf	Remove tests failing on ent (#13255 ) Will follow up with the fixed version of these tests that passes in ent.	2022-05-26 10:17:59 -07:00
John Cowen	bf5f1482fd	Export top-level HCP Enabled go-template variable for UI (#13165 ) * Update ui template data to export HCPEnabled at the top level	2022-05-26 17:23:56 +01:00
DanStough	65ca7e0bfb	fix: multiple grpc/http2 services for ingress listeners	2022-05-26 10:43:58 -04:00
Riddhi Shah	e5f1d8dce4	Add support for merge-central-config query param (#13001 ) Adds a new query param merge-central-config for use with the below endpoints: /catalog/service/:service /catalog/connect/:service /health/service/:service /health/connect/:service If set on the request, the response will include a fully resolved service definition which is merged with the proxy-defaults/global and service-defaults/:service config entries (on-demand style). This is useful to view the full service definition for a mesh service (connect-proxy kind or gateway kind) which might not be merged before being written into the catalog (example: in case of services in the agentless model).	2022-05-25 13:20:17 -07:00
R.B. Boyer	4f9a9bb851	remove a source of test panics (#13227 )	2022-05-25 14:33:00 -05:00
R.B. Boyer	dae47101fa	api: ensure peering API endpoints do not use protobufs (#13204 ) I noticed that the JSON api endpoints for peerings json encodes protobufs directly, rather than converting them into their `api` package equivalents before marshal/unmarshaling them. I updated this and used `mog` to do the annoying part in the middle. Other changes: - the status enum was converted into the friendlier string form of the enum for readability with tools like `curl` - some of the `api` library functions were slightly modified to match other similar endpoints in UX (cc: @ndhanushkodi ) - peeringRead returns `nil` if not found - partitions are NOT inferred from the agent's partition (matching 1.11-style logic)	2022-05-25 13:43:35 -05:00
R.B. Boyer	bc10055edc	peering: replicate expected SNI, SPIFFE, and service protocol to peers (#13218 ) The importing peer will need to know what SNI and SPIFFE name corresponds to each exported service. Additionally it will need to know at a high level the protocol in use (L4/L7) to generate the appropriate connection pool and local metrics. For replicated connect synthetic entities we edit the `Connect{}` part of a `NodeService` to have a new section: { "PeerMeta": { "SNI": [ "web.default.default.owt.external.183150d5-1033-3672-c426-c29205a576b8.consul" ], "SpiffeID": [ "spiffe://183150d5-1033-3672-c426-c29205a576b8.consul/ns/default/dc/dc1/svc/web" ], "Protocol": "tcp" } } This data is then replicated and saved as-is at the importing side. Both SNI and SpiffeID are slices for now until I can be sure we don't need them for how mesh gateways will ultimately work.	2022-05-25 12:37:44 -05:00
R.B. Boyer	69191fc0da	peering: disable requirement for mesh gateways initially (#13213 )	2022-05-25 10:13:23 -05:00
Kyle Havlovitz	cebf7b23f6	Merge pull request #13143 from hashicorp/envoy-connection-limit Add connection limit setting to service defaults	2022-05-25 07:48:50 -07:00
Kyle Havlovitz	f5f949d486	Fix proto lint errors after version bump	2022-05-24 18:44:54 -07:00
Kyle Havlovitz	749591ec98	Specify go_package explicitly	2022-05-24 10:22:53 -07:00
cskh	b7eec4c05b	fix: non-leader agents return 404 on Get Intention exact api (#13179 ) * fix: non-leader agents return 404 on Get Intention exact api - rpc call method appends extra error message, so change == to "Strings.Contains" Co-authored-by: Chris S. Kim <ckim@hashicorp.com>	2022-05-24 13:21:15 -04:00
Kyle Havlovitz	03dea180ad	Add connection limit setting to service defaults	2022-05-24 10:13:38 -07:00
DanStough	2c8ca25d8a	chore(test): Update bats version	2022-05-24 11:56:08 -04:00
DanStough	df59d8ab0d	feat: add endpoint struct to ServiceConfigEntry	2022-05-24 11:56:08 -04:00
alex	451dc50f4f	peering: expose IsLeader, hung up on dialer if follower (#13164 ) Signed-off-by: acpana <8968914+acpana@users.noreply.github.com> Co-authored-by: R.B. Boyer <4903+rboyer@users.noreply.github.com>	2022-05-23 11:30:58 -07:00
Matt Keeler	1fd02a13c2	Migrate from `protoc` to `buf` (#12841 ) * Install `buf` instead of `protoc` * Created `buf.yaml` and `buf.gen.yaml` files in the two proto directories to control how `buf` generates/lints proto code. * Invoke `buf` instead of `protoc` * Added a `proto-format` make target. * Committed the reformatted proto files. * Added a `proto-lint` make target. * Integrated proto linting with CI * Fixed tons of proto linter warnings. * Got rid of deprecated builtin protoc-gen-go grpc plugin usage. Moved to direct usage of protoc-gen-go-grpc. * Unified all proto directories / go packages around using pb prefixes but ensuring all proto packages do not have the prefix.	2022-05-23 10:37:52 -04:00
cskh	39cb731988	Upgrade golangci-lint for go v1.18 (#13176 )	2022-05-23 10:26:45 -04:00
R.B. Boyer	3b12a5179f	test: fix flaky test TestEventBufferFuzz (#13175 )	2022-05-23 09:22:30 -05:00
Matt Keeler	c629e89289	Fix tests broken in #13173 (#13178 ) I changed the error type returned in a situation but didn’t update the tests to expect that error.	2022-05-23 10:00:06 -04:00
Matt Keeler	8a968299dd	Fix flaky tests in the agent/grpc/public/services/serverdiscovery package (#13173 ) Occasionally we had seen the TestWatchServers_ACLToken_PermissionDenied be flagged as flaky in circleci. This change should fix that. Why it fixes it is complicated. The test was failing with a panic when a mocked ACL Resolver was being called more times than expected. I struggled for a while to determine how that could be. This test should call authorize once and only once and the error returned should cause the stream to be terminated and the error returned to the gRPC client. Another oddity was no amount of running this test locally seemed to be able to reproduce the issue. I ran the test hundreds of thousands of time and it always passed. It turns out that there is nothing wrong with the test. It just so happens that the panic from unexpected invocation of a mocked call happened during the test but was caused by a previous test (specifically the TestWatchServers_StreamLifecycle test) The stream from the previous test remained open after all the test Cleanup functions were run and it just so happened that when the EventPublisher eventually picked up that the context was cancelled during cleanup, it force closes all subscriptions which causes some loops to be re-entered and the streams to be reauthorized. Its that looping in response to forced subscription closures that causes the mock to eventually panic. All the components, publisher, server, client all operate based on contexts. We cancel all those contexts but there is no syncrhonous way to know when they are stopped. We could have implemented a syncrhonous stop but in the context of an actual running Consul, context cancellation + async stopping is perfectly fine. What we (Dan and I) eventually thought was that the behavior of grpc streams such as this when a server was shutting down wasn’t super helpful. What we would want is for a client to be able to distinguish between subscription closed because something may have changed requiring re-authentication and subscription closed because the server is shutting down. That way we can send back appropriate error messages to detail that the server is shutting down and not confuse users with potentially needing to resubscribe. So thats what this PR does. We have introduced a shutting down state to our event subscriptions and the various streaming gRPC services that rely on the event publisher will all just behave correctly and actually stop the stream (not attempt transparent reauthorization) if this particular error is the one we get from the stream. Additionally the error that gets transmitted back through gRPC when this does occur indicates to the consumer that the server is going away. That is more helpful so that a client can then attempt to reconnect to another server.	2022-05-23 08:59:13 -04:00
R.B. Boyer	69d3e729a4	agent: allow for service discovery queries involving peer name to use streaming (#13168 )	2022-05-20 15:27:01 -05:00
Dan Upton	30775ed54d	proxycfg: remove dependency on `cache.UpdateEvent` (#13144 ) OSS portion of enterprise PR 1857. This removes (most) references to the `cache.UpdateEvent` type in the `proxycfg` package. As we're going to be direct usage of the agent cache with interfaces that can be satisfied by alternative server-local datasources, it doesn't make sense to depend on this type everywhere anymore (particularly on the `state.ch` channel). We also plan to extract `proxycfg` out of Consul into a shared library in the future, which would require removing this dependency. Aside from a fairly rote find-and-replace, the main change is that the `cache.Cache` and `health.Client` types now accept a callback function parameter, rather than a `chan<- cache.UpdateEvents`. This allows us to do the type conversion without running another goroutine.	2022-05-20 15:47:40 +01:00
R.B. Boyer	63a9175bd6	peering: accept replication stream of discovery chain information at the importing side (#13151 )	2022-05-19 16:37:52 -05:00
R.B. Boyer	68789effeb	test: TestServer_RPC_MetricsIntercept should use a concurrency-safe metrics store (#13157 )	2022-05-19 15:39:28 -05:00
cskh	df27fa0c84	Retry on bad dogstatsd connection (#13091 ) - Introduce a new telemetry configurable parameter retry_failed_connection. User can set the value to true to let consul agent continue its start process on failed connection to datadog server. When set to false, agent will stop on failed start. The default behavior is true. Co-authored-by: Dan Upton <daniel@floppy.co> Co-authored-by: Evan Culver <eculver@users.noreply.github.com>	2022-05-19 16:03:46 -04:00
R.B. Boyer	91691eca87	peering: replicate discovery chains information to importing peers Treat each exported service as a "discovery chain" and replicate one synthetic CheckServiceNode for each chain and remote mesh gateway. The health will be a flattened generated check of the checks for that mesh gateway node.	2022-05-19 14:21:44 -05:00
R.B. Boyer	bf05e8c1f1	prefactor some functions out of the monolithic file	2022-05-19 14:21:29 -05:00
R.B. Boyer	09861a2792	test: fix incorrect use of t instead of r in retry test (#13146 )	2022-05-19 14:00:07 -05:00
Dan Upton	7492357b43	config: prevent top-level `verify_incoming` enabling mTLS on gRPC port (#13118 ) Fixes #13088 This is a backwards-compatibility bug introduced in 1.12.	2022-05-18 16:15:57 +01:00
Freddy	6c868b6c0e	Patches to peering initiation for POC demo (#13076 ) Co-authored-by: R.B. Boyer <rb@hashicorp.com>	2022-05-13 13:01:00 -06:00
Dhia Ayachi	70b93ea693	When a host header is defined override `req.Host` in the metrics ui (#13071 ) * When a host header is defined override the req.Host in the metrics ui endpoint. * add changelog	2022-05-13 14:05:22 -04:00
Freddy	160acdf876	Actually block when syncing subscriptions (#13066 ) By changing to use WatchCtx we will actually block for changes to the peering list. WatchCh creates a goroutine to collect errors from WatchCtx and returns immediately. The existing behavior wouldn't result in a tight loop because of the rate limiting in the surrounding function, but it would still lead to more work than is necessary.	2022-05-12 17:36:14 -06:00
Evan Culver	535e811020	peering: add TrustBundleListByService endpoint (#13048 )	2022-05-12 15:58:22 -07:00
Freddy	8894365c5a	[OSS] Add upsert handling for receiving CheckServiceNode (#13061 )	2022-05-12 15:04:44 -06:00

1 2 3 4 5 ...

4314 Commits