open-consul

Commit Graph

Author	SHA1	Message	Date
Chris S. Kim	aea00f10ae	Merge pull request #12442 from danieleva/12422-keyring Allows keyring operations on client agents	2022-02-25 16:28:56 -05:00
Evan Culver	7889071385	connect: Update supported Envoy versions to include 1.19.3 and 1.18.6	2022-02-24 16:59:33 -08:00
Evan Culver	9f4d9f3f74	connect: Upgrade Envoy 1.20 to 1.20.2 (#12443 )	2022-02-24 16:19:39 -08:00
R.B. Boyer	4b0f657b31	fix flaky test panic (#12446 )	2022-02-24 17:35:46 -06:00
R.B. Boyer	a97d20cf63	catalog: compare node names case insensitively in more places (#12444 ) Many places in consul already treated node names case insensitively. The state store indexes already do it, but there are a few places that did a direct byte comparison which have now been corrected. One place of particular consideration is ensureCheckIfNodeMatches which is executed during snapshot restore (among other places). If a node check used a slightly different casing than the casing of the node during register then the snapshot restore here would deterministically fail. This has been fixed. Primary approach: git grep -i "node.[!=]=.node" -- ':!_test.go' ':!docs' git grep -i '\[[^]]member[^]]\] git grep -i '\[[^]]$member\\|name\\|node$[^]]\]' -- ':!_test.go' ':!website' ':!ui' ':!agent/proxycfg/testing.go:' ':!*.md'	2022-02-24 16:54:47 -06:00
Daniele Vazzola	397b5ed957	Allows keyring operations on client agents	2022-02-24 17:24:57 +00:00
R.B. Boyer	d860384731	server: partly fix config entry replication issue that prevents replication in some circumstances (#12307 ) There are some cross-config-entry relationships that are enforced during "graph validation" at persistence time that are required to be maintained. This means that config entries may form a digraph at times. Config entry replication procedes in a particular sorted order by kind and name. Occasionally there are some fixups to these digraphs that end up replicating in the wrong order and replicating the leaves (ingress-gateway) before the roots (service-defaults) leading to replication halting due to a graph validation error related to things like mismatched service protocol requirements. This PR changes replication to give each computed change (upsert/delete) a fair shot at being applied before deciding to terminate that round of replication in error. In the case where we've simply tried to do the operations in the wrong order at least ONE of the outstanding requests will complete in the right order, leading the subsequent round to have fewer operations to do, with a smaller likelihood of graph validation errors. This does not address all scenarios, but for scenarios where the edits are being applied in the wrong order this should avoid replication halting. Fixes #9319 The scenario that is NOT ADDRESSED by this PR is as follows: 1. create: service-defaults: name=new-web, protocol=http 2. create: service-defaults: name=old-web, protocol=http 3. create: service-resolver: name=old-web, redirect-to=new-web 4. delete: service-resolver: name=old-web 5. update: service-defaults: name=old-web, protocol=grpc 6. update: service-defaults: name=new-web, protocol=grpc 7. create: service-resolver: name=old-web, redirect-to=new-web If you shutdown dc2 just before (4) and turn it back on after (7) replication is impossible as there is no single edit you can make to make forward progress.	2022-02-23 17:27:48 -06:00
Chris S. Kim	4b528edbe6	Merge pull request #12430 from hashicorp/ci/main-assetfs-build auto-updated agent/uiserver/bindata_assetfs.go from commit 73b6687c5	2022-02-23 18:19:30 -05:00
Daniel Nephin	3639f4b551	Merge pull request #11910 from hashicorp/dnephin/ca-provider-interface-for-ica-in-primary ca: add support for an external trusted CA	2022-02-22 13:14:52 -05:00
R.B. Boyer	11fdc70b34	configentry: make a new package to hold shared config entry structs that aren't used for RPC or the FSM (#12384 ) First two candidates are ConfigEntryKindName and DiscoveryChainConfigEntries.	2022-02-22 10:36:36 -06:00
Dhia Ayachi	378f688a6a	file watcher to be used for configuration auto-reload feature (#12301 ) * add config watcher to the config package * add logging to watcher * add test and refactor to add WatcherEvent. * add all API calls and fix a bug with recreated files * add tests for watcher * remove the unnecessary use of context * Add debug log and a test for file rename * use inode to detect if the file is recreated/replaced and only listen to create events. * tidy ups (#1535) * tidy ups * Add tests for inode reconcile * fix linux vs windows syscall * fix linux vs windows syscall * fix windows compile error * increase timeout * use ctime ID * remove remove/creation test as it's a use case that fail in linux * fix linux/windows to use Ino/CreationTime * fix the watcher to only overwrite current file id * fix linter error * fix remove/create test * set reconcile loop to 200 Milliseconds * fix watcher to not trigger event on remove, add more tests * on a remove event try to add the file back to the watcher and trigger the handler if success * fix race condition * fix flaky test * fix race conditions * set level to info * fix when file is removed and get an event for it after * fix to trigger handler when we get a remove but re-add fail * fix error message * add tests for directory watch and fixes * detect if a file is a symlink and return an error on Add * rename Watcher to FileWatcher and remove symlink deref * add fsnotify@v1.5.1 * fix go mod * fix flaky test * Apply suggestions from code review Co-authored-by: Ashwin Venkatesh <ashwin@hashicorp.com> * fix a possible stack overflow * do not reset timer on errors, rename OS specific files * start the watcher when creating it * fix data race in tests * rename New func * do not call handler when a remove event happen * events trigger on write and rename * fix watcher tests * make handler async * remove recursive call * do not produce events for sub directories * trim "/" at the end of a directory when adding * add missing test * fix logging * add todo * fix failing test * fix flaking tests * fix flaky test * add logs * fix log text * increase timeout * reconcile when remove * check reconcile when removed * fix reconcile move test * fix logging * delete invalid file * Apply suggestions from code review Co-authored-by: R.B. Boyer <4903+rboyer@users.noreply.github.com> * fix review comments * fix is watched to properly catch a remove * change test timeout * fix test and rename id * fix test to create files with different mod time. * fix deadlock when stopping watcher * Apply suggestions from code review Co-authored-by: R.B. Boyer <4903+rboyer@users.noreply.github.com> * fix a deadlock when calling stop while emitting event is blocked * make sure to close the event channel after the event loop is done * add go doc * back date file instead of sleeping * Apply suggestions from code review Co-authored-by: R.B. Boyer <4903+rboyer@users.noreply.github.com> * check error Co-authored-by: Ashwin Venkatesh <ashwin@hashicorp.com> Co-authored-by: R.B. Boyer <4903+rboyer@users.noreply.github.com>	2022-02-21 11:36:52 -05:00
hc-github-team-consul-core	ef5b6c8415	auto-updated agent/uiserver/bindata_assetfs.go from commit 73b6687c5	2022-02-21 12:27:52 +00:00
Evan Culver	067223337d	checks: populate interval and timeout when registering services (#11138 )	2022-02-18 12:05:33 -08:00
Kyle Havlovitz	9c03b5dc3d	Merge pull request #12385 from hashicorp/tproxy-http-upstream-fix xds: respect chain protocol on default discovery chain	2022-02-18 10:08:59 -08:00
Daniel Nephin	cb1a80184f	rpc: set response to nil when not found Otherwise when the query times out we might incorrectly send a value for the reply, when we should send an empty reply. Also document errNotFound and how to handle the result in that case.	2022-02-18 12:26:06 -05:00
Daniel Nephin	79820738cc	ca: test that original certs from secondary still verify There's a chance this could flake if the secondary hasn't received the update yet, but running this test many times doesn't show any flakes yet.	2022-02-17 18:45:16 -05:00
Daniel Nephin	ca4e60e09b	Update TODOs to reference an issue with more details And remove a no longer needed TODO	2022-02-17 18:21:30 -05:00
Daniel Nephin	0abaf29c10	ca: add test cases for rotating external trusted CA	2022-02-17 18:21:30 -05:00
Daniel Nephin	aacc40012f	ca: add a test for secondary with external CA	2022-02-17 18:21:30 -05:00
Daniel Nephin	471b2098bb	ca: examine the full chain in newCARoot make TestNewCARoot much more strict compare the full result instead of only a few fields. add a test case with 2 and 3 certificates in the pem	2022-02-17 18:21:30 -05:00
Daniel Nephin	fc6c0ec139	ca: small docs improvements	2022-02-17 18:21:30 -05:00
Daniel Nephin	af651eaaad	ca: cleanup validateSetIntermediate	2022-02-17 18:21:30 -05:00
Daniel Nephin	ef03f7be73	ca: only return the leaf cert from Sign in vault provider The interface is documented as 'Sign will only return the leaf', and the other providers only return the leaf. It seems like this was added during the initial implementation, so is likely just something we missed. It doesn't break anything , but it does cause confusing cert chains in the API response which could break something in the future.	2022-02-17 18:21:30 -05:00
Daniel Nephin	2d5254a73b	Merge pull request #12110 from hashicorp/dnephin/blocking-queries-not-found rpc: make blocking queries for non-existent items more efficient	2022-02-17 18:09:39 -05:00
Ashwin Venkatesh	39be071264	Parse datacenter from request (#12370 ) * Parse datacenter from request - Parse the value of the datacenter from the create/delete requests for AuthMethods and BindingRules so that they can be created in and deleted from the datacenters specified in the request.	2022-02-17 16:41:27 -05:00
Kyle Havlovitz	58172c260b	xds: respect chain protocol on default discovery chain	2022-02-17 11:47:20 -08:00
Florian Apolloner	895da50986	Support for connect native services in topology view. (#12098 )	2022-02-16 16:51:54 -05:00
Chris S. Kim	18096fd2fb	Move IndexEntryName helpers to common files (#12365 )	2022-02-16 12:56:38 -05:00
Daniel Nephin	06657e5be0	rpc: add errNotFound to all Get queries Any query that returns a list of items is not part of this commit.	2022-02-15 18:24:34 -05:00
Daniel Nephin	bdafa24c50	Make blockingQuery efficient with 'not found' results. By using the query results as state. Blocking queries are efficient when the query matches some results, because the ModifyIndex of those results, returned as queryMeta.Mindex, will never change unless the items themselves change. Blocking queries for non-existent items are not efficient because the queryMeta.Index can (and often does) change when other entities are written. This commit reduces the churn of these queries by using a different comparison for "has changed". Instead of using the modified index, we use the existence of the results. If the previous result was "not found" and the new result is still "not found", we know we can ignore the modified index and continue to block. This is done by setting the minQueryIndex to the returned queryMeta.Index, which prevents the query from returning before a state change is observed.	2022-02-15 18:24:33 -05:00
Daniel Nephin	6e73df7dc2	Add a test for blocking query on non-existent entry This test shows how blocking queries are not efficient when the query returns no results. The test fails with 100+ calls instead of the expected 2. This test is still a bit flaky because it depends on the timing of the writes. It can sometimes return 3 calls. A future commit should fix this and make blocking queries even more optimal for not-found results.	2022-02-15 18:23:17 -05:00
Daniel Nephin	a4e1c59cd8	rpc: improve docs for blockingQuery Follow the Go convention of accepting a small interface that documents the methods used by the function. Clarify the rules for implementing a query function passed to blockingQuery.	2022-02-15 14:20:14 -05:00
R.B. Boyer	b216d52b66	server: conditionally avoid writing a config entry to raft if it was already the same (#12321 ) This will both save on unnecessary raft operations as well as unnecessarily incrementing the raft modify index of config entries subject to no-op updates.	2022-02-14 14:39:12 -06:00
FFMMM	1f8fb17be7	Vendor in rpc mono repo for net/rpc fork, go-msgpack, msgpackrpc. (#12311 ) This commit syncs ENT changes to the OSS repo. Original commit details in ENT: ``` commit 569d25f7f4578981c3801e6e067295668210f748 Author: FFMMM <FFMMM@users.noreply.github.com> Date: Thu Feb 10 10:23:33 2022 -0800 Vendor fork net rpc (#1538) * replace net/rpc w consul-net-rpc/net/rpc Signed-off-by: FFMMM <FFMMM@users.noreply.github.com> * replace msgpackrpc and go-msgpack with fork from mono repo Signed-off-by: FFMMM <FFMMM@users.noreply.github.com> * gofmt all files touched Signed-off-by: FFMMM <FFMMM@users.noreply.github.com> ``` Signed-off-by: FFMMM <FFMMM@users.noreply.github.com>	2022-02-14 09:45:45 -08:00
R.B. Boyer	d54a3e6aa1	missed this test adjustment (#12331 )	2022-02-14 11:39:00 -06:00
R.B. Boyer	0b80f70a39	local: fixes a data race in anti-entropy sync (#12324 ) The race detector noticed this initially in `TestAgentConfigWatcherSidecarProxy` but it is not restricted to just tests. The two main changes here were: - ensure that before we mutate the internal `agent/local` representation of a Service (for tags or VIPs) we clone those fields - ensure that there's no function argument joint ownership between the caller of a function and the local state when calling `AddService`, `AddCheck`, and related using `copystructure` for now.	2022-02-14 10:41:33 -06:00
Dao Thanh Tung	0519a9240e	URL-encode/decode resource names for HTTP API part 5 (#12297 )	2022-02-14 10:47:06 -05:00
Mark Anderson	fa95afdcf6	Refactor to make ACL errors more structured. (#12308 ) * First phase of refactoring PermissionDeniedError Add extended type PermissionDeniedByACLError that captures information about the accessor, particular permission type and the object and name of the thing being checked. It may be worth folding the test and error return into a single helper function, that can happen at a later date. Signed-off-by: Mark Anderson <manderson@hashicorp.com>	2022-02-11 12:53:23 -08:00
Freddy	f45bec7779	Merge pull request #12223 from hashicorp/proxycfg/passthrough-cleanup	2022-02-10 17:35:51 -07:00
freddygv	8eaca35df1	Account for upstream targets in another DC. Transparent proxies typically cannot dial upstreams in remote datacenters. However, if their upstream configures a redirect to a remote DC then the upstream targets will be in another datacenter. In that sort of case we should use the WAN address for the passthrough.	2022-02-10 17:01:57 -07:00
freddygv	7fba7456ec	Fix race of upstreams with same passthrough ip Due to timing, a transparent proxy could have two upstreams to dial directly with the same address. For example: - The orders service can dial upstreams shipping and payment directly. - An instance of shipping at address 10.0.0.1 is deregistered. - Payments is scaled up and scheduled to have address 10.0.0.1. - The orders service receives the event for the new payments instance before seeing the deregistration for the shipping instance. At this point two upstreams have the same passthrough address and Envoy will reject the listener configuration. To disambiguate this commit considers the Raft index when storing passthrough addresses. In the example above, 10.0.0.1 would only be associated with the newer payments service instance.	2022-02-10 17:01:57 -07:00
freddygv	d5a2eb677f	Ensure passthrough addresses get cleaned up Transparent proxies can set up filter chains that allow direct connections to upstream service instances. Services that can be dialed directly are stored in the PassthroughUpstreams map of the proxycfg snapshot. Previously these addresses were not being cleaned up based on new service health data. The list of addresses associated with an upstream service would only ever grow. As services scale up and down, eventually they will have instances assigned to an IP that was previously assigned to a different service. When IP addresses are duplicated across filter chain match rules the listener config will be rejected by Envoy. This commit updates the proxycfg snapshot management so that passthrough addresses can get cleaned up when no longer associated with a given upstream. There is still the possibility of a race condition here where due to timing an address is shared between multiple passthrough upstreams. That concern is mitigated by #12195, but will be further addressed in a follow-up.	2022-02-10 17:01:57 -07:00
Freddy	bb129384b7	Prevent xDS tight loop on cfg errors (#12195 )	2022-02-10 15:37:36 -07:00
Dhia Ayachi	de7598f064	fix race when starting a service while the agent `serviceManager` is … (#12302 ) * fix race when starting a service while the agent `serviceManager` is stopping * add changelog	2022-02-10 13:30:49 -05:00
Daniel Nephin	db4675bd1a	Merge pull request #12277 from hashicorp/dnephin/panic-in-service-register catalog: initialize the refs map to prevent a nil panic	2022-02-09 19:48:22 -05:00
Daniel Nephin	6376141464	config-entry: fix a panic when registering a service or ingress gateway	2022-02-09 18:49:48 -05:00
R.B. Boyer	0cd0d505fa	xds: allow only one outstanding delta request at a time (#12236 ) Fixes #11876 This enforces that multiple xDS mutations are not issued on the same ADS connection at once, so that we can 100% control the order that they are applied. The original code made assumptions about the way multiple in-flight mutations were applied on the Envoy side that was incorrect.	2022-02-08 10:36:48 -06:00
Daniel Nephin	c20412ab14	Merge pull request #12265 from hashicorp/dnephin/logging-in-tests sdk: add TestLogLevel for setting log level in tests	2022-02-07 16:11:23 -05:00
Daniel Nephin	5a0e6700c1	A test to reproduce the issue	2022-02-04 14:04:12 -05:00
Daniel Nephin	7b466a024b	Make test more readable And fix typo	2022-02-03 18:44:09 -05:00
Daniel Nephin	6721c1246d	ca: relax and move private key type/bit validation for vault This commit makes two changes to the validation. Previously we would call this validation in GenerateRoot, which happens both on initialization (when a follower becomes leader), and when a configuration is updated. We only want to do this validation during config update so the logic was moved to the UpdateConfiguration function. Previously we would compare the config values against the actual cert. This caused problems when the cert was created manually in Vault (not created by Consul). Now we compare the new config against the previous config. Using a already created CA cert should never error now. Adding the key bit and types to the config should only error when the previous values were not the defaults.	2022-02-03 17:21:20 -05:00
Daniel Nephin	3b78f81f9a	ca: small cleanup of TestConnectCAConfig_Vault_TriggerRotation_Fails Before adding more test cases	2022-02-03 17:21:20 -05:00
Daniel Nephin	f6d7a0f7b2	testing: fix test failures caused by new log level These two tests require debug logging enabled, because they look for log lines. Also switched to testify assertions because the previous errors were not clear.	2022-02-03 17:07:39 -05:00
Daniel Nephin	1a9a656a7f	sdk: add TestLogLevel for setting log level in tests And default log level to WARN.	2022-02-03 13:42:28 -05:00
Daniel Nephin	44f9229b96	ca: add a test that uses an intermediate CA as the primary CA This test found a bug in the secondary. We were appending the root cert to the PEM, but that cert was already appended. This was failing validation in Vault here: https://github.com/hashicorp/vault/blob/sdk/v0.3.0/sdk/helper/certutil/types.go#L329 Previously this worked because self signed certs have the same SubjectKeyID and AuthorityKeyID. So having the same self-signed cert repeated doesn't fail that check. However with an intermediate that is not self-signed, those values are different, and so we fail the check. A test I added in a previous commit should show that this continues to work with self-signed root certs as well.	2022-02-02 13:41:35 -05:00
Daniel Nephin	d00a9abca2	acl: un-embed ACLIdentity This is safer than embedding two interface because there are a number of places where we check the concrete type. If we check the concrete type on the top-level interface it will fail. So instead expose the ACLIdentity from a method.	2022-02-02 12:07:31 -05:00
Daniel Nephin	18ff00f985	Merge pull request #12167 from hashicorp/dnephin/acl-resolve-token-3 acl: rename ResolveTokenToIdentityAndAuthorizer to ResolveToken	2022-01-31 19:21:06 -05:00
Daniel Nephin	ff64c13c3e	Merge pull request #12166 from hashicorp/dnephin/acl-resolve-token-2 acl: remove ResolveTokenToIdentity	2022-01-31 19:19:21 -05:00
Daniel Nephin	aa4dbe2a17	acl: rename ResolveTokenToIdentityAndAuthorizer to ResolveToken This change allows us to remove one of the last remaining duplicate resolve token methods (Server.ResolveToken). With this change we are down to only 2, where the second one also handles setting the default EnterpriseMeta from the token.	2022-01-31 18:04:19 -05:00
Daniel Nephin	57eac90cae	acl: remove unused methods on fakes, and add changelog Also document the metric that was removed in a previous commit.	2022-01-31 17:53:53 -05:00
Daniel Nephin	1fb2d49826	Merge pull request #12165 from hashicorp/dnephin/acl-resolve-token acl: remove some of the duplicate resolve token methods	2022-01-31 13:27:49 -05:00
Mathew Estafanous	1113a7533c	Change error-handling across handlers. (#12225 )	2022-01-31 11:17:35 -05:00
Fulvio	eff69b484b	URL-encode/decode resource names for HTTP API part 4 (#12190 )	2022-01-28 15:01:47 -05:00
Dan Upton	ebdda4848f	streaming: split event buffer by key (#12080 )	2022-01-28 12:27:00 +00:00
freddygv	68dea758dd	Add failing test The updated test fails because passthrough upstream addresses are not being cleaned up.	2022-01-27 18:56:47 -07:00
Daniel Nephin	fa8ff28a63	ca/provider: remove ActiveRoot from Provider	2022-01-27 13:07:37 -05:00
Daniel Nephin	722e3a6ac4	ca: update MockProvider for new interface	2022-01-27 12:51:35 -05:00
Daniel Nephin	80f215675c	ca: update GenerateRoot godoc	2022-01-27 12:51:35 -05:00
Daniel Nephin	d56a1dfb2c	Merge pull request #11663 from hashicorp/dnephin/ca-remove-one-call-to-active-root-2 ca: remove second call to Provider.ActiveRoot	2022-01-27 12:41:05 -05:00
Daniel Nephin	d3324d0d27	Merge pull request #12109 from hashicorp/dnephin/blocking-query-1 rpc: make blockingQuery easier to read	2022-01-26 18:13:55 -05:00
Daniel Nephin	6fe2311ce0	acl: Remove a call to aclAccessorID I missed this on the first pass, we no longer need to look up this ID, because we have it from the Authorizer.	2022-01-26 17:21:45 -05:00
Daniel Nephin	14a40fab1a	Merge pull request #11221 from hashicorp/dnephin/acl-resolver-5 acl: extract a backend type for the ACLResolverBackend	2022-01-26 16:57:03 -05:00
Dao Thanh Tung	42d6c61b62	URL-encode/decode resource names for HTTP API part 3 (#12103 )	2022-01-26 13:12:42 -05:00
Daniel Nephin	74dc9925cc	Apply suggestions from code review Co-authored-by: Chris S. Kim <ckim@hashicorp.com>	2022-01-26 12:24:13 -05:00
Daniel Nephin	2c311161cc	acl: extract a backend type for the ACLResolverBackend This is a small step to isolate the functionality that is used for the ACLResolver from the large Client and Server structs.	2022-01-26 12:24:10 -05:00
R.B. Boyer	b999b3edfc	xds: fix for delta xDS reconnect bug in LDS/CDS (#12174 ) When a wildcard xDS type (LDS/CDS/SRDS) reconnects from a delta xDS stream, prior to envoy `1.19.0` it would populate the `ResourceNamesSubscribe` field with the full list of currently subscribed items, instead of simply omitting it to infer that it wanted everything (which is what wildcard mode means). This upstream issue was filed in envoyproxy/envoy#16063 and fixed in envoyproxy/envoy#16153 which went out in Envoy `1.19.0` and is fixed in later versions (later refactored in envoyproxy/envoy#16855). This PR conditionally forces LDS/CDS to be wildcard-only even when the connected Envoy requests a non-wildcard subscription, but only does so on versions prior to `1.19.0`, as we should not need to do this on later versions. This fixes the failure case as described here: #11833 (comment) Co-authored-by: Huan Wang <fredwanghuan@gmail.com>	2022-01-25 11:24:27 -06:00
Daniel Nephin	c1da07e2ea	acl: remove calls to ResolveIdentityFromToken We already have an ACLResolveResult, so we can get the accessor ID from it.	2022-01-22 15:05:42 -05:00
Daniel Nephin	ed1cc5f255	acl: remove ResolveTokenToIdentity By exposing the AccessorID from the primary ResolveToken method we can remove this duplication.	2022-01-22 14:47:59 -05:00
Daniel Nephin	26f0ebd96f	acl: return a resposne from ResolveToken that includes the ACLIdentity So that we can duplicate duplicate methods.	2022-01-22 14:33:09 -05:00
Daniel Nephin	314614f073	acl: remove duplicate methods Now that ACLResolver is embedded we don't need ResolveTokenToIdentity on Client and Server. Moving ResolveTokenAndDefaultMeta to ACLResolver removes the duplicate implementation.	2022-01-22 14:12:08 -05:00
Daniel Nephin	62c09b2d0a	acl: embed ACLResolver in Client and Server In preparation for removing duplicate resolve token methods.	2022-01-22 14:07:26 -05:00
Chris S. Kim	9ef448dedd	Generate bindata_assetfs.go (#12146 )	2022-01-21 16:06:44 -05:00
R.B. Boyer	05c7373a28	bulk rewrite using this script set -euo pipefail unset CDPATH cd "$(dirname "$0")" for f in $(git grep '\brequire := require\.New(' \| cut -d':' -f1 \| sort -u); do echo "=== require: $f ===" sed -i '/require := require.New(t)/d' $f # require.XXX(blah) but not require.XXX(tblah) or require.XXX(rblah) sed -i 's/\brequire\.$[a-zA-Z0-9_]$($[^tr]$/require.\1(t,\2/g' $f # require.XXX(tblah) but not require.XXX(t, blah) sed -i 's/\brequire\.$[a-zA-Z0-9_]$($t[^,]$/require.\1(t,\2/g' $f # require.XXX(rblah) but not require.XXX(r, blah) sed -i 's/\brequire\.$[a-zA-Z0-9_]$($r[^,]$/require.\1(t,\2/g' $f gofmt -s -w $f done for f in $(git grep '\bassert := assert\.New(' \| cut -d':' -f1 \| sort -u); do echo "=== assert: $f ===" sed -i '/assert := assert.New(t)/d' $f # assert.XXX(blah) but not assert.XXX(tblah) or assert.XXX(rblah) sed -i 's/\bassert\.$[a-zA-Z0-9_]$($[^tr]$/assert.\1(t,\2/g' $f # assert.XXX(tblah) but not assert.XXX(t, blah) sed -i 's/\bassert\.$[a-zA-Z0-9_]$($t[^,]$/assert.\1(t,\2/g' $f # assert.XXX(rblah) but not assert.XXX(r, blah) sed -i 's/\bassert\.$[a-zA-Z0-9_]$($r[^,]$/assert.\1(t,\2/g' $f gofmt -s -w $f done	2022-01-20 10:46:23 -06:00
R.B. Boyer	c12b0ee3d2	test: normalize require.New and assert.New syntax	2022-01-20 10:45:56 -06:00
R.B. Boyer	baf886c6f3	proxycfg: introduce explicit UpstreamID in lieu of bare string (#12125 ) The gist here is that now we use a value-type struct proxycfg.UpstreamID as the map key in ConfigSnapshot maps where we used to use "upstream id-ish" strings. These are internal only and used just for bidirectional trips through the agent cache keyspace (like the discovery chain target struct). For the few places where the upstream id needs to be projected into xDS, that's what (proxycfg.UpstreamID).EnvoyID() is for. This lets us ALWAYS inject the partition and namespace into these things without making stuff like the golden testdata diverge.	2022-01-20 10:12:04 -06:00
Dan Upton	088ba2edaf	[OSS] Remove remaining references to master (#11827 )	2022-01-20 12:47:50 +00:00
VictorBac	145703972a	Add GRPC and GRPCUseTLS to api.HealthCheckDefinition (#12108 ) * Add GRPC to HealthCheckDefinition * add GRPC and GRPCUseTLS	2022-01-19 16:09:15 -05:00
Evan Culver	ec65890f01	connect: Upgrade Envoy 1.20 to 1.20.1 (#11895 )	2022-01-18 14:35:27 -05:00
Daniel Nephin	59206e38c7	rpc: cleanup exit and blocking condition logic in blockingQuery Remove some unnecessary comments around query_blocking metric. The only line that needs any comments in the atomic decrement. Cleanup the block and return comments and logic. The old comment about AbandonCh may have been relevant before, but it is expected behaviour now. The logic was simplified by inverting the err condition.	2022-01-17 16:59:25 -05:00
Daniel Nephin	a28d1268cb	rpc: extract rpcQueryTimeout method This helps keep the logic in blockingQuery more focused. In the future we may have a separate struct for RPC queries which may allow us to move this off of Server.	2022-01-17 16:59:25 -05:00
Daniel Nephin	751bc2e7d3	rpc: move the index defaulting to setQueryMeta. This safeguard should be safe to apply in general. We are already applying it to non-blocking queries that call blockingQuery, so it should be fine to apply it to others.	2022-01-17 16:59:25 -05:00
Daniel Nephin	95e471052b	rpc: add subtests to blockingQuery test	2022-01-17 16:59:25 -05:00
Daniel Nephin	6bf8efe607	rpc: refactor blocking query To remove the TODO, and make it more readable. In general this reduces the scope of variables, making them easier to reason about. It also introduces more early returns so that we can see the flow from the structure of the function.	2022-01-17 16:58:47 -05:00
Daniel Nephin	1971a58b29	Merge pull request #11661 from hashicorp/dnephin/ca-remove-one-call-to-active-root ca: remove one call to Provider.ActiveRoot	2022-01-13 16:48:12 -05:00
Kyle Havlovitz	2ba76486d0	Add virtual IP generation for term gateway backed services	2022-01-12 12:08:49 -08:00
Chris S. Kim	4330a6a21a	Fix race with tags (#12041 )	2022-01-12 11:24:51 -05:00
Chris S. Kim	4f0a3a997c	Fix races in anti-entropy tests (#12028 )	2022-01-11 14:28:51 -05:00
Mike Morris	277c41d336	ingress: allow setting TLS min version and cipher suites in ingress gateway config entries (#11576 ) * xds: refactor ingress listener SDS configuration * xds: update resolveListenerSDS call args in listeners_test * ingress: add TLS min, max and cipher suites to GatewayTLSConfig * xds: implement envoyTLSVersions and envoyTLSCipherSuites * xds: merge TLS config * xds: configure TLS parameters with ingress TLS context from leaf * xds: nil check in resolveListenerTLSConfig validation * xds: nil check in makeTLSParameters* functions * changelog: add entry for TLS params on ingress config entries * xds: remove indirection for TLS params in TLSConfig structs * xds: return tlsContext, nil instead of ambiguous err Co-authored-by: Chris S. Kim <ckim@hashicorp.com> * xds: switch zero checks to types.TLSVersionUnspecified * ingress: add validation for ingress config entry TLS params * ingress: validate listener TLS config * xds: add basic ingress with TLS params tests * xds: add ingress listeners mixed TLS min version defaults precedence test * xds: add more explicit tests for ingress listeners inheriting gateway defaults * xds: add test for single TLS listener on gateway without TLS defaults * xds: regen golden files for TLSVersionInvalid zero value, add TLSVersionAuto listener test * types/tls: change TLSVersion to string * types/tls: update TLSCipherSuite to string type * types/tls: implement validation functions for TLSVersion and TLSCipherSuites, make some maps private * api: add TLS params to GatewayTLSConfig, add tests * api: add TLSMinVersion to ingress gateway config entry test JSON * xds: switch to Envoy TLS cipher suite encoding from types package * xds: fixup validation for TLSv1_3 min version with cipher suites * add some kitchen sink tests and add a missing struct tag * xds: check if mergedCfg.TLSVersion is in TLSVersionsWithConfigurableCipherSuites * xds: update connectTLSEnabled comment * xds: remove unsued resolveGatewayServiceTLSConfig function * xds: add makeCommonTLSContextFromLeafWithoutParams * types/tls: add LessThan comparator function for concrete values * types/tls: change tlsVersions validation map from string to TLSVersion keys * types/tls: remove unused envoyTLSCipherSuites * types/tls: enable chacha20 cipher suites for Consul agent * types/tls: remove insecure cipher suites from allowed config TLS_ECDHE_ECDSA_WITH_AES_128_CBC_SHA256 and TLS_ECDHE_RSA_WITH_AES_128_CBC_SHA256 are both explicitly listed as insecure and disabled in the Go source. Refs https://cs.opensource.google/go/go/+/refs/tags/go1.17.3:src/crypto/tls/cipher_suites.go;l=329-330 * types/tls: add ValidateConsulAgentCipherSuites function, make direct lookup map private * types/tls: return all unmatched cipher suites in validation errors * xds: check that Envoy API value matching TLS version is found when building TlsParameters * types/tls: check that value is found in map before appending to slice in MarshalEnvoyTLSCipherSuiteStrings * types/tls: cast to string rather than fmt.Printf in TLSCihperSuite.String() * xds: add TLSVersionUnspecified to list of configurable cipher suites * structs: update note about config entry warning * xds: remove TLS min version cipher suite unconfigurable test placeholder * types/tls: update tests to remove assumption about private map values Co-authored-by: R.B. Boyer <rb@hashicorp.com>	2022-01-11 11:46:42 -05:00
Dao Thanh Tung	217e2dc656	URL-encode/decode resource names for HTTP API part 2 (#11957 )	2022-01-11 08:52:45 -05:00
Daniel Nephin	262898e561	ca: remove unnecessary var, and slightly reduce cyclo complexity `newIntermediate` is always equal to `needsNewIntermediate`, so we can remove the extra variable and use the original directly. Also remove the `activeRoot.ID != newActiveRoot.ID` case from an if, because that case is already checked above, and `needsNewIntermediate` will already be true in that case. This condition now reads a lot better: > Persist a new root if we did not have one before, or if generated a new intermediate.	2022-01-06 16:56:49 -05:00
Daniel Nephin	d406f78c5c	ca: remove unused provider.ActiveRoot call In the previous commit the single use of this storedRoot was removed. In this commit the original objective is completed. The Provider.ActiveRoot is being removed because 1. the secondary should get the active root from the Consul primary DC, not the provider, so that secondary DCs do not need to communicate with a provider instance in a different DC. 2. so that the Provider.ActiveRoot interface can be changed without impacting other code paths.	2022-01-06 16:56:48 -05:00
Daniel Nephin	4d15e8a9ec	ca: extract the lookup of the active primary CA This method had only one caller, which always looked for the active root. This commit moves the lookup into the method to reduce the logic in the one caller. This is being done in preparation for a larger change. Keeping this separate so it is easier to see. The `storedRootID != primaryRoots.ActiveRootID` is being removed because these can never be different. The `storedRootID` comes from `provider.ActiveRoot`, the `primaryRoots.ActiveRootID` comes from the store `CARoot` from the primary. In both cases the source of the data is the primary DC. Technically they could be different if someone modified the provider outside of Consul, but that would break many things, so is not a supported flow. If these were out of sync because of ordering of events then the secondary will soon receive an update to `primaryRoots` and everything will be sorted out again.	2022-01-06 16:56:48 -05:00
Daniel Nephin	37b09df427	ca: update godoc To clarify what to expect from the data stored in this field, and the behaviour of this function.	2022-01-06 16:56:48 -05:00
Daniel Nephin	1f670c22f5	ca: remove one call to provider.ActiveRoot ActiveRoot should not be called from the secondary DC, because there should not be a requirement to run the same Vault instance in a secondary DC. SignIntermediate is called in a secondary DC, so it should not call ActiveRoot We would also like to change the interface of ActiveRoot so that we can support using an intermediate cert as the primary CA in Consul. In preparation for making that change I am reducing the number of calls to ActiveRoot, so that there are fewer code paths to modify when the interface changes. This change required a change to the mockCAServerDelegate we use in tests. It was returning the RootCert for SignIntermediate, but that is not an accurate fake of production. In production this would also be a separate cert.	2022-01-06 16:55:50 -05:00
Daniel Nephin	1f66120c20	ca: remove redundant append of an intermediate cert Immediately above this line we are already appending the full list of intermediates. The `provider.ActiveIntermediate` MUST be in this list of intermediates because it must be available to all the other non-leader Servers. If it was not in this list of intermediates then any proxy that received data from a non-leader would have the wrong certs. This is being removed now because we are planning on changing the `Provider.ActiveIntermediate` interface, and removing these extra calls ahead of time helps make that change easier.	2022-01-06 16:55:50 -05:00
Daniel Nephin	b66d259c1a	ca: only generate a single private key for the whole test case Using tracing and cpu profiling I found that the majority of the time in these test cases is spent generating a private key. We really don't need separate private keys, so we can generate only one and use it for all cases. With this change the test runs much faster.	2022-01-06 16:55:50 -05:00
Daniel Nephin	92a054cfa6	ca: cleanup a test Fix the name to match the function it is testing Remove unused code Fix the signature, instead of returning (error, string) which should be (string, error) accept a testing.T to emit errors. Handle the error from encode.	2022-01-06 16:55:49 -05:00
Daniel Nephin	9ec7e07db4	ca: use the new leaf signing lookup func in leader metrics	2022-01-06 16:55:49 -05:00
Blake Covarrubias	b13fb553ac	api: Return 404 when deregistering a non-existent check (#11950 ) Update the `/agent/check/deregister/` API endpoint to return a 404 HTTP response code when an attempt is made to de-register a check ID that does not exist on the agent. This brings the behavior of /agent/check/deregister/ in line with the behavior of /agent/service/deregister/ which was changed in #10632 to similarly return a 404 when de-registering non-existent services. Fixes #5821	2022-01-06 12:38:37 -08:00
Dhia Ayachi	7e0b8354a5	clone the service under lock to avoid a data race (#11940 ) * clone the service under lock to avoid a data race * add change log * create a struct and copy the pointer to mutate it to avoid a data race * fix failing test * revert added space * add comments, to clarify the data race.	2022-01-06 14:33:06 -05:00
Daniel Nephin	d05264041e	Merge pull request #11918 from hashicorp/dnephin/tob-followup Fix a few small bugs	2022-01-05 18:50:48 -05:00
Daniel Nephin	4983c27703	snapshot: return the error from replyFn The only function passed to SnapshotRPC today always returns a nil error, so there's no way to exercise this bug in practice. This change is being made for correctness so that it doesn't become a problem in the future, if we ever pass a different function to SnapshotRPC.	2022-01-05 17:51:03 -05:00
Daniel Nephin	affe97e22d	config: correctly capture all errors. Some calls to multierror.Append were not using the existing b.err, which meant we were losing all previous errors.	2022-01-05 17:51:03 -05:00
Chris S. Kim	f7f5aca058	Fix test for ENT (#11946 )	2022-01-05 15:18:08 -05:00
Chris S. Kim	407b0b8963	Fix test for ENT (#11941 )	2022-01-05 12:24:44 -05:00
Dhia Ayachi	5f6bf369af	reset `coalesceTimer` to nil as soon as the event is consumed (#11924 ) * reset `coalesceTimer` to nil as soon as the event is consumed * add change log * refactor to add relevant test. * fix linter * Apply suggestions from code review Co-authored-by: Freddy <freddygv@users.noreply.github.com> * remove non needed check Co-authored-by: Freddy <freddygv@users.noreply.github.com>	2022-01-05 12:17:47 -05:00
Mathew Estafanous	dc18933cc2	Ensure consistency with error-handling across all handlers. (#11599 )	2022-01-05 12:11:03 -05:00
Jared Kirschner	a9371f18e5	Clarify service and check error messages (use ID) Error messages related to service and check operations previously included the following substrings: - service %q - check %q From this error message, it isn't clear that the expected field is the ID for the entity, not the name. For example, if the user has a service named test, the error message would read 'Unknown service "test"'. This is misleading - a service with that name does exist, but not with that ID. The substrings above have been modified to make it clear that ID is needed, not name: - service with ID %q - check with ID %q	2022-01-04 11:42:37 -08:00
Jared Kirschner	fc076c02c7	Merge pull request #11335 from littlestar642/url-encoded-args URL-encode/decode resource names for HTTP API	2022-01-04 14:00:14 -05:00
Chris S. Kim	d87fe70a82	testing: Revert assertion for virtual IP flag (#11932 )	2022-01-04 11:24:56 -05:00
Jared Kirschner	d26f8e4529	Merge pull request #11820 from hashicorp/improve-ui-disabled-api-response http: improve UI not enabled response message	2022-01-03 12:00:01 -05:00
littlestar642	7d1f2157eb	add path escape and unescape to path params	2022-01-03 08:18:32 -08:00
Daniel Nephin	48d123e241	Merge pull request #11796 from hashicorp/dnephin/cleanup-test-server testing: stop using an old version in testServer	2021-12-22 16:04:04 -05:00
freddygv	d7975586d6	Purge chain if it shouldn't be there	2021-12-13 18:56:44 -07:00
freddygv	be85ae11ca	additional test fixes	2021-12-13 18:56:44 -07:00
freddygv	e1d4797561	Account for new upstreams constraint in tests	2021-12-13 18:56:28 -07:00
freddygv	16d3efc4b5	Check ingress upstreams when gating chain watches	2021-12-13 18:56:28 -07:00
freddygv	f4ddb5432c	Use ptr receiver in all Upstream methods	2021-12-13 18:56:14 -07:00
freddygv	d647141a7d	Avoid storing chain without an upstream	2021-12-13 18:56:14 -07:00
freddygv	9e0958f1d2	Clean up chains separately from their watches	2021-12-13 18:56:14 -07:00
freddygv	b704d4e2dd	Validate chains are associated with upstreams Previously we could get into a state where discovery chain entries were not cleaned up after the associated watch was cancelled. These changes add handling for that case where stray chain references are encountered.	2021-12-13 18:56:13 -07:00
freddygv	ea26a7b7cf	Store intention upstreams in snapshot	2021-12-13 18:56:13 -07:00
R.B. Boyer	72a81cfc4a	proxycfg: ensure all of the watches are canceled if they are cancelable (#11824 )	2021-12-13 15:56:17 -06:00
Jared Kirschner	7b78ded3c7	Merge pull request #11818 from hashicorp/improve-url-not-found-response http: improve 404 Not Found response message	2021-12-13 16:08:50 -05:00
R.B. Boyer	3dccd14d31	proxycfg: use external addresses in tproxy when crossing partition boundaries (#11823 )	2021-12-13 14:34:49 -06:00
Jared Kirschner	757236007a	http: improve 404 Not Found response message When a URL path is not found, return a non-empty message with the 404 status code to help the user understand what went wrong. If the URL path was not prefixed with '/v1/', suggest that may be the cause of the problem (which is a common mistake).	2021-12-13 11:03:25 -08:00
Freddy	f7eeffb98d	Use anonymousToken when querying by secret ID (#11813 ) Co-authored-by: Chris S. Kim <ckim@hashicorp.com> Co-authored-by: Dan Upton <daniel@floppy.co> This query has been incorrectly querying by accessor ID since New ACLs were added. However, the legacy token compat allowed this to continue to work, since it made a fallback query for the anonymousToken ID. PR #11184 removed this legacy token query, which means that the query by accessor ID is now the only check for the anonymous token's existence. This PR updates the GetBySecret call to use the secret ID of the token.	2021-12-13 10:56:09 -07:00
R.B. Boyer	a0156785dd	various partition related todos (#11822 )	2021-12-13 11:43:33 -06:00
Jared Kirschner	8b8c79ea72	http: improve UI not enabled response message Response now clearly indicates: - the UI is disabled - how to enable the UI	2021-12-13 08:48:33 -08:00
Kyle Havlovitz	b9e1dcde1c	Merge pull request #11812 from hashicorp/metrics-ui-acls oss: use wildcard partition in metrics proxy ui endpoint	2021-12-10 16:24:47 -08:00
Kyle Havlovitz	9187070a93	Merge pull request #11798 from hashicorp/vip-goroutine-check leader: move the virtual IP version check into a goroutine	2021-12-10 15:59:35 -08:00
Kyle Havlovitz	ad9c104816	acl: use wildcard partition in metrics proxy ui endpoint	2021-12-10 15:58:17 -08:00
Kyle Havlovitz	45402dad63	state: fix freed VIP table id index	2021-12-10 14:41:45 -08:00
Kyle Havlovitz	ccc119c549	Exit before starting the vip check routine if possible	2021-12-10 14:30:50 -08:00
Daniel Nephin	6444d1d4b3	testing: Deprecate functions for creating a server. These helper functions actually end up hiding important setup details that should be visible from the test case. We already have a convenient way of setting this config when calling newTestServerWithConfig.	2021-12-09 20:09:29 -05:00
Daniel Nephin	74e92316de	testing: remove old config.Build version DefaultConfig already sets the version to version.Version, so by removing this our tests will run with the version that matches the code.	2021-12-09 20:09:29 -05:00
Kyle Havlovitz	2a52630067	leader: move the virtual IP version check into a goroutine	2021-12-09 17:00:33 -08:00
FFMMM	336a234927	[sync ent] increase segment max limit to 464, make configurable (#1424 ) (#11795 ) commit b6eb27563e747a78b7647d2b5da405e46364cc46 Author: FFMMM <FFMMM@users.noreply.github.com> Date: Thu Dec 9 13:53:44 2021 -0800 increase segment max limit to 464, make configurable (#1424) Signed-off-by: FFMMM <FFMMM@users.noreply.github.com> fix: rename ent changelog file Signed-off-by: FFMMM <FFMMM@users.noreply.github.com>	2021-12-09 15:36:11 -08:00
Daniel Nephin	ded49b3ab0	Merge pull request #11780 from hashicorp/dnephin/ca-test-vault-in-secondary ca: improve test coverage for RenewIntermediate	2021-12-09 12:29:43 -05:00
R.B. Boyer	5f6bf4e756	agent: ensure service maintenance checks for matching partitions ahead of other errors (#11788 ) This matches behavior in most other agent api endpoints.	2021-12-09 10:05:02 -06:00
Daniel Nephin	e6615bdaa7	fix misleading errors on vault shutdown	2021-12-08 18:42:52 -05:00
Daniel Nephin	15c4de0c15	ca: prune some unnecessary lookups in the tests	2021-12-08 18:42:52 -05:00
Daniel Nephin	bf798094d5	ca: remove duplicate WaitFor function	2021-12-08 18:42:52 -05:00
Daniel Nephin	984986f007	ca: fix flakes in RenewIntermediate tests I suspect one problem was that we set structs.IntermediateCertRenewInterval to 1ms, which meant that in some cases the intermediate could renew before we stored the original value. Another problem was that the 'wait for intermediate' loop was calling the provider.ActiveIntermediate, but the comparison needs to use the RPC endpoint to accurately represent a user request. So changing the 'wait for' to use the state store ensures we don't race. Also moves the patching into a separate function. Removes the addition of ca.CertificateTimeDriftBuffer as part of calculating halfTime. This was added in a previous commit to attempt to fix the flake, but it did not appear to fix the problem. Adding the time here was making the tests fail when using the shared patch function. It's not clear to me why, but there's no reason we should be including this time in the halfTime calculation.	2021-12-08 18:42:52 -05:00
Daniel Nephin	bc7ec4455f	ca: improve RenewIntermediate tests Use the new verifyLearfCert to show the cert verifies with intermediates from both sources. This required using the RPC interface so that the leaf pem was constructed correctly. Add IndexedCARoots.Active since that is a common operation we see in a few places.	2021-12-08 18:42:52 -05:00
Daniel Nephin	0784073d5e	ca: add a test for Vault in secondary DC	2021-12-08 18:42:51 -05:00
Daniel Nephin	373f445db5	ca: Add CARoots.Active method Which will be used in the next commit.	2021-12-08 18:41:51 -05:00
R.B. Boyer	2f345cca33	acl: ensure that the agent recovery token is properly partitioned (#11782 )	2021-12-08 17:11:55 -06:00
Daniel Nephin	0f95a2c3b1	Merge pull request #11721 from hashicorp/dnephin/ca-export-fsm-operation ca: use the real FSM operation in tests	2021-12-08 17:49:00 -05:00
Daniel Nephin	be1ddc5942	ca: use the real FSM operation in tests Previously we had a couple copies that reproduced the FSM operation. These copies introduce risk that the test does not accurately match production. This PR removes the test versions of the FSM operation, and exports the real production FSM operation so that it can be used in tests. The consul provider tests did need to change because of this. Previously we would return a hardcoded value of 2, but in production this value is always incremented.	2021-12-08 17:29:44 -05:00
R.B. Boyer	957758cb61	test: test server should auto cleanup (#11779 )	2021-12-08 13:26:06 -06:00
Evan Culver	32a04317bf	rpc: Unset partition before forwarding to remote datacenter (#11758 )	2021-12-08 11:02:14 -08:00
Daniel Nephin	52c8b4994b	Merge remote-tracking branch 'origin/main' into serve-panic-recovery	2021-12-07 16:30:41 -05:00
Dan Upton	b19c7f17ef	Rename `Master` and `AgentMaster` fields in config protobuf (#11764 )	2021-12-07 19:59:38 +00:00
Chris S. Kim	b74ddd7b70	Godocs updates for catalog endpoints (#11716 )	2021-12-07 10:18:28 -05:00
Mathew Estafanous	6626f91ff1	Transition all endpoint tests in agent_endpoint_test.go to go through ServeHTTP (#11499 )	2021-12-07 09:44:03 -05:00
Dan Upton	4192468358	Remove references to "master" ACL tokens in tests (#11751 )	2021-12-07 12:48:50 +00:00
Dan Upton	8bc11b08dc	Rename `ACLMasterToken` => `ACLInitialManagementToken` (#11746 )	2021-12-07 12:39:28 +00:00
Dan Upton	0230ebb4ef	agent/token: rename `agent_master` to `agent_recovery` (internally) (#11744 )	2021-12-07 12:12:47 +00:00
R.B. Boyer	89e90d1ffc	return the max	2021-12-06 15:36:52 -06:00
freddygv	65875a7c69	Remove support for failover to partition Failing over to a partition is more siimilar to failing over to another datacenter than it is to failing over to a namespace. In a future release we should update how localities for failover are specified. We should be able to accept a list of localities which can include both partition and datacenter.	2021-12-06 12:32:24 -07:00
freddygv	a1c1e36be7	Allow cross-partition references in disco chain * Add partition fields to targets like service route destinations * Update validation to prevent cross-DC + cross-partition references * Handle partitions when reading config entries for disco chain * Encode partition in compiled targets	2021-12-06 12:32:19 -07:00
R.B. Boyer	5ea4b82940	light refactors to support making partitions and serf-based wan federation are mutually exclusive (#11755 )	2021-12-06 13:18:02 -06:00
R.B. Boyer	80422c0dfe	areas: make the gRPC server tracker network area aware (#11748 ) Fixes a bug whereby servers present in multiple network areas would be properly segmented in the Router, but not in the gRPC mirror. This would lead servers in the current datacenter leaving from a network area (possibly during the network area's removal) from deleting their own records that still exist in the standard WAN area. The gRPC client stack uses the gRPC server tracker to execute all RPCs, even those targeting members of the current datacenter (which is unlike the net/rpc stack which has a bypass mechanism). This would manifest as a gRPC method call never opening a socket because it would block forever waiting for the current datacenter's pool of servers to be non-empty.	2021-12-06 09:55:54 -06:00
Freddy	d86b98c503	Merge pull request #11739 from hashicorp/ap/exports-rename	2021-12-06 08:20:50 -07:00
freddygv	a2fd30e514	Clean up additional refs to partition exports	2021-12-04 15:16:40 -07:00
freddygv	02fb323652	Rename partition-exports to exported-services Using a name less tied to partitions gives us more flexibility to use this config entry in OSS for exports between datacenters/meshes.	2021-12-03 17:47:31 -07:00
freddygv	fcfed67246	Update intention topology to use new table	2021-12-03 17:28:31 -07:00
freddygv	4acbdc4618	Avoid updating default decision from wildcard ixn Given that we do not allow wildcard partitions in intentions, no one ixn can override the DefaultAllow setting. Only the default ACL policy applies across all partitions.	2021-12-03 17:28:12 -07:00
freddygv	142d8193e5	Add a new table to query service names by kind This table purposefully does not index by partition/namespace. It's a global view into all service names. This table is intended to replace the current serviceListTxn watch in intentionTopologyTxn. For cross-partition transparent proxying we need to be able to calculate upstreams from intentions in any partition. This means that the existing serviceListTxn function is insufficient since it's scoped to a partition. Moving away from that function is also beneficial because it watches the main "services" table, so watchers will wake up when any instance is registered or deregistered.	2021-12-03 17:28:12 -07:00
freddygv	97b4068137	Update listener generation to account for consul VIP	2021-12-03 17:27:56 -07:00
Freddy	3eddf98e62	Merge pull request #11680 from hashicorp/ap/partition-exports-oss	2021-12-03 16:57:50 -07:00
Dan Upton	2f4b8d7a7d	internal: support `ResultsFilteredByACLs` flag/header (#11643 )	2021-12-03 23:04:24 +00:00
Dan Upton	43e28a3af6	query: support `ResultsFilteredByACLs` in query list endpoint (#11620 )	2021-12-03 23:04:09 +00:00
Dhia Ayachi	e38ccf0a22	port oss changes (#11736 )	2021-12-03 17:23:55 -05:00
Freddy	3791d6d7da	Merge pull request #11720 from hashicorp/bbolt	2021-12-03 14:44:36 -07:00
Dan Upton	1d694df02b	fedstate: support `ResultsFilteredByACLs` in `ListMeshGateways` endpoint (#11644 )	2021-12-03 20:56:55 +00:00
Dan Upton	0489ea187d	catalog: support `ResultsFilteredByACLs` flag/header (#11594 )	2021-12-03 20:56:14 +00:00
Dan Upton	8bb1b89554	coordinate: support `ResultsFilteredByACLs` flag/header (#11617 )	2021-12-03 20:51:02 +00:00
Dan Upton	a62aa3847d	sessions: support `ResultsFilteredByACLs` flag/header (#11606 )	2021-12-03 20:43:43 +00:00
Dan Upton	0a7ba5162e	txn: support `ResultsFilteredByACLs` flag in `Read` endpoint (#11632 )	2021-12-03 20:41:03 +00:00
Dan Upton	001bcac084	agent: support `X-Consul-Results-Filtered-By-ACLs` header in agent-local endpoints (#11610 )	2021-12-03 20:36:28 +00:00
Dhia Ayachi	a8874c65f7	sessions partitioning tests (#11734 ) * state: port KV and Tombstone tables to new pattern * go fmt'ed * handle wildcards for tombstones * Fix graveyard ent vs oss * fix oss compilation error * add partition to tombstones and kv state store indexes * refactor to use `indexWithEnterpriseIndexable` * Apply suggestions from code review Co-authored-by: Chris S. Kim <ckim@hashicorp.com> Co-authored-by: R.B. Boyer <4903+rboyer@users.noreply.github.com> * add `singleValueID` implementation assertions * partition `tableSessions` table * fix sessions to use UUID and fix prefix index * fix oss build * clean up unused functions * fix oss compilation * add a partition indexer for sessions * Fix oss to not have partition index * fix oss tests * remove unused operations_ent.go and operations_oss.go func * remove unused const * convert `IndexID` of `session_checks` table * convert `indexSession` of `session_checks` table * convert `indexNodeCheck` of `session_checks` table * partition `indexID` and `indexSession` of `tableSessionChecks` * fix oss linter * fix review comments * remove partition for Checks as it's always use the session partition * fix tests * fix tests * do not namespace nodeChecks index Co-authored-by: Daniel Nephin <dnephin@hashicorp.com> Co-authored-by: Chris S. Kim <ckim@hashicorp.com> Co-authored-by: R.B. Boyer <4903+rboyer@users.noreply.github.com>	2021-12-03 15:36:07 -05:00
Dan Upton	b10e69ffda	intention: support `ResultsFilteredByACLs` flag/header (#11612 )	2021-12-03 20:35:54 +00:00
Mark Anderson	e8f542030e	Cross port of ent #1383 (#11726 ) Cross port of ent #1383 "Reject non-default datacenter when making partitioned ACLs" On the OSS side this is a minor refactor to add some more checks that are only applicable to enterprise code. Signed-off-by: Mark Anderson <manderson@hashicorp.com>	2021-12-03 10:20:25 -08:00
Dan Upton	1d571bb503	config: support `ResultsFilteredByACLs` in list/list all endpoints (#11621 )	2021-12-03 17:39:47 +00:00
Dan Upton	86cf697e52	event: support `X-Consul-Results-Filtered-By-ACLs` header in list (#11616 )	2021-12-03 17:38:59 +00:00
Dan Upton	44bc833318	kv: support `ResultsFilteredByACLs` in list/list keys (#11593 )	2021-12-03 17:31:48 +00:00
Dan Upton	3ad8540d23	health: support `ResultsFilteredByACLs` flag/header (#11602 )	2021-12-03 17:31:32 +00:00
Dan Upton	0efe478044	Groundwork for exposing when queries are filtered by ACLs (#11569 )	2021-12-03 17:11:26 +00:00
Kyle Havlovitz	a0ea359147	dns: add endpoint for querying service virtual IPs	2021-12-02 16:40:28 -08:00
Kyle Havlovitz	dbb58b726a	Merge pull request #11724 from hashicorp/service-virtual-ips oss: add virtual IP generation for connect services	2021-12-02 16:16:57 -08:00
Kyle Havlovitz	db88f95fbe	consul: add virtual IP generation for connect services	2021-12-02 15:42:47 -08:00
R.B. Boyer	6ec84cfbe2	agent: add variation of force-leave that exclusively works on the WAN (#11722 ) Fixes #6548	2021-12-02 17:15:10 -06:00
Matt Keeler	68e629a476	Emit raft-boltdb metrics	2021-12-02 16:56:15 -05:00
Daniel Nephin	8e2c71528f	config: add NoFreelistSync option # Conflicts: # agent/config/testdata/TestRuntimeConfig_Sanitize-enterprise.golden # agent/consul/server.go	2021-12-02 16:56:15 -05:00
Matt Keeler	1f49738167	Use raft-boltdb/v2	2021-12-02 16:56:15 -05:00
Daniel Nephin	fa32c78429	ca: set the correct SigningKeyID after config update with Vault provider The test added in this commit shows the problem. Previously the SigningKeyID was set to the RootCert not the local leaf signing cert. This same bug was fixed in two other places back in 2019, but this last one was missed. While fixing this bug I noticed I had the same few lines of code in 3 places, so I extracted a new function for them. There would be 4 places, but currently the InitializeCA flow sets this SigningKeyID in a different way, so I've left that alone for now.	2021-12-02 16:07:11 -05:00
Daniel Nephin	a0014e13fd	Merge pull request #11713 from hashicorp/dnephin/ca-test-names ca: make test naming consistent	2021-12-02 16:05:42 -05:00
Daniel Nephin	720d782225	Merge pull request #11671 from hashicorp/dnephin/ca-fix-storing-vault-intermediate ca: fix storing the leaf signing cert with Vault provider	2021-12-02 16:02:24 -05:00
Daniel Nephin	a0160f7426	Merge pull request #11677 from hashicorp/dnephin/freeport-interface sdk: use t.Cleanup in freeport and remove unnecessary calls	2021-12-02 15:58:41 -05:00
Daniel Nephin	c1cb77b829	ca: make test naming consistent While working on the CA system it is important to be able to run all the tests related to the system, without having to wait for unrelated tests. There are many slow and unrelated tests in agent/consul, so we need some way to filter to only the relevant tests. This PR renames all the CA system related tests to start with either `TestCAMananger` for tests of internal operations that don't have RPC endpoint, or `TestConnectCA` for tests of RPC endpoints. This allows us to run all the test with: go test -run 'TestCAMananger\|TestConnectCA' ./agent/consul The test naming follows an undocumented convention of naming tests as follows: Test[<struct name>_]<function name>[_<test case description>] I tried to always keep Primary/Secondary at the end of the description, and _Vault_ has to be in the middle because of our regex to run those tests as a separate CI job. You may notice some of the test names changed quite a bit. I did my best to identify the underlying method being tested, but I may have been slightly off in some cases.	2021-12-02 14:57:09 -05:00
FFMMM	38c457b486	add MustRevalidate flag to connect_ca_leaf cache type; always use on non-blocking queries (#11693 ) * always use MustRevalidate on non-blocking queries for connect ca leaf Signed-off-by: FFMMM <FFMMM@users.noreply.github.com> * Update agent/agent_endpoint_test.go Co-authored-by: Daniel Nephin <dnephin@hashicorp.com> * pr feedback Signed-off-by: FFMMM <FFMMM@users.noreply.github.com> Co-authored-by: Daniel Nephin <dnephin@hashicorp.com>	2021-12-02 11:32:15 -08:00
Daniel Nephin	460f8919c9	ca: make getLeafSigningCertFromRoot safer As a method on the struct type this would not be safe to call without first checking c.isIntermediateUsedToSignLeaf. So for now, move this logic to the CAMananger, so that it is always correct.	2021-12-02 12:42:49 -05:00
Daniel Nephin	64532ef636	ca: fix stored CARoot representation with Vault provider We were not adding the local signing cert to the CARoot. This commit fixes that bug, and also adds support for fixing existing CARoot on upgrade. Also update the tests for both primary and secondary to be more strict. Check the SigningKeyID is correct after initialization and rotation.	2021-12-02 12:42:49 -05:00
Dan Upton	eff3dc09b6	Rename `agent_master` ACL token in the API and CLI (#11669 )	2021-12-02 17:05:27 +00:00
Dan Upton	e1829a8706	Rename `master` and `agent_master` ACL tokens in the config file format (#11665 )	2021-12-01 21:08:14 +00:00
Chris S. Kim	67eacee31e	ENT to OSS sync (#11703 )	2021-12-01 14:56:10 -05:00
R.B. Boyer	70b143ddc5	auto-config: ensure the feature works properly with partitions (#11699 )	2021-12-01 13:32:34 -06:00
Daniel Nephin	963a9819d0	ca: add some godoc and func for finding leaf signing cert This will be used in a follow up commit.	2021-11-30 18:36:41 -05:00
Daniel Nephin	056a52ba64	sdk/freeport: rename Port to GetOne For better consistency with GetN	2021-11-30 17:32:41 -05:00
Chris S. Kim	e9c661db7f	Refactor test helper (#11689 ) Allow custom ACL root tokens to be passed	2021-11-30 13:22:07 -05:00
Chris S. Kim	0ec67cc2d1	acl: Fill authzContext from token in Coordinate endpoints (#11688 )	2021-11-30 13:17:41 -05:00
freddygv	76146dfc5b	Move ent config test to ent file	2021-11-29 12:15:17 -07:00
freddygv	6d51282adf	Prevent partition-exports entry from OSS usage Validation was added on the config entry kind since that is called when validating config entries to bootstrap via agent configuration and when applying entries via the config RPC endpoint.	2021-11-29 11:24:16 -07:00
Daniel Nephin	4f0d092c95	testing: remove unnecessary calls to freeport Previously we believe it was necessary for all code that required ports to use freeport to prevent conflicts. https://github.com/dnephin/freeport-test shows that it is actually save to use port 0 (`127.0.0.1:0`) as long as it is passed directly to `net.Listen`, and the listener holds the port for as long as it is needed. This works because freeport explicitly avoids the ephemeral port range, and port 0 always uses that range. As you can see from the test output of https://github.com/dnephin/freeport-test, the two systems never use overlapping ports. This commit converts all uses of freeport that were being passed directly to a net.Listen to use port 0 instead. This allows us to remove a bit of wrapping we had around httptest, in a couple places.	2021-11-29 12:19:43 -05:00
Daniel Nephin	20a8e11bf2	testing: use the new freeport interfaces	2021-11-27 15:39:46 -05:00
Daniel Nephin	2cf41e4dc8	go-sso: remove returnFunc now that freeport handles return	2021-11-27 15:29:38 -05:00
Daniel Nephin	8219e8571e	sdk: add freeport functions that use t.Cleanup	2021-11-27 15:04:43 -05:00
Daniel Nephin	772d8f7381	ca: clean up unnecessary raft.Apply response checking In d2ab767fef21244e9fe3b9887ea70fc177912381 raftApply was changed to handle this check in a single place, instad of having every caller check it. It looks like these few places were missed when I did that clean up. This commit removes the remaining resp.(error) checks, since they are all no-ops now.	2021-11-26 17:57:55 -05:00
Daniel Nephin	48954adfdc	Merge pull request #11339 from hashicorp/dnephin/ca-manager-isolate-secondary-2 ca: reduce use of state in the secondary	2021-11-26 14:41:45 -05:00
Daniel Nephin	8240286956	ca: remove state check in secondarySetPrimaryRoots This function is only ever called from operations that have already acquired the state lock, so checking the value of state can never fail. This change is being made in preparation for splitting out a separate type for the secondary logic. The state can't easily be shared, so really only the expored top-level functions should acquire the 'state lock'.	2021-11-26 14:14:47 -05:00
Daniel Nephin	877094e2fa	ca: remove actingSecondaryCA This commit removes the actingSecondaryCA field, and removes the stateLock around it. This field was acting as a proxy for providerRoot != nil, so replace it with that check instead. The two methods which called secondarySetCAConfigured already set the state, so checking the state again at this point will not catch runtime errors (only programming errors, which we can catch with tests). In general, handling state transitions should be done on the "entrypoint" methods where execution starts, not in every internal method. This is being done to remove some unnecessary references to c.state, in preparations for extracting types for primary/secondary.	2021-11-26 14:14:47 -05:00
Daniel Nephin	cd5f6b2dfb	ca: reduce consul provider backend interface a bit This makes it easier to fake, which will allow me to use the ConsulProvider as an 'external PKI' to test a customer setup where the actual root CA is not the root we use for the Consul CA. Replaces a call to the state store to fetch the clusterID with the clusterID field already available on the built-in provider.	2021-11-25 11:46:06 -05:00
Dhia Ayachi	f605689154	Partition/kv indexid sessions (#11639 ) * state: port KV and Tombstone tables to new pattern * go fmt'ed * handle wildcards for tombstones * Fix graveyard ent vs oss * fix oss compilation error * add partition to tombstones and kv state store indexes * refactor to use `indexWithEnterpriseIndexable` * Apply suggestions from code review Co-authored-by: Chris S. Kim <ckim@hashicorp.com> Co-authored-by: R.B. Boyer <4903+rboyer@users.noreply.github.com> * partition `tableSessions` table * fix sessions to use UUID and fix prefix index * fix oss build * clean up unused functions * fix oss compilation * add a partition indexer for sessions * Fix oss to not have partition index * fix oss tests * remove unused operations_ent.go and operations_oss.go func * convert `indexNodeCheck` of `session_checks` table * partition `indexID` and `indexSession` of `tableSessionChecks` * remove partition for Checks as it's always use the session partition * partition sessions index id table * fix rebase issues Co-authored-by: Daniel Nephin <dnephin@hashicorp.com> Co-authored-by: Chris S. Kim <ckim@hashicorp.com> Co-authored-by: R.B. Boyer <4903+rboyer@users.noreply.github.com>	2021-11-24 11:34:36 -05:00
Dhia Ayachi	b1c4be3da0	Partition session checks store (#11638 ) * state: port KV and Tombstone tables to new pattern * go fmt'ed * handle wildcards for tombstones * Fix graveyard ent vs oss * fix oss compilation error * add partition to tombstones and kv state store indexes * refactor to use `indexWithEnterpriseIndexable` * Apply suggestions from code review Co-authored-by: Chris S. Kim <ckim@hashicorp.com> Co-authored-by: R.B. Boyer <4903+rboyer@users.noreply.github.com> * add `singleValueID` implementation assertions * partition `tableSessions` table * fix sessions to use UUID and fix prefix index * fix oss build * clean up unused functions * fix oss compilation * add a partition indexer for sessions * Fix oss to not have partition index * fix oss tests * remove unused operations_ent.go and operations_oss.go func * remove unused const * convert `IndexID` of `session_checks` table * convert `indexSession` of `session_checks` table * convert `indexNodeCheck` of `session_checks` table * partition `indexID` and `indexSession` of `tableSessionChecks` * fix oss linter * fix review comments * remove partition for Checks as it's always use the session partition Co-authored-by: Daniel Nephin <dnephin@hashicorp.com> Co-authored-by: Chris S. Kim <ckim@hashicorp.com> Co-authored-by: R.B. Boyer <4903+rboyer@users.noreply.github.com>	2021-11-24 09:10:38 -05:00
Chris S. Kim	c22adc8dc7	cleanup: Clarify deprecated legacy intention endpoints (#11635 )	2021-11-23 19:32:18 -05:00
Chris S. Kim	d2b86e7f48	Merge from ent (#11506 )	2021-11-19 11:50:44 -05:00
R.B. Boyer	fa7a66cd30	agent: purge service/check registration files for incorrect partitions on reload (#11607 )	2021-11-18 14:44:20 -06:00
Iryna Shustava	bd3fb0d0e9	connect: Support auth methods for the vault connect CA provider (#11573 ) * Support vault auth methods for the Vault connect CA provider * Rotate the token (re-authenticate to vault using auth method) when the token can no longer be renewed	2021-11-18 13:15:28 -07:00
Daniel Nephin	fee9696d4f	ca: use the cluster ID passed to the primary instead of fetching it from the state store.	2021-11-16 16:57:22 -05:00
Daniel Nephin	07a33a1526	ca: accept only the cluster ID to SpiffeIDSigningForCluster To make it more obivous where ClusterID is used, and remove the need to create a struct when only one field is used.	2021-11-16 16:57:21 -05:00
Will Jordan	2e66b7a5e6	Update node info sync comment (#11465 )	2021-11-16 11:16:11 -08:00
R.B. Boyer	83bf7ab3ff	re-run gofmt on 1.17 (#11579 ) This should let freshly recompiled golangci-lint binaries using Go 1.17 pass 'make lint'	2021-11-16 12:04:01 -06:00
R.B. Boyer	086ff42b56	partitions: various refactors to support partitioning the serf LAN pool (#11568 )	2021-11-15 09:51:14 -06:00
freddygv	f33eae6fe1	Update proxycfg for ingress service partitions	2021-11-12 14:33:31 -07:00
freddygv	dc7ea2ef1e	Accept partition for ingress services	2021-11-12 14:33:14 -07:00
freddygv	5ac1ab359b	Move assertion to after config fetch	2021-11-10 10:50:08 -07:00
freddygv	2261d51515	Use ClusterID to check for readiness The TrustDomain is populated from the Host() method which includes the hard-coded "consul" domain. This means that despite having an empty cluster ID, the TrustDomain won't be empty.	2021-11-10 10:45:22 -07:00
freddygv	482d3bc610	Prevent replicating partition-exports	2021-11-09 16:42:42 -07:00
freddygv	739490df12	handle error scenario of empty local DC	2021-11-09 16:42:42 -07:00
freddygv	b9b41625b9	Restrict DC for partition-exports writes There are two restrictions: - Writes from the primary DC which explicitly target a secondary DC. - Writes to a secondary DC that do not explicitly target the primary DC. The first restriction is because the config entry is not supported in secondary datacenters. The second restriction is to prevent the scenario where a user writes the config entry to a secondary DC, the write gets forwarded to the primary, but then the config entry does not apply in the secondary. This makes the scope more explicit.	2021-11-09 16:42:42 -07:00
Freddy	eb2b40b22d	Update filter chain creation for sidecar/ingress listeners (#11245 ) The duo of `makeUpstreamFilterChainForDiscoveryChain` and `makeListenerForDiscoveryChain` were really hard to reason about, and led to concealing a bug in their branching logic. There were several issues here: - They tried to accomplish too much: determining filter name, cluster name, and whether RDS should be used. - They embedded logic to handle significantly different kinds of upstream listeners (passthrough, prepared query, typical services, and catch-all) - They needed to coalesce different data sources (Upstream and CompiledDiscoveryChain) Rather than handling all of those tasks inside of these functions, this PR pulls out the RDS/clusterName/filterName logic. This refactor also fixed a bug with the handling of [UpstreamDefaults](https://www.consul.io/docs/connect/config-entries/service-defaults#defaults). These defaults get stored as UpstreamConfig in the proxy snapshot with a DestinationName of "", since they apply to all upstreams. However, this wildcard destination name must not be used when creating the name of the associated upstream cluster. The coalescing logic in the original functions here was in some situations creating clusters with a `.` prefix, which is not a valid destination.	2021-11-09 14:43:51 -07:00
Kyle Havlovitz	14591de8d2	Merge pull request #11461 from deblasis/feature/empty_client_addr_warning config: warn the user if client_addr is empty	2021-11-09 09:37:38 -08:00
Daniel Upton	caa5b5a5a6	xds: prefer fed state gateway definitions if they're fresher (#11522 ) Fixes an issue described in #10132, where if two DCs are WAN federated over mesh gateways, and the gateway in the non-primary DC is terminated and receives a new IP address (as is commonly the case when running them on ephemeral compute instances) the primary DC is unable to re-establish its connection until the agent running on its own gateway is restarted. This was happening because we always preferred gateways discovered by the `Internal.ServiceDump` RPC (which would fail because there's no way to dial the remote DC) over those discovered in the federation state, which is replicated as long as the primary DC's gateway is reachable.	2021-11-09 16:45:36 +00:00
Freddy	0ad360fadf	Merge pull request #11514 from hashicorp/dnephin/ca-fix-secondary-init ca: properly handle the case where the secondary initializes after the primary	2021-11-08 17:16:16 -07:00
freddygv	e6622ab0ab	Avoid returning empty roots with uninitialized CA Currently getCARoots could return an empty object with an empty trust domain before the CA is initialized. This commit returns an error while there is no CA config or no trust domain. There could be a CA config and no trust domain because the CA config can be created in InitializeCA before initialization succeeds.	2021-11-08 16:51:49 -07:00
Dhia Ayachi	f61892393f	refactor session state store tables to use the new index pattern (#11525 ) * state: port KV and Tombstone tables to new pattern * go fmt'ed * handle wildcards for tombstones * Fix graveyard ent vs oss * fix oss compilation error * add partition to tombstones and kv state store indexes * refactor to use `indexWithEnterpriseIndexable` * Apply suggestions from code review Co-authored-by: Chris S. Kim <ckim@hashicorp.com> Co-authored-by: R.B. Boyer <4903+rboyer@users.noreply.github.com> * add `singleValueID` implementation assertions * partition `tableSessions` table * fix sessions to use UUID and fix prefix index * fix oss build * clean up unused functions * fix oss compilation * add a partition indexer for sessions * Fix oss to not have partition index * fix oss tests * remove unused func `prefixIndexFromServiceNameAsString` * fix test error check * remove unused operations_ent.go and operations_oss.go func * remove unused const Co-authored-by: Daniel Nephin <dnephin@hashicorp.com> Co-authored-by: Chris S. Kim <ckim@hashicorp.com> Co-authored-by: R.B. Boyer <4903+rboyer@users.noreply.github.com>	2021-11-08 16:20:50 -05:00
Dhia Ayachi	dfafd4e38c	KV refactoring, part 2 (#11512 ) * add partition to the kv get pretty print * fix failing test * add test for kvs RPC endpoint	2021-11-08 11:43:21 -05:00
Dhia Ayachi	17190c0076	KV state store refactoring and partitioning (#11510 ) * state: port KV and Tombstone tables to new pattern * go fmt'ed * handle wildcards for tombstones * Fix graveyard ent vs oss * fix oss compilation error * add partition to tombstones and kv state store indexes * refactor to use `indexWithEnterpriseIndexable` * partition kvs indexID table * add `partitionedIndexEntryName` in oss for test purpose * Apply suggestions from code review Co-authored-by: Chris S. Kim <ckim@hashicorp.com> Co-authored-by: R.B. Boyer <4903+rboyer@users.noreply.github.com> * add `singleValueID` implementation assertions * remove entmeta reference from oss Co-authored-by: R.B. Boyer <4903+rboyer@users.noreply.github.com> Co-authored-by: Daniel Nephin <dnephin@hashicorp.com> Co-authored-by: Chris S. Kim <ckim@hashicorp.com> Co-authored-by: R.B. Boyer <4903+rboyer@users.noreply.github.com>	2021-11-08 09:35:56 -05:00
Giulio Micheloni	10cdc0a5c8	Merge branch 'main' into serve-panic-recovery	2021-11-06 16:12:06 +01:00
Daniel Nephin	69ad7c0544	ca: Only initialize clusterID in the primary The secondary must get the clusterID from the primary	2021-11-05 18:08:44 -04:00
Daniel Nephin	3173582b75	ca: return an error when secondary fails to initialize Previously secondaryInitialize would return nil in this case, which prevented the deferred initialize from happening, and left the CA in an uninitialized state until a config update or root rotation. To fix this I extracted the common parts into the delegate implementation. However looking at this again, it seems like the handling in secondaryUpdateRoots is impossible, because that function should never be called before the secondary is initialzied. I beleive we can remove some of that logic in a follow up.	2021-11-05 18:02:51 -04:00
Daniel Nephin	db29ad346b	acl: remove id and revision from Policy constructors The fields were removed in a previous commit. Also remove an unused constructor for PolicyMerger	2021-11-05 15:45:08 -04:00
Daniel Nephin	617b11302f	acl: remove Policy.ID and Policy.Revision These two fields do not appear to be used anywhere. We use the structs.ACLPolicy ID in the ACLResolver cache, but the acl.Policy ID and revision are not used.	2021-11-05 15:43:52 -04:00
R.B. Boyer	1d8e7bb565	rename helper method to reflect the non-deprecated terminology (#11509 )	2021-11-05 13:51:50 -05:00
Connor	b3af482e09	Support Vault Namespaces explicitly in CA config (#11477 ) * Support Vault Namespaces explicitly in CA config If there is a Namespace entry included in the Vault CA configuration, set it as the Vault Namespace on the Vault client Currently the only way to support Vault namespaces in the Consul CA config is by doing one of the following: 1) Set the VAULT_NAMESPACE environment variable which will be picked up by the Vault API client 2) Prefix all Vault paths with the namespace Neither of these are super pleasant. The first requires direct access and modification to the Consul runtime environment. It's possible and expected, not super pleasant. The second requires more indepth knowledge of Vault and how it uses Namespaces and could be confusing for anyone without that context. It also infers that it is not supported * Add changelog * Remove fmt.Fprint calls * Make comment clearer * Add next consul version to website docs * Add new test for default configuration * go mod tidy * Add skip if vault not present * Tweak changelog text	2021-11-05 11:42:28 -05:00
R.B. Boyer	7fbf749bc4	segments: ensure that the serf_lan_allowed_cidrs applies to network segments (#11495 )	2021-11-04 17:17:19 -05:00
Mark Anderson	e9a0fa7d36	Remove some usage of md5 from the system (#11491 ) * Remove some usage of md5 from the system OSS side of https://github.com/hashicorp/consul-enterprise/pull/1253 This is a potential security issue because an attacker could conceivably manipulate inputs to cause persistence files to collide, effectively deleting the persistence file for one of the colliding elements. Signed-off-by: Mark Anderson <manderson@hashicorp.com>	2021-11-04 13:07:54 -07:00
FFMMM	9afecfa10c	plumb thru root cert tll to the aws ca provider (#11449 ) * plumb thru root cert ttl to the aws ca provider Signed-off-by: FFMMM <FFMMM@users.noreply.github.com> * Update .changelog/11449.txt Co-authored-by: Dhia Ayachi <dhia@hashicorp.com> Co-authored-by: Dhia Ayachi <dhia@hashicorp.com>	2021-11-04 12:19:08 -07:00
FFMMM	e7ffef54ee	fix aws pca certs (#11470 ) Signed-off-by: FFMMM <FFMMM@users.noreply.github.com>	2021-11-03 12:21:24 -07:00
Mathew Estafanous	508664440d	Convert (some) test endpoints to use ServeHTTP instead of direct calls to handlers. (#11445 )	2021-11-03 11:12:36 -04:00
FFMMM	27227c0fd2	add root_cert_ttl option for consul connect, vault ca providers (#11428 ) * add root_cert_ttl option for consul connect, vault ca providers Signed-off-by: FFMMM <FFMMM@users.noreply.github.com> * Apply suggestions from code review Co-authored-by: Chris S. Kim <ckim@hashicorp.com> * add changelog, pr feedback Signed-off-by: FFMMM <FFMMM@users.noreply.github.com> * Update .changelog/11428.txt, more docs Co-authored-by: Daniel Nephin <dnephin@hashicorp.com> * Update website/content/docs/agent/options.mdx Co-authored-by: Kyle Havlovitz <kylehav@gmail.com> Co-authored-by: Chris S. Kim <ckim@hashicorp.com> Co-authored-by: Daniel Nephin <dnephin@hashicorp.com> Co-authored-by: Kyle Havlovitz <kylehav@gmail.com>	2021-11-02 11:02:10 -07:00
Daniel Nephin	0ec2a804df	Merge pull request #10690 from tarat44/h2c-support-in-ping-checks add support for h2c in h2 ping health checks	2021-11-02 13:53:06 -04:00
Alessandro De Blasis	2b3f4efbab	config: warn the user if client_addr is empty if the provided value is empty string then the client services (DNS, HTTP, HTTPS, GRPC) are not listening and the user is not notified in any way about what's happening. Also, since a not provided client_addr defaults to 127.0.0.1, we make sure we are not getting unwanted warnings Signed-off-by: Alessandro De Blasis <alex@deblasis.net>	2021-11-01 22:47:20 +00:00
Daniel Nephin	00ed2b243f	Merge pull request #10771 from hashicorp/dnephin/emit-telemetry-metrics-immediately telemetry: improve cert expiry metrics	2021-11-01 18:31:03 -04:00
freddygv	ecccf22fd7	Exclude default partition from GatewayKey string This will behave the way we handle SNI and SPIFFE IDs, where the default partition is excluded. Excluding the default ensures that don't attempt to compare default.dc2 to dc2 in OSS.	2021-11-01 14:45:52 -06:00
freddygv	d944e6ae3a	Update GatewayKeys deduplication Federation states data is only keyed on datacenter, so it cannot be directly compared against keys for gateway groups.	2021-11-01 13:58:53 -06:00
freddygv	ce43e8cf99	Store GatewayKey in proxycfg snapshot for re-use	2021-11-01 13:58:53 -06:00
freddygv	51c888a41a	Update locality check in xds	2021-11-01 13:58:53 -06:00
freddygv	6657c88296	Update locality check in proxycfg	2021-11-01 13:58:53 -06:00
Daniel Nephin	c706bf135c	Merge pull request #11340 from hashicorp/dnephin/ca-manager-provider ca: split the Provider interface into Primary/Secondary	2021-11-01 14:11:15 -04:00
Daniel Nephin	eaaceedf31	Merge pull request #11338 from hashicorp/dnephin/ca-manager-isolate-secondary ca: clearly identify methods that are primary-only or secondary-only	2021-11-01 14:10:31 -04:00
Daniel Upton	a620b6be2e	Support Check-And-Set deletion of config entries (#11419 ) Implements #11372	2021-11-01 16:42:01 +00:00
Dhia Ayachi	4d763ef9e6	regenerate expired certs (#11462 ) * regenerate expired certs * add documentation to generate tests certificates	2021-11-01 11:40:16 -04:00
Jared Kirschner	6dfcbeceec	Merge pull request #11348 from kbabuadze/fix-answers-alt-domain Fix answers for alt domain	2021-10-29 17:09:20 -04:00
R.B. Boyer	d40d098321	agent: for various /v1/agent endpoints parse the partition parameter on the request (#11444 ) Also update the corresponding CLI commands to send the parameter appropriately. NOTE: Behavioral changes are not happening in this PR.	2021-10-28 16:44:38 -05:00
R.B. Boyer	017e9d5ae4	agent: add a clone function for duplicating the serf lan configuration (#11443 )	2021-10-28 16:11:26 -05:00
Daniel Nephin	a8d6392ab5	Add tests for cert expiry metrics	2021-10-28 14:38:57 -04:00
Daniel Nephin	503dee2d80	Merge pull request #10671 from hashicorp/dnephin/fix-subscribe-test-flake subscribe: improve TestSubscribeBackend_IntegrationWithServer_DeliversAllMessages	2021-10-28 12:57:09 -04:00
Evan Culver	b3c92f22b1	connect: Remove support for Envoy 1.16 (#11354 )	2021-10-27 18:51:35 -07:00
Evan Culver	98acbfa79c	connect: Add support for Envoy 1.20 (#11277 )	2021-10-27 18:38:10 -07:00
freddygv	3dd21023bc	Ensure partition-exports kind gets marshalled The api module has decoding functions that rely on 'kind' being present of payloads. This is so that we can decode into the appropriate api type for the config entry. This commit ensures that a static kind is marshalled in responses from Consul's api endpoints so that the api module can decode them.	2021-10-27 15:01:26 -06:00
Daniel Nephin	0a19d7fd76	agent: move agent tls metric monitor to a more appropriate place And add a test for it	2021-10-27 16:26:09 -04:00
Daniel Nephin	1b2144c982	telemetry: set cert expiry metrics to NaN on start So that followers do not report 0, which would make alerting difficult.	2021-10-27 15:19:25 -04:00
Daniel Nephin	a7fcf14c5c	telemetry: fix cert expiry metrics by removing labels These labels should be set by whatever process scrapes Consul (for prometheus), or by the agent that receives them (for datadog/statsd). We need to remove them here because the labels are part of the "metric key", so we'd have to pre-declare the metrics with the labels. We could do that, but that is extra work for labels that should be added from elsewhere. Also renames the closure to be more descriptive.	2021-10-27 15:19:25 -04:00
Daniel Nephin	4300daa2e6	telemetry: only emit leader cert expiry metrics on the servers	2021-10-27 15:19:25 -04:00
Daniel Nephin	9de725c17d	telemetry: prevent stale values from cert monitors Prometheus scrapes metrics from each process, so when leadership transfers to a different node the previous leader would still be reporting the old cached value. By setting NaN, I believe we should zero-out the value, so that prometheus should only consider the value from the new leader.	2021-10-27 15:19:25 -04:00
Daniel Nephin	616cc9b6f8	telemetry: improve cert expiry metrics Emit the metric immediately so that after restarting an agent, the new expiry time will be emitted. This is particularly important when this metric is being monitored, because we want the alert to resovle itself immediately. Also fixed a bug that was exposed in one of these metrics. The CARoot can be nil, so we have to handle that case.	2021-10-27 15:19:25 -04:00
Daniel Nephin	24951f0c7e	subscribe: attempt to fix a flaky test TestSubscribeBackend_IntegrationWithServer_DeliversAllMessages has been flaking a few times. This commit cleans up the test a bit, and improves the failure output. I don't believe this actually fixes the flake, but I'm not able to reproduce it reliably. The failure appears to be that the event with Port=0 is being sent in both the snapshot and as the first event after the EndOfSnapshot event. Hopefully the improved logging will show us if these are really duplicate events, or actually different events with different indexes.	2021-10-27 15:09:09 -04:00
Freddy	ae76144f55	Merge pull request #11435 from hashicorp/ent-authorizer-refactor [OSS] Export ACLs refactor	2021-10-27 13:04:40 -06:00
Freddy	520bda999b	Merge pull request #11432 from hashicorp/ap/exports-mgw [OSS] Update mesh gateways to handle partitions	2021-10-27 12:54:53 -06:00
freddygv	592965d61e	Rework acl exports interface	2021-10-27 12:50:39 -06:00
Freddy	9bbeea0432	Merge pull request #11433 from hashicorp/exported-service-acls [OSS] acl: Expand ServiceRead and NodeRead to account for partition exports	2021-10-27 12:48:08 -06:00
freddygv	05f91bd2b8	Update comments	2021-10-27 12:36:44 -06:00
Freddy	d8ae915160	Merge pull request #11431 from hashicorp/ap/exports-proxycfg [OSS] Update partitioned mesh gw handling for connect proxies	2021-10-27 11:27:43 -06:00
Freddy	8e23a6a0cc	Merge pull request #11416 from hashicorp/ap/exports-update Rename service-exports to partition-exports	2021-10-27 11:27:31 -06:00
freddygv	40271beb38	Fixup partitions assertion	2021-10-27 11:15:25 -06:00
freddygv	67412ac5e7	Fixup imports	2021-10-27 11:15:25 -06:00
freddygv	4de3537391	Split up locality check from hostname check	2021-10-27 11:15:25 -06:00
freddygv	9769b31641	Move the exportingpartitions constant to enterprise	2021-10-27 11:15:25 -06:00
freddygv	0391a65772	Replace default partition check	2021-10-27 11:15:25 -06:00
freddygv	ee45ac9dc5	PR comments	2021-10-27 11:15:25 -06:00
freddygv	f99946553a	Leave todo about default name	2021-10-27 11:15:25 -06:00
freddygv	9d375ad6d2	Add oss impl of registerEntCache	2021-10-27 11:15:25 -06:00
freddygv	183849416b	Register the ExportingPartitions cache type	2021-10-27 11:15:25 -06:00
freddygv	8b5a9369eb	Account for partitions in xds gen for mesh gw This commit avoids skipping gateways in remote partitions of the local DC when generating listeners/clusters/endpoints.	2021-10-27 11:15:25 -06:00
freddygv	d1d513b1b3	Account for partition in SNI for gateways	2021-10-27 11:15:25 -06:00
freddygv	4f0432be5e	Update xds pkg to account for GatewayKey	2021-10-27 09:03:56 -06:00
freddygv	f3f15640a9	Update mesh gateway proxy watches for partitions This commit updates mesh gateway watches for cross-partitions communication. * Mesh gateways are keyed by partition and datacenter. * Mesh gateways will now watch gateways in partitions that export services to their partition. * Mesh gateways in non-default partitions will not have cross-datacenter watches. They are not involved in traditional WAN federation.	2021-10-27 09:03:56 -06:00
freddygv	af662c8c1c	Avoid mixing named and unnamed params	2021-10-26 23:42:25 -06:00
freddygv	1de62bb0a2	Avoid passing nil config pointer	2021-10-26 23:42:25 -06:00
freddygv	4a2e40aa3c	Avoid panic on nil partitionAuthorizer config partitionAuthorizer.config can be nil if it wasn't provided on calls to newPartitionAuthorizer outside of the ACLResolver. This usage happens often in tests. This commit: adds a nil check when the config is going to be used, updates non-test usage of NewPolicyAuthorizerWithDefaults to pass a non-nil config, and dettaches setEnterpriseConf from the ACLResolver.	2021-10-26 23:42:25 -06:00
freddygv	015d85cd74	Update NodeRead for partition-exports When issuing cross-partition service discovery requests, ACL filtering often checks for NodeRead privileges. This is because the common return type is a CheckServiceNode, which contains node data.	2021-10-26 23:42:11 -06:00
Kyle Havlovitz	afb0976eac	acl: pass PartitionInfo through ent ACLConfig	2021-10-26 23:41:52 -06:00
Kyle Havlovitz	56d1858c4a	acl: Expand ServiceRead logic to look at service-exports for cross-partition	2021-10-26 23:41:32 -06:00
freddygv	4737ad118d	Swap in structs.EqualPartitions for cmp	2021-10-26 23:36:01 -06:00
freddygv	1bade08f91	Replace Split with SplitN	2021-10-26 23:36:01 -06:00
freddygv	3966677aaf	Finish removing useInDatacenter	2021-10-26 23:36:01 -06:00
freddygv	69476221c1	Update XDS for sidecars dialing through gateways	2021-10-26 23:35:48 -06:00
freddygv	ea311d2e47	Configure sidecars to watch gateways in partitions Previously the datacenter of the gateway was the key identifier, now it is the datacenter and partition. When dialing services in other partitions or datacenters we now watch the appropriate partition.	2021-10-26 23:35:37 -06:00
freddygv	feaebde1f1	Remove useInDatacenter from disco chain requests useInDatacenter was used to determine whether the mesh gateway mode of the upstream should be returned in the discovery chain target. This commit makes it so that the mesh gateway mode is returned every time, and it is up to the caller to decide whether mesh gateways should be watched or used.	2021-10-26 23:35:21 -06:00
R.B. Boyer	e27e58c6cc	agent: refactor the agent delegate interface to be partition friendly (#11429 )	2021-10-26 15:08:55 -05:00
Chris S. Kim	27f8a85664	agent: Ensure partition is considered in agent endpoints (#11427 )	2021-10-26 15:20:57 -04:00
Konstantine	2f9ee8e558	remove spaces	2021-10-26 12:38:13 -04:00
Konstantine	be14f6da90	fix altDomain responses for services where address is IP, added tests	2021-10-26 12:38:13 -04:00
Konstantine	eec9d66e22	fix encodeIPAsFqdn to return alt-domain when requested, added test case	2021-10-26 12:38:12 -04:00
Konstantine	9d6797a463	fixed altDomain response for NS type queries, and added test	2021-10-26 12:38:12 -04:00
Konstantine	0735e12412	edited TestDNS_AltDomains_Service to test responses for altDomains, and added TXT additional section check	2021-10-26 12:38:12 -04:00
Konstantine	8972e093d9	fixed alt-domain answer for SRV records, and TXT records in additional section	2021-10-26 12:38:12 -04:00
Chris S. Kim	3f736467e6	ui: Pass primary dc through to uiserver (#11317 ) Co-authored-by: John Cowen <johncowen@users.noreply.github.com>	2021-10-26 10:30:17 -04:00
freddygv	83d4d0e108	Remove outdated partition label from test	2021-10-25 18:47:02 -06:00
freddygv	c3e381b4c1	Rename service-exports to partition-exports Existing config entries prefixed by service- are specific to individual services. Since this config entry applies to partitions it is being renamed. Additionally, the Partition label was changed to Name because using Partition at the top-level and in the enterprise meta was leading to the enterprise meta partition being dropped by msgpack.	2021-10-25 17:58:48 -06:00
Daniel Nephin	f24bad2a52	Merge pull request #11232 from hashicorp/dnephin/acl-legacy-remove-docs acl: add docs and changelog for the removal of the legacy ACL system	2021-10-25 18:38:00 -04:00
Daniel Nephin	f7cdd210fe	Update agent/consul/acl_client.go Co-authored-by: Freddy <freddygv@users.noreply.github.com>	2021-10-25 17:25:14 -04:00
Daniel Nephin	732b841dd7	state: remove support for updating legacy ACL tokens	2021-10-25 17:25:14 -04:00
Daniel Nephin	76b007dacd	acl: remove init check for legacy anon token This token should always already be migrated from a previous version.	2021-10-25 17:25:14 -04:00
Daniel Nephin	8ae6ee4e36	acl: remove legacy parameter to ACLDatacenter It is no longer used now that legacy ACLs have been removed.	2021-10-25 17:25:14 -04:00
Daniel Nephin	d778113773	acl: remove ACLTokenTypeManagement	2021-10-25 17:25:14 -04:00
Daniel Nephin	2f0eba1980	acl: remove ACLTokenTypeClient, along with the last test referencing it.	2021-10-25 17:25:14 -04:00
Daniel Nephin	88c6aeea34	acl: remove legacy arg to store.ACLTokenSet And remove the tests for legacy=true	2021-10-25 17:25:14 -04:00
Daniel Nephin	b31a7fc498	acl: remove EmbeddedPolicy This method is no longer. It only existed for legacy tokens, which are no longer supported.	2021-10-25 17:25:14 -04:00
Daniel Nephin	ceaa36f983	acl: remove tests for resolving legacy tokens The code for this was already removed, which suggests this is not actually testing what it claims. I'm guessing these are still resolving because the tokens are converted to non-legacy tokens?	2021-10-25 17:25:14 -04:00
Daniel Nephin	a46e3bd2fc	acl: stop replication on leadership lost It seems like this was missing. Previously this was only called by init of ACLs during an upgrade. Now that legacy ACLs are removed, nothing was calling stop. Also remove an unused method from client.	2021-10-25 17:24:12 -04:00
Daniel Nephin	15cd8c7ab8	Remove incorrect TODO	2021-10-25 17:20:06 -04:00
Daniel Nephin	589b238374	acl: move the legacy ACL struct to the one package where it is used It is now only used for restoring snapshots. We can remove it in phase 2.	2021-10-25 17:20:06 -04:00
Daniel Nephin	0ba5d0afcd	acl: remove most of the rest of structs/acl_legacy.go	2021-10-25 17:20:06 -04:00
Paul Banks	ab5cdce760	Merge pull request #11163 from hashicorp/feature/ingress-tls-mixed Add support for enabling connect-based ingress TLS per listener.	2021-10-25 21:36:01 +01:00
FFMMM	6433a57d3c	fix autopilot_failure_tolerance, add autopilot metrics test case (#11399 ) Signed-off-by: FFMMM <FFMMM@users.noreply.github.com>	2021-10-25 10:55:59 -07:00
FFMMM	67a624a49f	use *telemetry.MetricsPrefix as prometheus.PrometheusOpts.Name (#11290 ) Signed-off-by: FFMMM <FFMMM@users.noreply.github.com>	2021-10-21 13:33:01 -07:00
Dhia Ayachi	75f69a98a2	fix leadership transfer on leave suggestions (#11387 ) * add suggestions * set isLeader to false when leadership transfer succeed	2021-10-21 14:02:26 -04:00
Dhia Ayachi	2d1ac1f7d0	try to perform a leadership transfer when leaving (#11376 ) * try to perform a leadership transfer when leaving * add a changelog	2021-10-21 12:44:31 -04:00
Kyle Havlovitz	752a285552	Add new service-exports config entry	2021-10-20 12:24:18 -07:00
Jared Kirschner	716b05f934	Merge pull request #11293 from bisakhmondal/service_filter expression validation of service-resolver subset filter	2021-10-20 08:57:37 -04:00
Paul Banks	4808b97d9c	Rebase and rebuild golden files for Envoy version bump	2021-10-19 21:37:58 +01:00
Paul Banks	ff405d35c7	Refactor `resolveListenerSDSConfig` to pass in whole config	2021-10-19 20:58:29 +01:00
Paul Banks	5c8702b182	Add support for enabling connect-based ingress TLS per listener.	2021-10-19 20:58:28 +01:00
Giulio Micheloni	b549de831d	Restored comment.	2021-10-16 18:05:32 +01:00
Giulio Micheloni	a5a4eb9cae	Separete test file and no stack trace in ret error	2021-10-16 18:02:03 +01:00
Giulio Micheloni	10814d934e	Merge branch 'main' of https://github.com/hashicorp/consul into hashicorp-main	2021-10-16 16:59:32 +01:00
R.B. Boyer	55dd52cb17	acl: small OSS refactors to help ensure that auth methods with namespace rules work with partitions (#11323 )	2021-10-14 15:38:05 -05:00
freddygv	f76fddb28e	Use stored entmeta to fill authzContext	2021-10-14 08:57:40 -06:00
freddygv	bdf3e951f8	Ensure partition is handled by auto-encrypt	2021-10-14 08:32:45 -06:00
FFMMM	bb228ab165	fix: only add prom autopilot gauges to servers (#11241 ) Signed-off-by: FFMMM <FFMMM@users.noreply.github.com>	2021-10-13 09:25:30 -07:00
Chris S. Kim	0a6d683c84	Update Intentions.List with partitions (#11299 )	2021-10-13 10:47:12 -04:00
R.B. Boyer	3e8ece97a8	acl: fix bug in 'consul members' filtering with partitions (#11263 )	2021-10-13 09:18:16 -05:00
Bisakh Mondal	929ad1e80f	add service resolver subset filter validation	2021-10-13 02:56:04 +05:30
Connor	2cd80e5f66	Merge pull request #11222 from hashicorp/clly/service-mesh-metrics Start tracking connect service mesh usage metrics	2021-10-11 14:35:03 -05:00
Connor Kelly	2119351f77	Replace fmt.Sprintf with function	2021-10-11 12:43:38 -05:00
tarat44	baec141df3	preload json values in structs to determine defaults	2021-10-10 17:52:26 -04:00
Daniel Nephin	e37b5846fd	ca: split Primary/Secondary Provider To make it more clear which methods are necessary for each scenario. This can also prevent problems which force all DCs to use the same Vault instance, which is currently a problem.	2021-10-10 15:48:02 -04:00
Daniel Nephin	571acb872e	ca: extract primaryUpdateRootCA This function is only run when the CAManager is a primary. Extracting this function makes it clear which parts of UpdateConfiguration are run only in the primary and also makes the cleanup logic simpler. Instead of both a defer and a local var we can call the cleanup function in two places.	2021-10-10 15:26:55 -04:00
Daniel Nephin	a65594d8ec	ca: rename functions to use a primary or secondary prefix This commit renames functions to use a consistent pattern for identifying the functions that can only be called when the Manager is run as the primary or secondary. This is a step toward eventually creating separate types and moving these methods off of CAManager.	2021-10-10 15:26:55 -04:00
Daniel Nephin	20f0efd8c1	ca: make receiver variable name consistent Every other method uses c not ca	2021-10-10 15:26:55 -04:00
tarat44	e3a18e5203	add test cases for h2ping_use_tls default behavior	2021-10-09 17:12:52 -04:00
FFMMM	7f28301212	fix consul_autopilot_healthy metric emission (#11231 ) https://github.com/hashicorp/consul/issues/10730	2021-10-08 10:31:50 -07:00
Connor Kelly	38986d6371	Rename ConfigUsageEnterprise to EnterpriseConfigEntryUsage	2021-10-08 10:53:34 -05:00
Connor Kelly	76b3c4ed3c	Rename and prefix ConfigEntry in Usage table Rename ConfigUsage functions to ConfigEntry prefix ConfigEntry kinds with the ConfigEntry table name to prevent potential conflicts	2021-10-07 16:19:55 -05:00
Connor Kelly	0e39a7a333	Add connect specific prefix to Usage table Ensure that connect Kind's are separate from ConfigEntry Kind's to prevent miscounting	2021-10-07 16:16:23 -05:00
tarat44	bda1998175	only set default on H2PingUseTLS if H2PING is set	2021-10-06 22:13:01 -04:00
Daniel Nephin	51e498717f	docs: add notice that legacy ACLs have been removed. Add changelog Also remove a metric that is no longer emitted that was missed in a previous step.	2021-10-05 18:30:22 -04:00
Daniel Nephin	577f2649bf	acl: remove unused translate rules endpoint The CLI command does not use this endpoint, so we can remove it. It was missed in an earlier pass.	2021-10-05 18:26:05 -04:00
Connor Kelly	f9ba7c39b5	Add changelog, website and metric docs Add changelog to document what changed. Add entry to telemetry section of the website to document what changed Add docs to the usagemetric endpoint to help document the metrics in code	2021-10-05 13:34:24 -05:00
Joshua Montgomery	5446009299	Fixing SOA record to use alt domain when alt domain in use (#10431 )	2021-10-05 10:47:27 -04:00
tarat44	35faff55f8	fix test	2021-10-05 00:48:09 -04:00
tarat44	1c1405552a	fix formatting	2021-10-05 00:15:04 -04:00
tarat44	e46b41d04d	fix formatting	2021-10-05 00:12:23 -04:00
tarat44	f8b47cdfcd	change config option to H2PingUseTLS	2021-10-05 00:12:21 -04:00
tarat44	ed4ca3db49	add support for h2c in h2 ping health checks	2021-10-04 22:51:08 -04:00
Daniel Nephin	e03b7e4c68	Merge pull request #11182 from hashicorp/dnephin/acl-legacy-remove-upgrade acl: remove upgrade from legacy, start in non-legacy mode	2021-10-04 17:25:39 -04:00
Evan Culver	e47c5c5ceb	Merge pull request #11118 from hashicorp/eculver/remove-envoy-1.15 Remove support for Envoy 1.15	2021-10-04 23:14:24 +02:00
Evan Culver	d279c60010	Merge pull request #11115 from hashicorp/eculver/envoy-1.19.1 Add support for Envoy 1.19.1	2021-10-04 23:13:26 +02:00
Daniel Nephin	b9f0014d70	acl: remove updateEnterpriseSerfTags The only remaining caller is a test helper, and the tests don't use the enterprise gossip pools.	2021-10-04 17:01:51 -04:00
Daniel Nephin	5ac360b22d	Merge pull request #11126 from hashicorp/dnephin/acl-legacy-remove-resolve-and-get-policy acl: remove ACL.GetPolicy RPC endpoint and ACLResolver.resolveTokenLegacy	2021-10-04 16:29:51 -04:00
Connor Kelly	ed5693b537	Add metrics to count the number of service-mesh config entries	2021-10-04 14:50:17 -05:00
Connor Kelly	9c487389cf	Add metrics to count connect native service mesh instances This will add the counts of the service mesh instances tagged by whether or not it is connect native	2021-10-04 14:37:05 -05:00
Connor Kelly	8000ea45ca	Add metrics to count service mesh Kind instance counts This will add the counts of service mesh instances tagged by the different ServiceKind's.	2021-10-04 14:36:59 -05:00
Daniel Nephin	b6435259c3	acl: fix test failures caused by remocving legacy ACLs This commit two test failures: 1. Remove check for "in legacy ACL mode", the actual upgrade will be removed in a following commit. 2. Remove the early WaitForLeader in dc2, because with it the test was failing with ACL not found.	2021-10-01 18:03:10 -04:00
Evan Culver	e74ce0fb2e	Add 1.15 versions to too old list	2021-10-01 11:28:26 -07:00
Chris S. Kim	3c8ca0dbd2	agent: Reject partitions in legacy intention endpoints (#11181 )	2021-10-01 13:18:57 -04:00
Chris S. Kim	bf94949d48	Support partitions in parseIntentionStringComponent (#11202 )	2021-10-01 12:36:12 -04:00
Dhia Ayachi	8bd52995d1	fix token list by auth method (#11196 ) * add tests to OIDC authmethod and fix entMeta when retrieving auth-methods * fix oss compilation error	2021-10-01 12:00:43 -04:00
Evan Culver	4cdcaf3658	Merge branch 'eculver/envoy-1.19.1' into eculver/remove-envoy-1.15	2021-09-30 11:32:28 -07:00
Evan Culver	7b157bba4e	regenerate more envoy golden files	2021-09-30 10:57:47 -07:00
Daniel Nephin	ec935a2486	acl: call stop for the upgrade goroutine when done TestAgentLeaks_Server was reporting a goroutine leak without this. Not sure if it would actually be a leak in production or if this is due to the test setup, but seems easy enough to call it this way until we remove legacyACLTokenUpgrade.	2021-09-29 17:36:43 -04:00
Daniel Nephin	0c077d0527	acl: only run startACLUpgrade once Since legacy ACL tokens can no longer be created we only need to run this upgrade a single time when leadership is estalbished.	2021-09-29 16:22:01 -04:00
Daniel Nephin	f21097beda	acl: remove reading of serf acl tags We no long need to read the acl serf tag, because servers are always either ACL enabled or ACL disabled. We continue to write the tag so that during an upgarde older servers will see the tag.	2021-09-29 15:45:11 -04:00
Daniel Nephin	b866e3c4f4	acl: fix test failure For some reason removing legacy ACL upgrade requires using an ACL token now for this WaitForLeader.	2021-09-29 15:21:30 -04:00
Daniel Nephin	ebb2388605	acl: remove legacy ACL upgrades from Server As part of removing the legacy ACL system	2021-09-29 15:19:23 -04:00
Daniel Nephin	41a97360ca	acl: fix test failures caused by remocving legacy ACLs This commit two test failures: 1. Remove check for "in legacy ACL mode", the actual upgrade will be removed in a following commit. 2. Use the root token in WaitForLeader, because without it the test was failing with ACL not found.	2021-09-29 15:15:50 -04:00
Daniel Nephin	b73b68d696	acl: remove ACL.GetPolicy endpoint and resolve legacy acls And all code that was no longer used once those two were removed.	2021-09-29 14:33:19 -04:00
Daniel Nephin	b8da06a34d	acl: remove ACL upgrading from Clients As part of removing the legacy ACL system ACL upgrading and the flag for legacy ACLs is removed from Clients. This commit also removes the 'acls' serf tag from client nodes. The tag is only ever read from server nodes. This commit also introduces a constant for the acl serf tag, to make it easier to track where it is used.	2021-09-29 14:02:38 -04:00
Daniel Nephin	33a5448604	Merge pull request #11136 from hashicorp/dnephin/acl-resolver-fix-default-authz acl: fix default Authorizer for down_policy extend-cache/async-cache	2021-09-29 13:45:12 -04:00
Daniel Nephin	afb1dd5827	Merge pull request #11110 from hashicorp/dnephin/acl-legacy-remove-initialize acl: remove initializeLegacyACL and the rest of the legacy FSM commands	2021-09-29 13:44:30 -04:00
Daniel Nephin	a9ac148c92	Merge pull request #10999 from hashicorp/dnephin/revert-config-xds-port Revert config xds_port	2021-09-29 13:39:15 -04:00
Daniel Nephin	bd28d23b55	command/envoy: stop using the DebugConfig from Self endpoint The DebugConfig in the self endpoint can change at any time. It's not a stable API. This commit adds the XDSPort to a stable part of the XDS api, and changes the envoy command to read this new field. It includes support for the old API as well, in case a newer CLI is used with an older API, and adds a test for both cases.	2021-09-29 13:21:28 -04:00
Daniel Nephin	2995ac61f2	acl: remove the last of the legacy FSM Replace it with an implementation that returns an error, and rename some symbols to use a Deprecated suffix to make it clear. Also remove the ACLRequest struct, which is no longer referenced.	2021-09-29 12:42:23 -04:00
Daniel Nephin	a8358f7575	acl: remove bootstrap-init FSM operation	2021-09-29 12:42:23 -04:00
Daniel Nephin	ea2e0ad2ec	acl: remove initializeLegacyACL from leader init	2021-09-29 12:42:23 -04:00
Daniel Nephin	4e36442583	acl: remove ACLDelete FSM command, and state store function These are no longer used now that ACL.Apply has been removed.	2021-09-29 12:42:23 -04:00
Daniel Nephin	7e37c9a765	acl: remove legacy field to ACLBoostrap	2021-09-29 12:42:23 -04:00
Daniel Nephin	402d3792b6	Revert "Merge pull request #10588 from hashicorp/dnephin/config-fix-ports-grpc" This reverts commit 74fb650b6b966588f8faeec26935a858af2b8bb5, reversing changes made to 58bd8173364effb98b9fd9f9b98d31dd887a9bac.	2021-09-29 12:28:41 -04:00
Daniel Nephin	d4c48a3f23	Merge pull request #11101 from hashicorp/dnephin/acl-legacy-remove-rpc-2 acl: remove legacy ACL.Apply RPC	2021-09-29 12:23:55 -04:00
Daniel Nephin	69a83aefcf	Merge pull request #11177 from hashicorp/dnephin/remove-entmeta-methods structs: remove EnterpriseMeta helper methods	2021-09-29 12:08:07 -04:00
Daniel Nephin	acb62aa896	Merge pull request #10986 from hashicorp/dnephin/acl-legacy-remove-rpc acl: remove legacy ACL RPC - part 1	2021-09-29 12:04:09 -04:00
Daniel Nephin	1bc07c5166	structs: rename the last helper method. This one gets used a bunch, but we can rename it to make the behaviour more obvious.	2021-09-29 11:48:38 -04:00
Daniel Nephin	93b3e110b6	structs: remove another helper We already have a helper funtion.	2021-09-29 11:48:03 -04:00
Daniel Nephin	17652227f6	structs: remove two methods that were only used once each. These methods only called a single function. Wrappers like this end up making code harder to read because it adds extra ways of doing things. We already have many helper functions for constructing these types, we don't need additional methods.	2021-09-29 11:47:03 -04:00
Daniel Nephin	a0e08086f7	Merge pull request #10988 from hashicorp/dnephin/acl-legacy-remove-config acl: isolate deprecated config and warn when they are used	2021-09-29 11:40:14 -04:00
Daniel Nephin	3f4f7d2f3f	Merge pull request #9456 from hashicorp/dnephin/config-deprecation config: Use DeprecatedConfig struct for deprecated config fields	2021-09-29 11:37:40 -04:00
Evan Culver	cb5ef13fde	Merge remote-tracking branch 'origin/eculver/remove-envoy-1.15' into eculver/remove-envoy-1.15	2021-09-28 16:06:36 -07:00
Evan Culver	eaa9394cb2	Fix typo Co-authored-by: Freddy <freddygv@users.noreply.github.com>	2021-09-29 01:05:45 +02:00
Evan Culver	64f94b10ce	Merge branch 'eculver/envoy-1.19.1' into eculver/remove-envoy-1.15	2021-09-28 15:59:43 -07:00
Evan Culver	807871224a	Merge branch 'main' into eculver/envoy-1.19.1	2021-09-28 15:58:20 -07:00
Chris S. Kim	3f79aaf509	Cleanup unnecessary normalizing method (#11169 )	2021-09-28 15:31:12 -04:00
Daniel Nephin	4ed9476a61	Merge pull request #11084 from krastin/krastin-autopilot-loggingtypo Fix a tiny typo in logging in autopilot.go	2021-09-28 15:11:11 -04:00
Evan Culver	e2363c13ff	Merge branch 'main' into eculver/envoy-1.19.1	2021-09-28 11:54:33 -07:00
Chris S. Kim	90fe20c3a2	agent: Clean up unused built-in proxy config (#11165 )	2021-09-28 11:29:10 -04:00
Daniel Nephin	30fe14eed3	acl: fix default authorizer for down_policy This was causing a nil panic because a nil authorizer is no longer valid after the cleanup done in https://github.com/hashicorp/consul/pull/10632.	2021-09-23 18:12:22 -04:00
Daniel Nephin	a6a7069ecf	Remove t.Parallel from TestACLResolver_DownPolicy These tests run in under 10ms, t.Parallel does nothing but slow them down and make failures harder to debug when one panics.	2021-09-23 18:12:22 -04:00
Dhia Ayachi	4505cb2920	Refactor table index acl phase 2 (#11133 ) * extract common methods from oss and ent * remove unreachable code * add missing normalize for binding rules * fix oss to use Query	2021-09-23 15:26:09 -04:00
Daniel Nephin	cc46fcc53e	config: Move ACLEnableKeyListPolicy to DeprecatedConfig	2021-09-23 15:15:00 -04:00
Daniel Nephin	107c24a68a	config: move acl_ttl to DeprecatedConfig	2021-09-23 15:14:59 -04:00
Daniel Nephin	5eb2bebdf8	config: move acl_{default,down}_policy to DeprecatedConfig	2021-09-23 15:14:59 -04:00
Daniel Nephin	408eb0e08e	config: Deprecate EnableACLReplication replaced by ACL.TokenReplication	2021-09-23 15:14:59 -04:00
Daniel Nephin	d54db5917f	config: move ACL master token and replication to DeprecatedConfig	2021-09-23 15:14:59 -04:00
Paul Banks	f8412cf5fa	Merge pull request #10903 from hashicorp/feature/ingress-sds Add Support to for providing TLS certificates for Ingress listeners from an SDS source	2021-09-23 16:19:05 +01:00
Dhia Ayachi	ebe333b947	Refactor table index (#11131 ) * convert tableIndex to use the new pattern * make `indexFromString` available for oss as well * refactor `indexUpdateMaxTxn`	2021-09-23 11:06:23 -04:00
Paul Banks	d57931124f	Final readability tweaks from review	2021-09-23 10:17:12 +01:00
Paul Banks	66c625a64d	Fix subtle loop bug and add test	2021-09-23 10:13:41 +01:00
Paul Banks	7198d0bd80	Refactor SDS validation to make it more contained and readable	2021-09-23 10:13:19 +01:00
Paul Banks	fe4f69613c	Refactor Ingress-specific lister code to separate file	2021-09-23 10:13:19 +01:00
Paul Banks	f4f0793a10	Minor PR typo and cleanup fixes	2021-09-23 10:13:19 +01:00
Paul Banks	4cc1ccf892	Revert abandonned changes to proxycfg for Ent test consistency	2021-09-23 10:13:19 +01:00
Paul Banks	d812a0edc7	Fix merge conflict in xds tests	2021-09-23 10:12:37 +01:00
Paul Banks	a24efd20fc	Fix some more Enterprise Normalization issues affecting tests	2021-09-23 10:12:37 +01:00
Paul Banks	15969327c0	Remove unused argument to fix lint error	2021-09-23 10:09:11 +01:00
Paul Banks	9422e4ebc7	Handle namespaces in route names correctly; add tests for enterprise	2021-09-23 10:09:11 +01:00
Paul Banks	9d576a08dc	Update xDS routes to support ingress services with different TLS config	2021-09-23 10:08:02 +01:00
Paul Banks	8a4254a894	Update xDS Listeners with SDS support	2021-09-23 10:08:02 +01:00
Paul Banks	8548e15f1b	Update proxycfg to hold more ingress config state	2021-09-23 10:08:02 +01:00
Paul Banks	0e410a1b1f	Add ingress-gateway config for SDS	2021-09-23 10:08:02 +01:00
Daniel Nephin	3e6dc2a843	acl: remove ACL.Apply As part of removing the legacy ACL system.	2021-09-22 18:28:08 -04:00
Daniel Nephin	2ce64e2837	acl: made acl rules in tests slightly more specific When converting these tests from the legacy ACL system to the new RPC endpoints I initially changed most things to use _prefix rules, because that was equivalent to the old legacy rules. This commit modifies a few of those rules to be a bit more specific by replacing the _prefix rule with a non-prefix one where possible.	2021-09-22 18:24:56 -04:00
Mark Anderson	c87d57bfeb	partitions/authmethod-index work from enterprise (#11056 ) * partitions/authmethod-index work from enterprise Signed-off-by: Mark Anderson <manderson@hashicorp.com>	2021-09-22 13:19:20 -07:00
Chris S. Kim	d222f170a7	connect: Allow upstream listener escape hatch for prepared queries (#11109 )	2021-09-22 15:27:10 -04:00
Evan Culver	88a899d06a	connect: remove support for Envoy 1.15	2021-09-22 11:48:50 -07:00
R.B. Boyer	ba13416b57	grpc: strip local ACL tokens from RPCs during forwarding if crossing datacenters (#11099 ) Fixes #11086	2021-09-22 13:14:26 -05:00
Daniel Nephin	66453d2de9	config: Move two more fields to DeprecatedConfig And add a test for deprecated config fields.	2021-09-22 13:23:03 -04:00
Daniel Nephin	23f070e0a1	config: Introduce DeprecatedConfig This struct allows us to move all the deprecated config options off of the main config struct, and keeps all the deprecation logic in a single place, instead of spread across 3+ places.	2021-09-22 13:22:16 -04:00
Evan Culver	4d222cfcd0	add 1.19.x versions to test config	2021-09-22 09:30:45 -07:00
Connor	bc04a155fb	Merge pull request #11090 from hashicorp/clly/kv-usage-metrics Add KVUsage to consul state usage metrics	2021-09-22 11:26:56 -05:00
Connor Kelly	bfe6b64ca7	Strip out go 1.17 bits	2021-09-22 11:04:48 -05:00
Matt Keeler	7c1ef8f515	Add a mock Agent delegate to ease/improve some types of testing	2021-09-22 10:23:01 -04:00
hc-github-team-consul-core	320b20c708	auto-updated agent/uiserver/bindata_assetfs.go from commit 9c0233cf5	2021-09-22 13:05:38 +00:00
hc-github-team-consul-core	949416c071	auto-updated agent/uiserver/bindata_assetfs.go from commit cfbd1bb84	2021-09-22 09:26:14 +00:00
Daniel Nephin	b40bdc9e98	acl: remove remaining tests that use ACL.Apply In preparation for removing ACL.Apply. Tests for ACL.Apply, ACL.GetPolicy, and ACL upgrades were removed because all 3 of those will be removed shortly. The forth test appears to be for the ACLResolver cache, so the test was moved to the correct test file, and the name was updated to make it obvious what is being tested.	2021-09-21 19:35:26 -04:00
Evan Culver	69f4cc7532	regenerate envoy golden files	2021-09-21 16:21:00 -07:00
Evan Culver	b104b7719c	add envoy 1.19.1	2021-09-21 15:39:36 -07:00
Daniel Nephin	ab91d254a3	fsm: restore the legacy commands and emit a helpful error message.	2021-09-21 18:35:12 -04:00
Daniel Nephin	0180dd67ff	Convert tests to the new ACL system In preparation for removing ACL.Apply	2021-09-21 18:35:12 -04:00
Daniel Nephin	b639f47e3c	config: use the new ACL system in tests In preparation for removing ACL.Apply	2021-09-21 17:57:29 -04:00
Daniel Nephin	2702aecc27	catalog: use the new ACL system in tests In preparation for removing ACL.Apply	2021-09-21 17:57:29 -04:00
Daniel Nephin	b6218b75d9	Update 4 non-acl tests that used the legacy ACL.Apply These tests don't really care about the endpoint, they just need some way to create an ACL token.	2021-09-21 17:57:29 -04:00
Daniel Nephin	ad9748adc3	acl: remove two commented out tests for legacy ACL replication They were commented out in 2018.	2021-09-21 17:57:29 -04:00
Daniel Nephin	5a31a2e167	acl: replace legacy Get and List RPCs with an error impl These endpoints are being removed as part of the legacy ACL system.	2021-09-21 17:57:29 -04:00
Daniel Nephin	26f3380688	acl: remove a couple legacy ACL operation constants structs.ACLForceSet was deprecated 4 years ago, it should be safe to remove now. ACLBootstrapNow was removed in a recent commit. While it is technically possible that a cluster with mixed version could still attempt a legacy boostrap, we documented that the legacy system was deprecated in 1.4, so no clusters that are being upgraded should be attempting a legacy boostrap.	2021-09-21 17:57:29 -04:00
Daniel Nephin	af8c10afc4	acl: Remove unused ACLPolicyIDType	2021-09-21 17:57:29 -04:00
Daniel Nephin	5493ff06cc	Merge pull request #10985 from hashicorp/dnephin/acl-legacy-remove-replication acl: remove legacy ACL replication	2021-09-21 17:56:54 -04:00
Connor	64852cd3e5	Apply suggestions from code review Co-authored-by: Matt Keeler <mkeeler@users.noreply.github.com>	2021-09-21 10:52:46 -05:00
R.B. Boyer	2773bd94d7	xds: fix representation of incremental xDS subscriptions (#10987 ) Fixes #10563 The `resourceVersion` map was doing two jobs prior to this PR. The first job was to track what version of every resource we know envoy currently has. The second was to track subscriptions to those resources (by way of the empty string for a version). This mostly works out fine, but occasionally leads to consul removing a resource and accidentally (effectively) unsubscribing at the same time. The fix separates these two jobs. When all of the resources for a subscription are removed we continue to track the subscription until envoy explicitly unsubscribes	2021-09-21 09:58:56 -05:00
Connor Kelly	973b7b5c78	Fix test	2021-09-20 13:44:43 -05:00
Connor Kelly	698fc291a9	Add KVUsage to consul state usage metrics This change will add the number of entries in the consul KV store to the already existing usage metrics.	2021-09-20 12:41:54 -05:00
R.B. Boyer	55b36dd056	xds: ensure the active streams counters are 64 bit aligned on 32 bit systems (#11085 )	2021-09-20 11:07:11 -05:00
Krastin Krastev	ba13dbf24c	Update autopilot.go Fixing a minuscule typo in logging	2021-09-20 14:40:58 +02:00
Freddy	f1b2ef30d1	Merge pull request #11071 from hashicorp/partitions/ixn-decisions	2021-09-16 15:18:23 -06:00
freddygv	661f520841	Fixup proxycfg tproxy case	2021-09-16 15:05:28 -06:00
freddygv	12eec88dff	Remove ent checks from oss test	2021-09-16 14:53:28 -06:00
R.B. Boyer	7fa8f19077	acl: ensure the global management policy grants all necessary partition privileges (#11072 )	2021-09-16 15:53:10 -05:00
freddygv	cf56be7d8d	Ensure partition is defaulted in authz	2021-09-16 14:39:01 -06:00
freddygv	b5a8935bb8	Default the partition in ixn check	2021-09-16 14:39:01 -06:00
freddygv	caafc1905e	Fixup test	2021-09-16 14:39:01 -06:00
freddygv	8a9bf3748c	Account for partitions in ixn match/decision	2021-09-16 14:39:01 -06:00
Jeff Widman	a8f396c55f	Bump `go-discover` to fix broken dep tree (#10898 )	2021-09-16 15:31:22 -04:00
hc-github-team-consul-core	5a6f9e38b1	auto-updated agent/uiserver/bindata_assetfs.go from commit 1d9d3349c	2021-09-16 17:31:08 +00:00
R.B. Boyer	4e7b6888e3	acl: fix intention:*:write checks (#11061 ) This is a partial revert of #10793	2021-09-16 11:08:45 -05:00
Freddy	88627700d0	Merge pull request #11051 from hashicorp/partitions/fixes	2021-09-16 09:29:00 -06:00
Freddy	494764ee2d	acl: small resolver changes to account for partitions (#11052 ) Also refactoring the enterprise side of a test to make it easier to reason about.	2021-09-16 09:17:02 -05:00
freddygv	7927a97c2f	Fixup manager tests	2021-09-15 17:24:05 -06:00
freddygv	dc549eca30	Default partition in match endpoint	2021-09-15 17:23:52 -06:00
freddygv	0cdcbbb4c9	Pass partition to intention match query	2021-09-15 17:23:52 -06:00
freddygv	a57c52ca32	Ensure partition is used for SAN validation	2021-09-15 17:23:48 -06:00
Mark Anderson	08b222cfc3	ACL Binding Rules table partitioning (#11044 ) * ACL Binding Rules table partitioning Signed-off-by: Mark Anderson <manderson@hashicorp.com>	2021-09-15 13:26:08 -07:00
hc-github-team-consul-core	23e3f865b0	auto-updated agent/uiserver/bindata_assetfs.go from commit fc14a412f	2021-09-15 18:55:29 +00:00
hc-github-team-consul-core	abe0195257	auto-updated agent/uiserver/bindata_assetfs.go from commit b16a6fa03	2021-09-15 17:14:42 +00:00
Dhia Ayachi	25ea1a9276	use const instead of literals for `tableIndex` (#11039 )	2021-09-15 10:24:04 -04:00
Mark Anderson	ffe3806aaf	Refactor `indexAuthMethod` in `tableACLBindingRules` (#11029 ) * Port consul-enterprise #1123 to OSS Signed-off-by: Mark Anderson <manderson@hashicorp.com> * Fixup missing query field Signed-off-by: Mark Anderson <manderson@hashicorp.com> * change to re-trigger ci system Signed-off-by: Mark Anderson <manderson@hashicorp.com>	2021-09-15 09:34:19 -04:00
Freddy	8804577de1	Merge pull request #11024 from hashicorp/partitions/rbac	2021-09-14 11:18:19 -06:00
Freddy	27f40ccf51	Update error texts (#11022 ) Co-authored-by: Daniel Nephin <dnephin@hashicorp.com>	2021-09-14 11:08:06 -06:00
freddygv	f209408918	Update spiffe ID patterns used for RBAC	2021-09-14 11:00:03 -06:00
freddygv	0e30151eaa	Expand testing of simplifyNotSourceSlice for partitions	2021-09-14 10:55:15 -06:00
freddygv	a65da57a3d	Expand testing of removeSameSourceIntentions for partitions	2021-09-14 10:55:09 -06:00
freddygv	e9d78a20c7	Account for partition when matching src intentions	2021-09-14 10:55:02 -06:00
Daniel Nephin	44d91ea56f	Add failures_before_warning to checks (#10969 ) Signed-off-by: Jakub Sokołowski <jakub@status.im> * agent: add failures_before_warning setting The new setting allows users to specify the number of check failures that have to happen before a service status us updated to be `warning`. This allows for more visibility for detected issues without creating alerts and pinging administrators. Unlike the previous behavior, which caused the service status to not update until it reached the configured `failures_before_critical` setting, now Consul updates the Web UI view with the `warning` state and the output of the service check when `failures_before_warning` is breached. The default value of `FailuresBeforeWarning` is the same as the value of `FailuresBeforeCritical`, which allows for retaining the previous default behavior of not triggering a warning. When `FailuresBeforeWarning` is set to a value higher than that of `FailuresBeforeCritical it has no effect as `FailuresBeforeCritical` takes precedence. Resolves: https://github.com/hashicorp/consul/issues/10680 Signed-off-by: Jakub Sokołowski <jakub@status.im> Co-authored-by: Jakub Sokołowski <jakub@status.im>	2021-09-14 12:47:52 -04:00
Dhia Ayachi	4992218676	convert expiration indexed in ACLToken table to use `indexerSingle` (#11018 ) * move intFromBool to be available for oss * add expiry indexes * remove dead code: `TokenExpirationIndex` * fix remove indexer `TokenExpirationIndex` * fix rebase issue	2021-09-13 14:37:16 -04:00
Dhia Ayachi	1f23bdf388	add locality indexer partitioning (#11016 ) * convert `Roles` index to use `indexerSingle` * split authmethod write indexer to oss and ent * add index locality * add locality unit tests * move intFromBool to be available for oss * use Bool func * refactor `aclTokenList` to merge func	2021-09-13 11:53:00 -04:00
Dhia Ayachi	3638825db8	convert `indexAuthMethod` index to use `indexerSingle` (#11014 ) * convert `Roles` index to use `indexerSingle` * fix oss build * split authmethod write indexer to oss and ent * add auth method unit tests	2021-09-10 16:56:56 -04:00
Paul Banks	ecbe8f0656	Include namespace and partition in error messages when validating ingress header manip	2021-09-10 21:11:00 +01:00
Paul Banks	e6642c6dae	Refactor HTTPHeaderModifiers.MergeDefaults based on feedback	2021-09-10 21:11:00 +01:00
Paul Banks	a1acb7ec3b	Fix enterprise test failures caused by differences in normalizing EnterpriseMeta	2021-09-10 21:11:00 +01:00
Paul Banks	3484d77b18	Fix enterprise discovery chain tests; Fix multi-level split merging	2021-09-10 21:11:00 +01:00
Paul Banks	e0ad412f1d	Remove unnecessary check	2021-09-10 21:09:24 +01:00
Paul Banks	5c6d27555b	Fix discovery chain test fixtures	2021-09-10 21:09:24 +01:00
Paul Banks	bc1c86df96	Integration tests for all new header manip features	2021-09-10 21:09:24 +01:00
Paul Banks	1dd1683ed9	Header manip for split legs plumbing	2021-09-10 21:09:24 +01:00
Paul Banks	f70f7b2389	Header manip for service-router plumbed through	2021-09-10 21:09:24 +01:00
Paul Banks	fc2ed4cdf4	Ingress gateway header manip plumbing	2021-09-10 21:09:24 +01:00
Paul Banks	2db02cdba2	Add HTTP header manip for router and splitter entries	2021-09-10 21:09:24 +01:00
Paul Banks	7ac9b46f08	Header manip and validation added for ingress-gateway entries	2021-09-10 21:09:24 +01:00
Dhia Ayachi	82b30f8020	convert `Roles` index to use `indexerMulti` (#11013 ) * convert `Roles` index to use `indexerMulti` * add role test in oss * fix oss to use the right index func * preallocate slice	2021-09-10 16:04:33 -04:00
Dhia Ayachi	569e18d002	convert indexPolicies in ACLTokens table to the new index (#11011 )	2021-09-10 14:57:37 -04:00
Dhia Ayachi	0d0edeec27	convert indexSecret to the new index (#11007 )	2021-09-10 09:10:11 -04:00
Dhia Ayachi	f0cbe25ca6	convert indexAccessor to the new index (#11002 )	2021-09-09 16:28:04 -04:00
Hans Hasselberg	24c6ce0be0	tls: consider presented intermediates during server connection tls handshake. (#10964 ) * use intermediates when verifying * extract connection state * remove useless import * add changelog entry * golint * better error * wording * collect errors * use SAN.DNSName instead of CommonName * Add test for unknown intermediate * improve changelog entry	2021-09-09 21:48:54 +02:00
Chris S. Kim	3fb797382b	Sync enterprise changes to oss (#10994 ) This commit updates OSS with files for enterprise-specific admin partitions feature work	2021-09-08 11:59:30 -04:00
Kyle Havlovitz	a7b5a5d1b4	Merge pull request #10984 from hashicorp/mesh-resource acl: adding a new mesh resource	2021-09-07 15:06:20 -07:00
Dhia Ayachi	96d7842118	partition dicovery chains (#10983 ) * partition dicovery chains * fix default partition for OSS	2021-09-07 16:29:32 -04:00
Daniel Nephin	4d5a39e622	acl: remove ACL.IsSame The only caller of this method was removed in a recent commit along with replication.	2021-09-03 12:59:12 -04:00
Daniel Nephin	4dd5bb8e3b	acl: remove legacy ACL replication	2021-09-03 12:42:06 -04:00
R.B. Boyer	4206f585f0	acl: adding a new mesh resource	2021-09-03 09:12:03 -04:00
Dhia Ayachi	72391dc99c	try to infer command partition from node partition (#10981 )	2021-09-03 08:37:23 -04:00
Dhia Ayachi	eb19271fd7	add partition to SNI when partition is non default (#10917 )	2021-09-01 10:35:39 -04:00
Freddy	11672defaf	connect: update envoy supported versions to latest patch release (#10961) Relevant advisory: https://github.com/envoyproxy/envoy/security/advisories/GHSA-6g4j-5vrw-2m8h	2021-08-31 10:39:18 -06:00
Evan Culver	93f94ac24f	rpc: authorize raft requests (#10925 )	2021-08-26 15:04:32 -07:00
hc-github-team-consul-core	a758581ab6	auto-updated agent/uiserver/bindata_assetfs.go from commit eeeb91bea	2021-08-26 18:13:08 +00:00
Chris S. Kim	86de20c975	ent->oss test fix (#10926 )	2021-08-26 14:06:49 -04:00
hc-github-team-consul-core	5c67517647	auto-updated agent/uiserver/bindata_assetfs.go from commit a907e1d87	2021-08-26 18:02:18 +00:00
hc-github-team-consul-core	d9022ce788	auto-updated agent/uiserver/bindata_assetfs.go from commit a0b0ed2bc	2021-08-26 16:06:09 +00:00
Chris S. Kim	efbdf7e117	api: expose upstream routing configurations in topology view (#10811 ) Some users are defining routing configurations that do not have associated services. This commit surfaces these configs in the topology visualization. Also fixes a minor internal bug with non-transparent proxy upstream/downstream references.	2021-08-25 15:20:32 -04:00
R.B. Boyer	6b5a58de50	acl: some acl authz refactors for nodes (#10909 )	2021-08-25 13:43:11 -05:00
hc-github-team-consul-core	c95ec5007d	auto-updated agent/uiserver/bindata_assetfs.go from commit a777b0a9b	2021-08-25 13:46:51 +00:00
hc-github-team-consul-core	9b2dd8b155	auto-updated agent/uiserver/bindata_assetfs.go from commit 8192dde48	2021-08-25 11:39:14 +00:00
R.B. Boyer	a84f5fa25d	grpc: ensure that streaming gRPC requests work over mesh gateway based wan federation (#10838 ) Fixes #10796	2021-08-24 16:28:44 -05:00
hc-github-team-consul-core	6b574abc89	auto-updated agent/uiserver/bindata_assetfs.go from commit 05a28c311	2021-08-24 16:04:24 +00:00
Giulio Micheloni	387f6f717b	Fix merge conflicts	2021-08-22 19:35:08 +01:00
Giulio Micheloni	10b03c3f4e	Merge branch 'main' into serve-panic-recovery	2021-08-22 20:31:11 +02:00
Giulio Micheloni	465e9fecda	grpc, xds: recovery middleware to return and log error in case of panic 1) xds and grpc servers: 1.1) to use recovery middleware with callback that prints stack trace to log 1.2) callback turn the panic into a core.Internal error 2) added unit test for grpc server	2021-08-22 19:06:26 +01:00
freddygv	79e181be73	Avoid passing zero value into variadic	2021-08-20 17:40:33 -06:00
freddygv	ed79e38a36	Update comment for test function	2021-08-20 17:40:33 -06:00
freddygv	b1050e4229	Update prepared query cluster SAN validation Previously SAN validation for prepared queries was broken because we validated against the name, namespace, and datacenter for prepared queries. However, prepared queries can target: - Services with a name that isn't their own - Services in multiple datacenters This means that the SpiffeID to validate needs to be based on the prepared query endpoints, and not the prepared query's upstream definition. This commit updates prepared query clusters to account for that.	2021-08-20 17:40:33 -06:00
freddygv	1f192eb7d9	Fixup proxy config test fixtures - The TestNodeService helper created services with the fixed name "web", and now that name is overridable. - The discovery chain snapshot didn't have prepared query endpoints so the endpoints tests were missing data for prepared queries	2021-08-20 17:38:57 -06:00
R.B. Boyer	60591d55f7	agent: add partition labels to catalog API metrics where appropriate (#10890 )	2021-08-20 15:09:39 -05:00
R.B. Boyer	b6be94e7fa	fixing various bits of enterprise meta plumbing to be more correct (#10889 )	2021-08-20 14:34:23 -05:00
Dhia Ayachi	f766b6dff7	oss portion of ent #1069 (#10883 )	2021-08-20 12:57:45 -04:00
R.B. Boyer	d730298f59	state: partition the nodes.uuid and nodes.meta indexes as well (#10882 )	2021-08-19 16:17:59 -05:00
R.B. Boyer	61f1c01b83	agent: ensure that most agent behavior correctly respects partition configuration (#10880 )	2021-08-19 15:09:42 -05:00
Daniel Nephin	4a0ae4048d	Merge pull request #10849 from hashicorp/dnephin/contrib-doc-xds-auth xds: document how authorization works	2021-08-18 13:25:16 -04:00
R.B. Boyer	e565409c6a	state: partition the usage metrics subsystem (#10867 )	2021-08-18 09:27:15 -05:00
Daniel Nephin	9df2464c7c	xds: document how authorization works	2021-08-17 19:26:34 -04:00
R.B. Boyer	1cef3c99c2	state: adjust streaming event generation to account for partitioned nodes (#10860 ) Also re-enabled some tests that had to be disabled in the prior PR.	2021-08-17 16:49:26 -05:00
R.B. Boyer	e50e13d2ab	state: partition nodes and coordinates in the state store (#10859 ) Additionally: - partitioned the catalog indexes appropriately for partitioning - removed a stray reference to a non-existent index named "node.checks"	2021-08-17 13:29:39 -05:00
Daniel Nephin	5a82859ee1	acl: small improvements to ACLResolver disable due to RPC error Remove the error return, so that not handling is not reported as an error by errcheck. It was returning the error passed as an arg unmodified so there is no reason to return the same value that was passed in. Remove the term upstreams to remove any confusion with the term used in service mesh. Remove the AutoDisable field, and replace it with the TTL value, using 0 to indicate the setting is turned off. Replace "not Before" with "After". Add some test coverage to show the behaviour is still correct.	2021-08-17 13:34:18 -04:00
Daniel Nephin	09ae0ab94a	acl: make ACLDisabledTTL a constant This field was never user-configurable. We always overwrote the value with 120s from NonUserSource. However, we also never copied the value from RuntimeConfig to consul.Config, So the value in NonUserSource was always ignored, and we used the default value of 30s set by consul.DefaultConfig. All of this code is an unnecessary distraction because a user can not actually configure this value. This commit removes the fields and uses a constant value instad. Someone attempting to set acl.disabled_ttl in their config will now get an error about an unknown field, but previously the value was completely ignored, so the new behaviour seems more correct. We have to keep this field in the AutoConfig response for backwards compatibility, but the value will be ignored by the client, so it doesn't really matter what value we set.	2021-08-17 13:34:18 -04:00
Daniel Nephin	a8bc964241	Fix test failures Tests only specified one of the fields, but in production we copy the value from a single place, so we can do the same in tests. The AutoConfig test broke because of the problem noticed in a previous commit. The DisabledTTL is not wired up properly so it reports 0s here. Changed the test to use an explicit value.	2021-08-17 13:32:52 -04:00
Daniel Nephin	0d69b49f41	config: remove ACLResolver settings from RuntimeConfig	2021-08-17 13:32:52 -04:00
Daniel Nephin	75baa22e64	acl: remove ACLResolver config fields from consul.Config	2021-08-17 13:32:52 -04:00
Daniel Nephin	454f62eacc	acl: replace ACLResolver.Config with its own struct This is step toward decoupling ACLResolver from the agent/consul package.	2021-08-17 13:32:52 -04:00
Daniel Nephin	5e5ad62679	acl: remove ACLRulesTranslateLegacyToken API endpoint	2021-08-17 13:10:02 -04:00
Daniel Nephin	be0358df02	acl: remove legacy bootstrap Return an explicit error from the RPC, and remove the flag from the HTTP API.	2021-08-17 13:10:00 -04:00
Daniel Nephin	d877673268	agent: update some tests that were using legacy ACL endpoints The tests were updated to use the new ACL endpoints now that the legacy ones have been removed.	2021-08-17 13:09:30 -04:00
Daniel Nephin	10791b007d	http: update legacy ACL endpoints to return an error Also move a test for the ACLReplicationStatus endpoint into the correct file.	2021-08-17 13:09:29 -04:00
Daniel Nephin	4f54d9708c	acl: add some notes about removing legacy ACL system	2021-08-17 13:08:29 -04:00
Daniel Nephin	e4c6bee7e6	Merge pull request #10792 from hashicorp/dnephin/rename-authz-vars acl: use authz consistently as the variable name for an acl.Authorizer	2021-08-17 13:07:17 -04:00
Daniel Nephin	7f71a672f3	Merge pull request #10807 from hashicorp/dnephin/remove-acl-datacenter config: remove ACLDatacenter	2021-08-17 13:07:09 -04:00
Daniel Nephin	608b291565	acl: use authz consistently as the variable name for an acl.Authorizer Follow up to https://github.com/hashicorp/consul/pull/10737#discussion_r682147950 Renames all variables for acl.Authorizer to use `authz`. Previously some places used `rule` which I believe was an old name carried over from the legacy ACL system. A couple places also used authorizer. This commit also removes another couple of authorizer nil checks that are no longer necessary.	2021-08-17 12:14:10 -04:00
hc-github-team-consul-core	e1da3da0e2	auto-updated agent/uiserver/bindata_assetfs.go from commit ae9c31338	2021-08-16 16:10:17 +00:00
Kyle Havlovitz	470558708e	Merge pull request #10843 from hashicorp/partitions/rename-default oss: Rename default partition	2021-08-12 14:45:53 -07:00
Kyle Havlovitz	98969c018a	oss: Rename default partition	2021-08-12 14:31:37 -07:00
Daniel Nephin	7c865d03ac	proxycfg: Lookup the agent token as a default When no ACL token is provided with the service registration.	2021-08-12 15:51:34 -04:00
Daniel Nephin	d189524e71	proxycfg: Add a test to show the bug When a token is not provided at registration, the agent token is not being used.	2021-08-12 15:47:59 -04:00
Mike Morris	86d76cb099	deps: upgrade gogo-protobuf to v1.3.2 (#10813 ) * deps: upgrade gogo-protobuf to v1.3.2 * go mod tidy using go 1.16 * proto: regen protobufs after upgrading gogo/protobuf Co-authored-by: Daniel Nephin <dnephin@hashicorp.com>	2021-08-12 14:05:46 -04:00
Mark Anderson	03a3ec2b55	Fixup to support unix domain socket via command line (#10758 ) Missed the need to add support for unix domain socket config via api/command line. This is a variant of the problems described in it is easy to drop one. Signed-off-by: Mark Anderson <manderson@hashicorp.com>	2021-08-12 10:05:22 -07:00
hc-github-team-consul-core	f02ea91a8b	auto-updated agent/uiserver/bindata_assetfs.go from commit ab6a67520	2021-08-11 17:05:51 +00:00
Giulio Micheloni	0bf124502e	grpc Server: turn panic into error through middleware	2021-08-07 13:21:12 +01:00
Daniel Nephin	364ef3d052	server: remove defaulting of PrimaryDatacenter The constructor for Server is not at all the appropriate place to be setting default values for a config struct that was passed in. In production this value is always set from agent/config. In tests we should set the default in a test helper.	2021-08-06 18:45:24 -04:00
Daniel Nephin	87fb26fd65	Merge pull request #10612 from bigmikes/acl-replication-fix acl: acl replication routine to report the last error message	2021-08-06 18:29:51 -04:00
Daniel Nephin	047abdd73c	acl: remove ACLDatacenter This field has been unnecessary for a while now. It was always set to the same value as PrimaryDatacenter. So we can remove the duplicate field and use PrimaryDatacenter directly. This change was made by GoLand refactor, which did most of the work for me.	2021-08-06 18:27:00 -04:00
Giulio Micheloni	5c34a48d45	String type instead of error type and changelog.	2021-08-06 22:35:27 +01:00
Daniel Nephin	9435118179	acl: remove Server.ResolveTokenIdentityAndDefaultMeta This method suffered from similar naming to a couple other methods on Server, and had not great re-use (2 callers). By copying a few of the lines into one of the callers we can move the implementation into the second caller. Once moved, we can see that ResolveTokenAndDefaultMeta is identical in both Client and Server, and likely should be further refactored, possibly into ACLResolver. This change is being made to make ACL resolution easier to trace.	2021-08-05 15:20:13 -04:00
Daniel Nephin	25f40de163	acl: remove Server.ResolveTokenToIdentityAndAuthorizer This method was an alias for ACLResolver.ResolveTokenToIdentityAndAuthorizer. By removing the method that does nothing the code becomes easier to trace.	2021-08-05 15:20:13 -04:00
Daniel Nephin	695963acb7	acl: recouple acl filtering from ACLResolver ACL filtering only needs an authorizer and a logger. We can decouple filtering from the ACLResolver by passing in the necessary logger. This change is being made in preparation for moving the ACLResolver into an acl package	2021-08-05 15:20:13 -04:00
Daniel Nephin	ba2f9a65d1	acl: remove unused error return filterACLWithAuthorizer could never return an error. This change moves us a little bit closer to being able to enable errcheck and catch problems caused by unhandled error return values.	2021-08-05 15:20:13 -04:00
Daniel Nephin	c80b9565e2	acl: rename acl.Authorizer vars to authz For consistency	2021-08-05 15:19:47 -04:00
Daniel Nephin	37c67cb280	acl: move vet functions These functions are moved to the one place they are called to improve code locality. They are being moved out of agent/consul/acl.go in preparation for moving ACLResolver to an acl package.	2021-08-05 15:19:24 -04:00
Daniel Nephin	c8eedabc7c	acl: move vetRegisterWithACL and vetDeregisterWithACL These functions are used in only one place. Move the functions next to their one caller to improve code locality. This change is being made in preparation for moving the ACLResolver into an acl package. The moved functions were previously in the same file as the ACLResolver. By moving them out of that file we may be able to move the entire file with fewer modifications.	2021-08-05 15:17:54 -04:00
Daniel Nephin	b223c2bc25	Merge pull request #10770 from hashicorp/dnephin/log-cert-expiration telemetry: add log message when certs are about to expire	2021-08-05 15:17:20 -04:00
Daniel Nephin	c866f1041a	Merge pull request #10793 from hashicorp/dnephin/acl-intentions acl: small cleanup of a couple Authorization flows	2021-08-05 15:16:49 -04:00
Dhia Ayachi	40baf98159	defer setting the state before returning to avoid stuck in `INITIALIZING` state (#10630 ) * defer setting the state before returning to avoid being stuck in `INITIALIZING` state * add changelog * move comment with the right if statement * ca: report state transition error from setSTate * update comment to reflect state transition Co-authored-by: Daniel Nephin <dnephin@hashicorp.com>	2021-08-05 14:51:19 -04:00
Daniel Nephin	79ab48ef81	Merge pull request #10768 from hashicorp/dnephin/agent-tls-cert-expiration-metric telemetry: add Agent TLS Certificate expiration metric	2021-08-04 18:42:02 -04:00
Daniel Nephin	0ca9e875e2	acl: remove special handling of services in txn_endpoint Follow up to: https://github.com/hashicorp/consul/pull/10738#discussion_r680190210 Previously we were passing an Authorizer that would always allow the operation, then later checking the authorization using vetServiceTxnOp. On the surface this seemed strange, but I think it was actually masking a bug as well. Over time `servicePreApply` was changed to add additional authorization for `service.Proxy.DestinationServiceName`, but because we were passing a nil Authorizer, that authorization was not handled on the txn_endpoint. `TxnServiceOp.FillAuthzContext` has some special handling in enterprise, so we need to make sure to continue to use that from the Txn endpoint. This commit removes the `vetServiceTxnOp` function, and passes in the `FillAuthzContext` function so that `servicePreApply` can be used by both the catalog and txn endpoints. This should be much less error prone and prevent bugs like this in the future.	2021-08-04 18:32:20 -04:00
hc-github-team-consul-core	ef162f8390	auto-updated agent/uiserver/bindata_assetfs.go from commit bcd53e73a	2021-08-04 22:27:44 +00:00
Daniel Nephin	f6d5a85561	acl: move check for Intention.DestinationName into Authorizer Follow up to https://github.com/hashicorp/consul/pull/10737#discussion_r680134445 Move the check for the Intention.DestinationName into the Authorizer to remove the need to check what kind of Authorizer is being used. It sounds like this check is only for legacy ACLs, so is probably just a safeguard .	2021-08-04 18:06:44 -04:00
Daniel Nephin	3dc113ada6	Merge pull request #10738 from hashicorp/dnephin/remove-authorizer-nil-checks-2 acl: remove the last of the authz == nil checks	2021-08-04 17:41:40 -04:00
Daniel Nephin	2e9aa91256	Merge pull request #10737 from hashicorp/dnephin/remove-authorizer-nil-checks acl: remove authz == nil checks	2021-08-04 17:39:34 -04:00
Daniel Nephin	210a850353	telemetry: add log message when certs are about to expire	2021-08-04 14:18:59 -04:00
Daniel Nephin	13aa7b70d5	telemetry: fix a couple bugs in cert expiry metrics 1. do not emit the metric if Query fails 2. properly check for PrimaryUsersIntermediate, the logic was inverted Also improve the logging by including the metric name in the log message	2021-08-04 13:51:44 -04:00
Daniel Nephin	1673b3a68c	telemetry: add a metric for agent TLS cert expiry	2021-08-04 13:51:44 -04:00
Dhia Ayachi	6ed6966a1f	fix state index for `CAOpSetRootsAndConfig` op (#10675 ) * fix state index for `CAOpSetRootsAndConfig` op * add changelog * Update changelog Co-authored-by: Daniel Nephin <dnephin@hashicorp.com> * remove the change log as it's not needed Co-authored-by: Daniel Nephin <dnephin@hashicorp.com>	2021-08-04 13:07:49 -04:00
hc-github-team-consul-core	4b2ada0dcc	auto-updated agent/uiserver/bindata_assetfs.go from commit 8ad1ab9c0	2021-08-04 16:47:13 +00:00
Evan Culver	57aabe3455	checks: Add Interval and Timeout to API response (#10717 )	2021-08-03 15:26:49 -07:00
Daniel Nephin	953c9bee4f	acl: Remove the remaining authz == nil checks These checks were a bit more involved. They were previously skipping some code paths when the authorizer was nil. After looking through these it seems correct to remove the authz == nil check, since it will never evaluate to true.	2021-07-30 14:55:35 -04:00
Daniel Nephin	e4821a58ee	acl: remove acl == nil checks	2021-07-30 14:28:19 -04:00
Daniel Nephin	fbaeac9ecf	acl: remove authz == nil checks These case are already impossible conditions, because most of these functions already start with a check for ACLs being disabled. So the code path being removed could never be reached. The one other case (ConnectAuthorized) was already changed in a previous commit. This commit removes an impossible branch because authz == nil can never be true.	2021-07-30 13:58:35 -04:00
Daniel Nephin	b6d9d0d9f7	acl: remove many instances of authz == nil	2021-07-30 13:58:35 -04:00
Daniel Nephin	bbc05ae869	agent: remove unused agent methods These methods are no longer used. Remove the methods, and update the tests to use actual method used by production code. Also removes the 'authz == nil' check is no longer a possible code path now that we are returning a non-nil acl.Authorizer when ACLs are disabled.	2021-07-30 13:58:35 -04:00
Daniel Nephin	2503f27a36	acl: remove rule == nil checks	2021-07-30 13:58:35 -04:00
hc-github-team-consul-core	701d4ffef0	auto-updated agent/uiserver/bindata_assetfs.go from commit 2ee501be8	2021-07-30 17:58:27 +00:00
Daniel Nephin	475fec5670	Merge pull request #10632 from hashicorp/pairing/acl-authorizer-when-acl-disabled acls: Update ACL authorizer to return meaningful permission when ACLs are disabled	2021-07-30 13:22:55 -04:00
Evan Culver	241b6429c3	Fix intention endpoint test	2021-07-30 12:58:45 -04:00
Daniel Nephin	9b41e7287f	acl: use acl.ManangeAll when ACLs are disabled Instead of returning nil and checking for nilness Removes a bunch of nil checks, and fixes one test failures.	2021-07-30 12:58:24 -04:00
Blake Covarrubias	f97e843c61	Add OSS changes for specifying audit log permission mode	2021-07-30 09:58:11 -07:00
Daniel Nephin	f2f5aba1bf	Merge pull request #10707 from hashicorp/dnephin/streaming-setup-default-timeout streaming: set default query timeout	2021-07-28 18:29:28 -04:00
Daniel Nephin	057e8320f9	streaming: set a default timeout The blocking query backend sets the default value on the server side. The streaming backend does not using blocking queries, so we must set the timeout on the client.	2021-07-28 17:50:00 -04:00
hc-github-team-consul-core	f39d36d346	auto-updated agent/uiserver/bindata_assetfs.go from commit eb5512fb7	2021-07-27 21:39:22 +00:00
Chris S. Kim	33d7d48767	sync enterprise files with oss (#10705 )	2021-07-27 17:09:59 -04:00
Daniel Nephin	cfc829275c	http: don't log an error if the request is cancelled Now that we have at least one endpoint that uses context for cancellation we can encounter this scenario where the returned error is a context.Cancelled or context.DeadlineExceeded. If the request.Context().Err() is not nil, then we know the request itself was cancelled, so we can log a different message at Info level, instad of the error.	2021-07-27 17:06:59 -04:00
Daniel Nephin	bad2c4ef67	Merge pull request #10399 from hashicorp/dnephin/debug-stream-metrics debug: use the new metrics stream in debug command	2021-07-27 13:23:15 -04:00
Daniel Nephin	7d24564ff0	http: add tests for AgentMetricsStream	2021-07-26 17:53:33 -04:00
Daniel Nephin	cf2e25c6bb	http: emit indented JSON in the metrics stream endpoint To remove the need to decode and re-encode in the CLI	2021-07-26 17:53:33 -04:00
Daniel Nephin	d716f709fd	debug: use the new metrics stream in debug command	2021-07-26 17:53:32 -04:00
Freddy	b136b1795a	Reset root prune interval after TestLeader_CARootPruning completes #10645 Co-authored-by: Daniel Nephin <dnephin@hashicorp.com>	2021-07-26 15:43:40 -06:00
Chris S. Kim	6341183a84	agent: update proxy upstreams to inherit namespace from service (#10688 )	2021-07-26 17:12:29 -04:00
Freddy	57ca0ed480	Log the correlation ID when blocking queries fire (#10689 ) Knowing that blocking queries are firing does not provide much information on its own. If we know the correlation IDs we can piece together which parts of the snapshot have been populated. Some of these responses might be empty from the blocking query timing out. But if they're returning quickly I think we can reasonably assume they contain data.	2021-07-23 16:36:17 -06:00
R.B. Boyer	c271976445	state: refactor some node/coordinate state store functions to take an EnterpriseMeta (#10687 ) Note the field is not used yet.	2021-07-23 13:42:23 -05:00
R.B. Boyer	b2facb35a9	replumbing a bunch of api and agent structs for partitions (#10681 )	2021-07-22 14:33:22 -05:00
R.B. Boyer	254557a1f6	sync changes to oss files made in enterprise (#10670 )	2021-07-22 13:58:08 -05:00
R.B. Boyer	62ac98b564	agent/structs: add a bunch more EnterpriseMeta helper functions to help with partitioning (#10669 )	2021-07-22 13:20:45 -05:00
Dhia Ayachi	b725605fe4	config raft apply silent error (#10657 ) * return an error when the index is not valid * check response as bool when applying `CAOpSetConfig` * remove check for bool response * fix error message and add check to test * fix comment * add changelog	2021-07-22 10:32:27 -04:00
Freddy	7d48383041	Avoid panic on concurrent writes to cached service config map (#10647 ) If multiple instances of a service are co-located on the same node then their proxies will all share a cache entry for their resolved service configuration. This is because the cache key contains the name of the watched service but does not take into account the ID of the watching proxies. This means that there will be multiple agent service manager watches that can wake up on the same cache update. These watchers then concurrently modify the value in the cache when merging the resolved config into the local proxy definitions. To avoid this concurrent map write we will only delete the key from opaque config in the local proxy definition after the merge, rather than from the cached value before the merge.	2021-07-20 10:09:29 -06:00
hc-github-team-consul-core	aa97ed5ac6	auto-updated agent/uiserver/bindata_assetfs.go from commit 1eb7a83ee	2021-07-20 15:15:10 +00:00
Blake Covarrubias	441a6c9969	Add DNS recursor strategy option (#10611 ) This change adds a new `dns_config.recursor_strategy` option which controls how Consul queries DNS resolvers listed in the `recursors` config option. The supported options are `sequential` (default), and `random`. Closes #8807 Co-authored-by: Blake Covarrubias <blake@covarrubi.as> Co-authored-by: Priyanka Sengupta <psengupta@flatiron.com>	2021-07-19 15:22:51 -07:00
Daniel Nephin	901a5cdd8c	Merge pull request #10396 from hashicorp/dnephin/fix-more-data-races Fix some data races	2021-07-16 18:21:58 -04:00
Daniel Nephin	23dfb8e9ad	Merge pull request #10009 from hashicorp/dnephin/trim-dns-response-with-edns dns: properly trim response when EDNS is used	2021-07-16 18:09:25 -04:00
Daniel Nephin	db29c51cd2	acl: use SetHash consistently in testPolicyForID A previous commit used SetHash on two of the cases to fix a data race. This commit applies that change to all cases. Using SetHash in this test helper should ensure that the test helper behaves closer to production.	2021-07-16 17:59:56 -04:00
Daniel Nephin	63772f7ac4	dns: improve naming of error to match DNS terminology Co-authored-by: Blake Covarrubias <blake@covarrubi.as>	2021-07-16 12:40:24 -04:00
Dhia Ayachi	079decdabd	fix truncate when NS is set Also: fix test to catch the issue	2021-07-16 12:40:11 -04:00
Evan Culver	521c423075	acls: Show `AuthMethodNamespace` when reading/listing ACL token meta (#10598 )	2021-07-15 10:38:52 -07:00
Daniel Nephin	b4ab87111c	Merge pull request #10567 from hashicorp/dnephin/config-unexport-build config: unexport the remaining builder methods	2021-07-15 12:05:19 -04:00
Freddy	a942a2e025	Merge pull request #10621 from hashicorp/vuln/validate-sans	2021-07-15 09:43:55 -06:00
Daniel Nephin	f286ea0922	Fix godoc comment Co-authored-by: Freddy <freddygv@users.noreply.github.com>	2021-07-15 11:22:46 -04:00
R.B. Boyer	e018d8a10b	xds: ensure single L7 deny intention with default deny policy does not result in allow action (CVE-2021-36213) (#10619 )	2021-07-15 10:09:00 -05:00
hc-github-team-consul-core	6bf7c98227	auto-updated agent/uiserver/bindata_assetfs.go from commit 0762da3a6	2021-07-15 11:23:49 +00:00
Giulio Micheloni	3a1afd8f57	acl: fix error type into a string type for serialization issue acl_endpoint_test.go:507: Error Trace: acl_endpoint_test.go:507 retry.go:148 retry.go:149 retry.go:103 acl_endpoint_test.go:504 Error: Received unexpected error: codec.decoder: decodeValue: Cannot decode non-nil codec value into nil error (1 methods) Test: TestACLEndpoint_ReplicationStatus	2021-07-15 11:31:44 +02:00
freddygv	b6b42c34dc	Add TODOs about partition handling	2021-07-14 22:21:55 -06:00
freddygv	3d4fa44c22	Update golden files	2021-07-14 22:21:55 -06:00
freddygv	a7de87e95b	Validate SANs for passthrough clusters and failovers	2021-07-14 22:21:55 -06:00
freddygv	a6f7d806f6	Update golden files to account for SAN validation	2021-07-14 22:21:55 -06:00
freddygv	3f11449363	Validate Subject Alternative Name for upstreams These changes ensure that the identity of services dialed is cryptographically verified. For all upstreams we validate against SPIFFE IDs in the format used by Consul's service mesh: spiffe://<trust-domain>/ns/<namespace>/dc/<datacenter>/svc/<service>	2021-07-14 22:20:27 -06:00
Daniel Nephin	27871498f0	Fix a data race in TestACLResolver_Client By setting the hash when we create the policy. ``` WARNING: DATA RACE Read at 0x00c0028b4b10 by goroutine 1182: github.com/hashicorp/consul/agent/structs.(ACLPolicy).SetHash() /home/daniel/pers/code/consul/agent/structs/acl.go:701 +0x40d github.com/hashicorp/consul/agent/structs.ACLPolicies.resolveWithCache() /home/daniel/pers/code/consul/agent/structs/acl.go:779 +0xfe github.com/hashicorp/consul/agent/structs.ACLPolicies.Compile() /home/daniel/pers/code/consul/agent/structs/acl.go:809 +0xf1 github.com/hashicorp/consul/agent/consul.(ACLResolver).ResolveTokenToIdentityAndAuthorizer() /home/daniel/pers/code/consul/agent/consul/acl.go:1226 +0x6ef github.com/hashicorp/consul/agent/consul.resolveTokenAsync() /home/daniel/pers/code/consul/agent/consul/acl_test.go:66 +0x5c Previous write at 0x00c0028b4b10 by goroutine 1509: github.com/hashicorp/consul/agent/structs.(ACLPolicy).SetHash() /home/daniel/pers/code/consul/agent/structs/acl.go:730 +0x3a8 github.com/hashicorp/consul/agent/structs.ACLPolicies.resolveWithCache() /home/daniel/pers/code/consul/agent/structs/acl.go:779 +0xfe github.com/hashicorp/consul/agent/structs.ACLPolicies.Compile() /home/daniel/pers/code/consul/agent/structs/acl.go:809 +0xf1 github.com/hashicorp/consul/agent/consul.(ACLResolver).ResolveTokenToIdentityAndAuthorizer() /home/daniel/pers/code/consul/agent/consul/acl.go:1226 +0x6ef github.com/hashicorp/consul/agent/consul.resolveTokenAsync() /home/daniel/pers/code/consul/agent/consul/acl_test.go:66 +0x5c Goroutine 1182 (running) created at: github.com/hashicorp/consul/agent/consul.TestACLResolver_Client.func4() /home/daniel/pers/code/consul/agent/consul/acl_test.go:1669 +0x459 testing.tRunner() /usr/lib/go/src/testing/testing.go:1193 +0x202 Goroutine 1509 (running) created at: github.com/hashicorp/consul/agent/consul.TestACLResolver_Client.func4() /home/daniel/pers/code/consul/agent/consul/acl_test.go:1668 +0x415 testing.tRunner() /usr/lib/go/src/testing/testing.go:1193 +0x202 ```	2021-07-14 18:58:16 -04:00
Daniel Nephin	c3c8058fd7	agent: remove deprecated call in a test	2021-07-14 18:58:16 -04:00
Daniel Nephin	9d471269d8	agent: fix a data race in a test The test was modifying a pointer to a struct that had been passed to another goroutine. Instead create a new struct to modify. ``` WARNING: DATA RACE Write at 0x00c01407c3c0 by goroutine 832: github.com/hashicorp/consul/agent.TestServiceManager_PersistService_API() /home/daniel/pers/code/consul/agent/service_manager_test.go:446 +0x1d86 testing.tRunner() /usr/lib/go/src/testing/testing.go:1193 +0x202 Previous read at 0x00c01407c3c0 by goroutine 938: reflect.typedmemmove() /usr/lib/go/src/runtime/mbarrier.go:177 +0x0 reflect.Value.Set() /usr/lib/go/src/reflect/value.go:1569 +0x13b github.com/mitchellh/copystructure.(walker).Primitive() /home/daniel/go/pkg/mod/github.com/mitchellh/copystructure@v1.0.0/copystructure.go:289 +0x190 github.com/mitchellh/reflectwalk.walkPrimitive() /home/daniel/go/pkg/mod/github.com/mitchellh/reflectwalk@v1.0.1/reflectwalk.go:252 +0x31b github.com/mitchellh/reflectwalk.walk() /home/daniel/go/pkg/mod/github.com/mitchellh/reflectwalk@v1.0.1/reflectwalk.go:179 +0x24d github.com/mitchellh/reflectwalk.walkStruct() /home/daniel/go/pkg/mod/github.com/mitchellh/reflectwalk@v1.0.1/reflectwalk.go:386 +0x4ec github.com/mitchellh/reflectwalk.walk() /home/daniel/go/pkg/mod/github.com/mitchellh/reflectwalk@v1.0.1/reflectwalk.go:188 +0x656 github.com/mitchellh/reflectwalk.walkStruct() /home/daniel/go/pkg/mod/github.com/mitchellh/reflectwalk@v1.0.1/reflectwalk.go:386 +0x4ec github.com/mitchellh/reflectwalk.walk() /home/daniel/go/pkg/mod/github.com/mitchellh/reflectwalk@v1.0.1/reflectwalk.go:188 +0x656 github.com/mitchellh/reflectwalk.Walk() /home/daniel/go/pkg/mod/github.com/mitchellh/reflectwalk@v1.0.1/reflectwalk.go:92 +0x164 github.com/mitchellh/copystructure.Config.Copy() /home/daniel/go/pkg/mod/github.com/mitchellh/copystructure@v1.0.0/copystructure.go:69 +0xe7 github.com/mitchellh/copystructure.Copy() /home/daniel/go/pkg/mod/github.com/mitchellh/copystructure@v1.0.0/copystructure.go:13 +0x84 github.com/hashicorp/consul/agent.mergeServiceConfig() /home/daniel/pers/code/consul/agent/service_manager.go:362 +0x56 github.com/hashicorp/consul/agent.(serviceConfigWatch).handleUpdate() /home/daniel/pers/code/consul/agent/service_manager.go:279 +0x250 github.com/hashicorp/consul/agent.(serviceConfigWatch).runWatch() /home/daniel/pers/code/consul/agent/service_manager.go:246 +0x2d4 Goroutine 832 (running) created at: testing.(T).Run() /usr/lib/go/src/testing/testing.go:1238 +0x5d7 testing.runTests.func1() /usr/lib/go/src/testing/testing.go:1511 +0xa6 testing.tRunner() /usr/lib/go/src/testing/testing.go:1193 +0x202 testing.runTests() /usr/lib/go/src/testing/testing.go:1509 +0x612 testing.(M).Run() /usr/lib/go/src/testing/testing.go:1417 +0x3b3 main.main() _testmain.go:1181 +0x236 Goroutine 938 (running) created at: github.com/hashicorp/consul/agent.(serviceConfigWatch).start() /home/daniel/pers/code/consul/agent/service_manager.go:223 +0x4e4 github.com/hashicorp/consul/agent.(ServiceManager).AddService() /home/daniel/pers/code/consul/agent/service_manager.go:98 +0x344 github.com/hashicorp/consul/agent.(Agent).addServiceLocked() /home/daniel/pers/code/consul/agent/agent.go:1942 +0x2e4 github.com/hashicorp/consul/agent.(*Agent).AddService() /home/daniel/pers/code/consul/agent/agent.go:1929 +0x337 github.com/hashicorp/consul/agent.TestServiceManager_PersistService_API() /home/daniel/pers/code/consul/agent/service_manager_test.go:400 +0x17c4 testing.tRunner() /usr/lib/go/src/testing/testing.go:1193 +0x202 ```	2021-07-14 18:58:16 -04:00
Daniel Nephin	6703787740	agent: fix a data race in DNS tests The dnsConfig pulled from the atomic.Value is a pointer, so modifying it in place creates a data race. Use the exported ReloadConfig interface instead.	2021-07-14 18:58:16 -04:00
Daniel Nephin	2946e42a9e	agent: fix two data race in agent tests The LogOutput io.Writer used by TestAgent must allow concurrent reads and writes, and a bytes.Buffer does not allow this. The bytes.Buffer must be wrapped with a lock to make this safe.	2021-07-14 18:58:16 -04:00
Daniel Nephin	ff26294d63	consul: fix data race in leader CA tests Some global variables are patched to shorter values in these tests. But the goroutines that read them can outlive the test because nothing waited for them to exit. This commit adds a Wait() method to the routine manager, so that tests can wait for the goroutines to exit. This prevents the data race because the 'reset to original value' can happen after all other goroutines have stopped.	2021-07-14 18:58:15 -04:00
Daniel Nephin	edd755b7ab	dns: correct rcode for qtype not supported A previous commit started using QueryRefuced, but that is not correct. QueryRefuced refers to the OpCode, not the query type. Instead use errNoAnswer because we have no records for that query type.	2021-07-14 17:48:50 -04:00
Dhia Ayachi	48171c43f4	Check response len do not exceed max Buffer size	2021-07-14 17:15:34 -04:00
Dhia Ayachi	8fcac3cef6	add missing test for truncate	2021-07-14 17:15:34 -04:00
Daniel Nephin	b4abf8b0ec	dns: remove network parameter from two funcs Now that trimDNSResponse is handled by the caller we don't need to pass this value around. We can remove it from both the serviceLookup struct, and two functions.	2021-07-14 17:15:34 -04:00
Daniel Nephin	4712e24749	dns: trim response immediately before the write Previously the response was being trimmed before adding the EDNS values, which could cause it to exceed the max size.	2021-07-14 17:15:34 -04:00
Daniel Nephin	a9e9c6c23e	dns: handle errors from dispatch	2021-07-14 17:15:34 -04:00
Daniel Nephin	6cf9ecc1c9	dns: error response from dispatch So that dispatch can communicate status back to the caller.	2021-07-14 17:15:34 -04:00
Daniel Nephin	9298cfe0f6	dns: refactor dispatch to use an explicit return in each case In preparation for changing the return value, so that SOA, eDNS trimming and 'not found' errors can be handled in a single place.	2021-07-14 17:15:34 -04:00
Daniel Nephin	b09aa1e3c6	dns: small refactor to setEDNS to return early Using a guard clause instead of a long nested if. The diff is best viewed with whitespace turned off.	2021-07-14 17:15:34 -04:00
Daniel Nephin	f1bc7bd49a	dns: remove unused method It was added in 5934f803bfb54c1ceeeb6518398f1b82a726459f but it was never used.	2021-07-14 17:15:34 -04:00
Daniel Nephin	e3d781d99c	dns: remove unnecessary function wrapping The dispatch function was called from a single place and did nothing but add a default value. Removing it makes code easier to trace by removing an unnecessary hop.	2021-07-14 17:15:33 -04:00
Kyle Havlovitz	e97bc2bda7	http: add partition query param parsing	2021-07-14 12:07:38 -07:00
hc-github-team-consul-core	8c5723ec98	auto-updated agent/uiserver/bindata_assetfs.go from commit 3e80e637b	2021-07-14 18:00:42 +00:00
Giulio Micheloni	96fe1f4078	acl: acl replication routine to report the last error message	2021-07-14 11:50:23 +02:00
Daniel Nephin	57c5a40869	Merge pull request #10588 from hashicorp/dnephin/config-fix-ports-grpc config: rename `ports.grpc` to `ports.xds`	2021-07-13 13:11:38 -04:00
Daniel Nephin	15300b873a	fix backwards compat for envoy command The compatv2 integration tests were failing because they use an older CLI version with a newer HTTP API. This commit restores the GRPCPort field to the DebugConfig output to allow older CIs to continue to fetch the port.	2021-07-13 12:31:49 -04:00
Daniel Nephin	25dc14f036	Apply suggestions from code review Co-authored-by: R.B. Boyer <4903+rboyer@users.noreply.github.com>	2021-07-13 12:31:49 -04:00
Daniel Nephin	a5f93e5596	command/envoy: stop using the DebugConfig from Self endpoint The DebugConfig in the self endpoint can change at any time. It's not a stable API. With the previous change to rename GRPCPort to XDSPort this command would have broken. This commit adds the XDSPort to a stable part of the XDS api, and changes the envoy command to read this new field. It includes support for the old API as well, in case a newer CLI is used with an older API, and adds a test for both cases.	2021-07-13 12:31:49 -04:00
Daniel Nephin	ef6bc739a1	config: update config settings and flags for ports.xds	2021-07-13 12:31:48 -04:00
Dhia Ayachi	53b45a8441	check expiry date of the root/intermediate before using it to sign a leaf (#10500 ) * ca: move provider creation into CAManager This further decouples the CAManager from Server. It reduces the interface between them and removes the need for the SetLogger method on providers. * ca: move SignCertificate to CAManager To reduce the scope of Server, and keep all the CA logic together * ca: move SignCertificate to the file where it is used * auto-config: move autoConfigBackend impl off of Server Most of these methods are used exclusively for the AutoConfig RPC endpoint. This PR uses a pattern that we've used in other places as an incremental step to reducing the scope of Server. * fix linter issues * check error when `raftApplyMsgpack` * ca: move SignCertificate to CAManager To reduce the scope of Server, and keep all the CA logic together * check expiry date of the intermediate before using it to sign a leaf * fix typo in comment Co-authored-by: Kyle Havlovitz <kylehav@gmail.com> * Fix test name * do not check cert start date * wrap error to mention it is the intermediate expired * Fix failing test * update comment Co-authored-by: Daniel Nephin <dnephin@hashicorp.com> * use shim to avoid sleep in test * add root cert validation * remove duplicate code * Revert "fix linter issues" This reverts commit 6356302b54f06c8f2dee8e59740409d49e84ef24. * fix import issue * gofmt leader_connect_ca * add changelog entry * update error message Co-authored-by: Freddy <freddygv@users.noreply.github.com> * fix error message in test Co-authored-by: Daniel Nephin <dnephin@hashicorp.com> Co-authored-by: Kyle Havlovitz <kylehav@gmail.com> Co-authored-by: Freddy <freddygv@users.noreply.github.com>	2021-07-13 12:15:06 -04:00
R.B. Boyer	ae8b526be8	connect/ca: ensure edits to the key type/bits for the connect builtin CA will regenerate the roots (#10330 ) progress on #9572	2021-07-13 11:12:07 -05:00
R.B. Boyer	0537922c6c	connect/ca: require new vault mount points when updating the key type/bits for the vault connect CA provider (#10331 ) progress on #9572	2021-07-13 11:11:46 -05:00
Daniel Nephin	58cf5767a8	Merge pull request #10479 from hashicorp/dnephin/ca-provider-explore-2 ca: move Server.SignIntermediate to CAManager	2021-07-12 19:03:43 -04:00
Daniel Nephin	a22bdb2ac9	Merge pull request #10445 from hashicorp/dnephin/ca-provider-explore ca: isolate more of the CA logic in CAManager	2021-07-12 15:26:23 -04:00
Daniel Nephin	fdb0ba8041	ca: use provider constructors to be more consistent Adds a contructor for the one provider that did not have one.	2021-07-12 14:04:34 -04:00
Dhia Ayachi	3eac4ffda4	check error when `raftApplyMsgpack`	2021-07-12 13:42:51 -04:00
Daniel Nephin	34c8585b29	auto-config: move autoConfigBackend impl off of Server Most of these methods are used exclusively for the AutoConfig RPC endpoint. This PR uses a pattern that we've used in other places as an incremental step to reducing the scope of Server.	2021-07-12 13:42:40 -04:00
Daniel Nephin	605275b4dc	ca: move SignCertificate to the file where it is used	2021-07-12 13:42:39 -04:00
Daniel Nephin	c2e85f25d4	ca: move SignCertificate to CAManager To reduce the scope of Server, and keep all the CA logic together	2021-07-12 13:42:39 -04:00
Daniel Nephin	6fced99ea9	Merge pull request #10590 from hashicorp/dnephin/tls-config-less-copy config: remove duplicate tlsutil.Config fields from agent/consul.Config	2021-07-12 13:00:52 -04:00
hc-github-team-consul-core	dfff26a758	auto-updated agent/uiserver/bindata_assetfs.go from commit a96e87aec	2021-07-12 13:33:26 +00:00
Dhia Ayachi	a0320169fe	add missing state reset when stopping ca manager	2021-07-12 09:32:36 -04:00
Daniel Nephin	68d5f7769a	ca: fix mockCAServerDelegate to work with the new interface raftApply was removed so ApplyCARequest needs to handle all the possible operations Also set the providerShim to use the mock provider. other changes are small test improvements that were necessary to debug the failures.	2021-07-12 09:32:36 -04:00
Daniel Nephin	6d4b0ce194	ca: remove unused method and small refactor to getCAProvider so that GoLand is less confused about what it is doing. Previously it was reporting that the for condition was always true, which was not the case.	2021-07-12 09:32:35 -04:00
Daniel Nephin	4330122d9a	ca: remove raftApply from delegate interface After moving ca.ConsulProviderStateDelegate into the interface we now have the ApplyCARequest method which does the same thing. Use this more specific method instead of raftApply.	2021-07-12 09:32:35 -04:00
Daniel Nephin	fae0a8f851	ca: move generateCASignRequest to the delegate This method on Server was only used by the caDelegateWithState, so move it there until we can move it entirely into CAManager.	2021-07-12 09:32:35 -04:00
Daniel Nephin	d4bb9fd97a	ca: move provider creation into CAManager This further decouples the CAManager from Server. It reduces the interface between them and removes the need for the SetLogger method on providers.	2021-07-12 09:32:33 -04:00
Daniel Nephin	fc629d9eaa	ca-manager: move provider shutdown into CAManager Reducing the coupling between Server and CAManager	2021-07-12 09:27:28 -04:00
Daniel Nephin	1e23d181b5	config: remove misleading UseTLS field This field was documented as enabling TLS for outgoing RPC, but that was not the case. All this field did was set the use_tls serf tag. Instead of setting this field in a place far from where it is used, move the logic to where the serf tag is set, so that the code is much more obvious.	2021-07-09 19:01:45 -04:00
Daniel Nephin	3c60a46376	config: remove duplicate TLSConfig fields from agent/consul.Config tlsutil.Config already presents an excellent structure for this configuration. Copying the runtime config fields to agent/consul.Config makes code harder to trace, and provides no advantage. Instead of copying the fields around, use the tlsutil.Config struct directly instead. This is one small step in removing the many layers of duplicate configuration.	2021-07-09 18:49:42 -04:00
Daniel Nephin	2ab6be6a88	config: update GRPCPort and addr in runtime config	2021-07-09 12:31:53 -04:00
Daniel Nephin	9c6458c6c2	rename GRPC->XDS where appropriate	2021-07-09 12:17:45 -04:00
Evan Culver	5ff191ad99	Add support for returning ACL secret IDs for accessors with acl:write (#10546 )	2021-07-08 15:13:08 -07:00
Daniel Nephin	dcb90fb832	Merge pull request #10570 from hashicorp/copy-of-master Changes that were accidentally merged into the old master branch	2021-07-08 16:28:56 -04:00
R.B. Boyer	0e6a482b76	config: add agent config flag for enterprise clients to indicate they wish to join a particular partition (#10572 )	2021-07-08 10:03:38 -05:00
Dhia Ayachi	e5dbf5e55b	Add ca certificate metrics (#10504 ) * add intermediate ca metric routine * add Gauge config for intermediate cert * Stop metrics routine when stopping leader * add changelog entry * updage changelog Co-authored-by: Daniel Nephin <dnephin@hashicorp.com> * use variables instead of a map * go imports sort * Add metrics for primary and secondary ca * start metrics routine in the right DC * add telemetry documentation * update docs * extract expiry fetching in a func * merge metrics for primary and secondary into signing ca metric Co-authored-by: Daniel Nephin <dnephin@hashicorp.com>	2021-07-07 09:41:01 -04:00
hc-github-team-consul-core	83c543cd6b	auto-updated agent/uiserver/bindata_assetfs.go from commit 6fbeea5de	2021-07-07 10:51:32 +00:00
Jared Kirschner	37f25aed1d	Merge pull request #10559 from jkirschner-hashicorp/fix-autopilot-config-post-default-values Fix defaults for autopilot config update	2021-07-06 19:19:52 -04:00
hc-github-team-consul-core	93607fa2ee	auto-updated agent/uiserver/bindata_assetfs.go from commit 2c4f22a9f	2021-07-06 22:54:28 +00:00
Daniel Nephin	14527dd005	Merge pull request #10552 from hashicorp/dnephin/ca-remove-rotation-period ca: remove unused RotationPeriod field	2021-07-06 18:49:33 -04:00
Daniel Nephin	e8e5defc71	config: unexport the remaining builder methods And remove BuildAndValidate. This commit completes some earlier work to reduce the config interface a single Load function. The last remaining test was converted to use Load instad of BuildAndValidate.	2021-07-06 18:42:09 -04:00
Jared Kirschner	1449806c3d	Fix defaults for autopilot config update Previously, for a POST request to the /v1/operator/autopilot/configuration endpoint, any fields not included in the payload were set to a zero-initialized value rather than the documented default value. Now, if an optional field is not included in the payload, it will be set to its documented default value: - CleanupDeadServers: true - LastContactThreshold: "200ms" - MaxTrailingLogs: 250 - MinQuorum: 0 - ServerStabilizationTime: "10s" - RedundancyZoneTag: "" - DisableUpgradeMigration: false - UpgradeVersionTag: ""	2021-07-06 18:39:40 -04:00
hc-github-team-consul-core	164db92b15	auto-updated agent/uiserver/bindata_assetfs.go from commit 74070c095	2021-07-06 16:06:51 +00:00
hc-github-team-consul-core	ff2360d430	auto-updated agent/uiserver/bindata_assetfs.go from commit 5f73de6fb	2021-07-06 15:50:57 +00:00
jkirschner-hashicorp	31bbab8ae7	Merge pull request #10560 from jkirschner-hashicorp/change-sane-to-reasonable Replace use of 'sane' where appropriate	2021-07-06 11:46:04 -04:00
Daniel Nephin	b4a10443d1	ca: remove unused RotationPeriod field This field was never used. Since it is persisted as part of a map[string]interface{} it is pretty easy to remove it.	2021-07-05 19:15:44 -04:00
Jared Kirschner	4c3b1b8b7b	Replace use of 'sane' where appropriate HashiCorp voice, style, and language guidelines recommend avoiding ableist language unless its reference to ability is accurate in a particular use.	2021-07-02 12:18:46 -04:00
Dhia Ayachi	b57cf27e8f	Format certificates properly (rfc7468) with a trailing new line (#10411 ) * trim carriage return from certificates when inserting rootCA in the inMemDB * format rootCA properly when returning the CA on the connect CA endpoint * Fix linter warnings * Fix providers to trim certs before returning it * trim newlines on write when possible * add changelog * make sure all provider return a trailing newline after the root and intermediate certs * Fix endpoint to return trailing new line * Fix failing test with vault provider * make test more robust * make sure all provider return a trailing newline after the leaf certs * Check for suffix before removing newline and use function * Add comment to consul provider * Update change log Co-authored-by: R.B. Boyer <4903+rboyer@users.noreply.github.com> * fix typo * simplify code callflow Co-authored-by: R.B. Boyer <4903+rboyer@users.noreply.github.com> * extract requireNewLine as shared func * remove dependency to testify in testing file * remove extra newline in vault provider * Add cert newline fix to envoy xds * remove new line from mock provider * Remove adding a new line from provider and fix it when the cert is read * Add a comment to explain the fix * Add missing for leaf certs * fix missing new line * fix missing new line in leaf certs * remove extra new line in test * updage changelog Co-authored-by: Daniel Nephin <dnephin@hashicorp.com> * fix in vault provider and when reading cache (RPC call) * fix AWS provider * fix failing test in the provider * remove comments and empty lines * add check for empty cert in test * fix linter warnings * add new line for leaf and private key * use string concat instead of Sprintf * fix new lines for leaf signing * preallocate slice and remove append * Add new line to `SignIntermediate` and `CrossSignCA` Co-authored-by: R.B. Boyer <4903+rboyer@users.noreply.github.com> Co-authored-by: Daniel Nephin <dnephin@hashicorp.com>	2021-06-30 20:48:29 -04:00
Daniel Nephin	72ea979c39	Merge pull request #10515 from hashicorp/dnephin/fix-arm32-atomic-aligment Fix panic on 32-bit platforms	2021-06-30 16:40:20 -04:00
Daniel Nephin	843e08bb23	testing: fix a test for 32-bit The hcl decoding apparently uses strconv.ParseInt, which fails to parse a 64bit int. Since hcl v1 is basically EOl, it seems unlikely we'll fix this in hcl. Since this test is only about loading values from config files, the extra large number doesn't seem important. Trim a few zeros from the numbers so that they parse properly on 32bit platforms. Also skip a slow test when -short is used.	2021-06-29 16:10:21 -04:00
Daniel Nephin	e226733b26	fix 64-bit aligment for 32-bit platforms sync/atomic must be used with 64-bit aligned fields, and that alignment is difficult to ensure unless the field is the first one in the struct. https://golang.org/pkg/sync/atomic/#pkg-note-BUG.	2021-06-29 16:10:21 -04:00
Daniel Nephin	ffefcdc025	streaming: support X-Cache-Hit header If a value was already available in the local view the request is considered a cache hit. If the materialized had to wait for a value, it is considered a cache miss.	2021-06-28 17:29:23 -04:00
Daniel Nephin	a4a390d7c5	streaming: fix enable of streaming in the client And add checks to all the tests that explicitly use streaming.	2021-06-28 17:23:14 -04:00
Daniel Nephin	62beaa80f3	Remove a racy and failing test This test is super racy (it's not just a single line). This test also starts failing once streaming is enabled, because the cache rate limit no longer applies to the requests in the test. The queries use streaming instead of the cache. This test is no longer valid, and the functionality is already well tested by TestCacheThrottle. Instead of spending time rewriting this test, let's remove it. ``` WARNING: DATA RACE Read at 0x00c01de410fc by goroutine 735: github.com/hashicorp/consul/agent.TestCacheRateLimit.func1() /home/daniel/pers/code/consul/agent/agent_test.go:1024 +0x9af github.com/hashicorp/consul/testrpc.WaitForTestAgent() /home/daniel/pers/code/consul/testrpc/wait.go:99 +0x209 github.com/hashicorp/consul/agent.TestCacheRateLimit.func1() /home/daniel/pers/code/consul/agent/agent_test.go:966 +0x1ad testing.tRunner() /usr/lib/go/src/testing/testing.go:1193 +0x202 Previous write at 0x00c01de410fc by goroutine 605: github.com/hashicorp/consul/agent.TestCacheRateLimit.func1.2() /home/daniel/pers/code/consul/agent/agent_test.go:998 +0xe9 Goroutine 735 (running) created at: testing.(*T).Run() /usr/lib/go/src/testing/testing.go:1238 +0x5d7 github.com/hashicorp/consul/agent.TestCacheRateLimit() /home/daniel/pers/code/consul/agent/agent_test.go:961 +0x375 testing.tRunner() /usr/lib/go/src/testing/testing.go:1193 +0x202 Goroutine 605 (finished) created at: github.com/hashicorp/consul/agent.TestCacheRateLimit.func1() /home/daniel/pers/code/consul/agent/agent_test.go:1022 +0x91e github.com/hashicorp/consul/testrpc.WaitForTestAgent() /home/daniel/pers/code/consul/testrpc/wait.go:99 +0x209 github.com/hashicorp/consul/agent.TestCacheRateLimit.func1() /home/daniel/pers/code/consul/agent/agent_test.go:966 +0x1ad testing.tRunner() /usr/lib/go/src/testing/testing.go:1193 +0x202 ```	2021-06-28 17:23:13 -04:00
Daniel Nephin	d0e32cc3ba	http: add an X-Consul-Query-Backend header to responses So that it is easier to detect and test when streaming is being used.	2021-06-28 16:44:58 -04:00
Daniel Nephin	902bd80989	Merge pull request #10506 from hashicorp/dnephin/docs-rpc-query-metrics docs: correct some misleading telemetry docs	2021-06-28 12:33:57 -04:00
Daniel Nephin	86244967c5	docs: correct some misleading telemetry docs The query metrics are actually reported for all read queries, not only ones that use a MinIndex to block for updates. Also clarify the raft.apply metric is only on the leader.	2021-06-28 12:20:53 -04:00
R.B. Boyer	30ccd5c2d9	connect: include optional partition prefixes in SPIFFE identifiers (#10507 ) NOTE: this does not include any intentions enforcement changes yet	2021-06-25 16:47:47 -05:00
R.B. Boyer	c3d5a2a5ab	connect/ca: cease including the common name field in generated certs (#10424 ) As part of this change, we ensure that the SAN extensions are marked as critical when the subject is empty so that AWS PCA tolerates the loss of common names well and continues to function as a Connect CA provider. Parts of this currently hack around a bug in crypto/x509 and can be removed after https://go-review.googlesource.com/c/go/+/329129 lands in a Go release. Note: the AWS PCA tests do not run automatically, but the following passed locally for me: ENABLE_AWS_PCA_TESTS=1 go test ./agent/connect/ca -run TestAWS	2021-06-25 13:00:00 -05:00
hc-github-team-consul-core	f0f5d9bfc4	auto-updated agent/uiserver/bindata_assetfs.go from commit ace794d21	2021-06-25 09:47:01 +00:00
Dhia Ayachi	8b967b3bb6	return an empty record when asked for an addr dns with type other then A, AAAA and ANY (#10401 ) * return an invalid record when asked for an addr dns with type other then A and AAAA * add changelog * fix ANY use case and add a test for it * update changelog type Co-authored-by: Daniel Nephin <dnephin@hashicorp.com> * return empty response if the question record type do not match for addr * set comment in the right place * return A\AAAA record in extra section if record type is not A\AAAA for addr * Fix failing test * remove commented code Co-authored-by: Daniel Nephin <dnephin@hashicorp.com> * use require for test validation * use variable to init struct * fix failing test * Update agent/dns.go Co-authored-by: Daniel Nephin <dnephin@hashicorp.com> * Update .changelog/10401.txt Co-authored-by: Daniel Nephin <dnephin@hashicorp.com> * Update agent/dns.go Co-authored-by: Daniel Nephin <dnephin@hashicorp.com> * Update agent/dns.go Co-authored-by: Daniel Nephin <dnephin@hashicorp.com> * Update agent/dns.go Co-authored-by: Daniel Nephin <dnephin@hashicorp.com> * fix compilation error Co-authored-by: Daniel Nephin <dnephin@hashicorp.com>	2021-06-24 20:44:44 -04:00
Daniel Nephin	cefbb5bf3f	Merge pull request #10476 from hashicorp/dnephin/ca-primary-uses-intermediate ca: replace ca.PrimaryIntermediateProviders	2021-06-24 14:05:19 -04:00
R.B. Boyer	9778bee35a	structs: prohibit config entries from referencing more than one partition at a time (#10478 ) affected kinds: service-defaults, ingress-gateway, terminating-gateway, service-intentions	2021-06-23 16:44:10 -05:00
R.B. Boyer	952df8b491	structs: prevent service-defaults upstream configs from using wildcard names or namespaces (#10475 )	2021-06-23 15:48:54 -05:00
Daniel Nephin	72b30174fa	ca: replace ca.PrimaryIntermediateProviders With an optional interface that providers can use to indicate if they use an intermediate cert in the primary DC. This removes the need to look up the provider config when renewing the intermediate.	2021-06-23 15:47:30 -04:00
R.B. Boyer	b412ca0f89	structs: add some missing config entry validation and clean up tests (#10465 ) Affects kinds: service-defaults, ingress-gateway, terminating-gateway	2021-06-23 14:11:23 -05:00
hc-github-team-consul-core	a6a8e421d2	auto-updated agent/uiserver/bindata_assetfs.go from commit c78f7ecb2	2021-06-23 08:24:11 +00:00
Daniel Nephin	60614a3f68	Merge pull request #10444 from hashicorp/dnephin/tls-cert-exploration-2 tlsutil: reduce interface provided to auto-config	2021-06-22 15:32:51 -04:00
Daniel Nephin	86c9cb037f	tlsutil: reduce interface provided to auto-config Replace two methods with a single one that returns the cert. This moves more of the logic into the single caller (auto-config). tlsutil.Configurator is widely used. By keeping it smaller and focused only on storing and returning TLS config, we make the code easier to follow. These two methods were more related to auto-config than to tlsutil, so reducing the interface moves the logic closer to the feature that requires it.	2021-06-22 14:11:28 -04:00
hc-github-team-consul-core	5add535d78	auto-updated agent/uiserver/bindata_assetfs.go from commit 043f631b7	2021-06-22 18:01:05 +00:00
hc-github-team-consul-core	605267d3e7	auto-updated agent/uiserver/bindata_assetfs.go from commit 4bddd5210	2021-06-22 13:24:58 +00:00
Daniel Nephin	6a61c5d772	proxycfg: remove unused method This method was accidentally re-introduced in an earlier rebase. It was removed in ed1082510dc80523b1f2a3a740fa5a13c77594f9 as part of the tproxy work.	2021-06-21 15:54:40 -04:00
Daniel Nephin	41bf0670a8	proxycfg: move each handler into a seprate file There is no interaction between these handlers, so splitting them into separate files makes it easier to discover the full implementation of each kindHandler.	2021-06-21 15:48:40 -04:00
hc-github-team-consul-core	27b6c61384	auto-updated agent/uiserver/bindata_assetfs.go from commit 5f17062b0	2021-06-21 11:11:47 +00:00
hc-github-team-consul-core	7b8af4dba4	auto-updated agent/uiserver/bindata_assetfs.go from commit 9eab71514	2021-06-21 10:59:56 +00:00
hc-github-team-consul-core	365ab6df11	auto-updated agent/uiserver/bindata_assetfs.go from commit ac424187f	2021-06-21 10:45:46 +00:00
Daniel Nephin	f4c1f982d1	Merge pull request #9924 from hashicorp/dnephin/cert-expiration-metric connect: emit a metric for the seconds until root CA expiry	2021-06-18 14:18:55 -04:00
Daniel Nephin	96896409d6	Merge pull request #9489 from hashicorp/dnephin/proxycfg-state-2 proxycfg: split state into a handler for each kind	2021-06-18 13:57:28 -04:00
Daniel Nephin	1da58902aa	Merge pull request #10425 from hashicorp/dnephin/tls-cert-exploration tlsutil: fix a possible panic, and make the package safer	2021-06-18 13:54:07 -04:00
Daniel Nephin	b0a2252fa0	inline assignment	2021-06-17 15:43:04 -04:00
Nitya Dhanushkodi	ffbbe9e73f	proxycfg: reference to entry in map should not panic	2021-06-17 11:49:04 -07:00
Daniel Nephin	b7293242f1	Replace type conversion with embedded structs	2021-06-17 13:23:35 -04:00
Daniel Nephin	40ff895927	proxycfg: split state into kind-specific types This commit extracts all the kind-specific logic into handler types, and keeps the generic parts on the state struct. This change should make it easier to add new kinds, and see the implementation of each kind more clearly.	2021-06-16 14:04:01 -04:00
Daniel Nephin	b57f03feff	proxycfg: unmethod hostnameEndpoints the method receiver can be replaced by the first argument. This will allow us to extract more from the state struct in the future.	2021-06-16 14:03:30 -04:00
Daniel Nephin	f2ae6cb47c	Remove duplicate import because two PRs crossed paths.	2021-06-16 13:19:54 -04:00
Daniel Nephin	b40174ccf2	Merge pull request #9466 from hashicorp/dnephin/proxycfg-state proxycfg: prepare state for split by kind	2021-06-16 13:14:26 -04:00
R.B. Boyer	38d4b75ab3	xds: fix flaky protocol tests (#10410 )	2021-06-16 11:57:43 -05:00
Freddy	3127dac0fb	Merge pull request #10404 from hashicorp/ingress-stats	2021-06-15 14:28:07 -06:00
R.B. Boyer	1645b6aafe	xds: adding more delta protocol tests (#10398 ) Fixes #10125	2021-06-15 15:21:07 -05:00
freddygv	bdb3af918c	Regen golden files	2021-06-15 14:18:25 -06:00
Freddy	0d05cbb105	Update agent/xds/listeners.go Co-authored-by: R.B. Boyer <4903+rboyer@users.noreply.github.com>	2021-06-15 14:09:26 -06:00
Freddy	0e417e006e	Omit empty tproxy config in JSON responses (#10402 )	2021-06-15 13:53:35 -06:00
Nitya Dhanushkodi	08ed3edf71	proxycfg: Ensure that endpoints for explicit upstreams in other datacenters are watched in transparent mode (#10391 ) Co-authored-by: Freddy Vallenilla <freddy@hashicorp.com>	2021-06-15 11:00:26 -07:00
freddygv	e07542211a	Remove unused param	2021-06-15 11:19:45 -06:00
Dhia Ayachi	4b75f15fb7	improve monitor performance (#10368 ) * remove flush for each write to http response in the agent monitor endpoint * fix race condition when we stop and start monitor multiple times, the doneCh is closed and never recover. * start log reading goroutine before adding the sink to avoid filling the log channel before getting a chance of reading from it * flush every 500ms to optimize log writing in the http server side. * add changelog file * add issue url to changelog * fix changelog url * Update changelog Co-authored-by: Daniel Nephin <dnephin@hashicorp.com> * use ticker to flush and avoid race condition when flushing in a different goroutine * stop the ticker when done Co-authored-by: Daniel Nephin <dnephin@hashicorp.com> * Revert "fix race condition when we stop and start monitor multiple times, the doneCh is closed and never recover." This reverts commit 1eeddf7a * wait for log consumer loop to start before registering the sink Co-authored-by: Daniel Nephin <dnephin@hashicorp.com>	2021-06-15 12:05:52 -04:00
freddygv	450ac44126	Update ingress gateway stats labeling In the absence of stats_tags to handle this pattern, when we pass "ingress_upstream.$port" as the stat_prefix, Envoy splits up that prefix and makes the port a part of the metric name. For example: - stat_prefix: ingress_upstream.8080 This leads to metric names like envoy_http_8080_no_route. Changing the stat_prefix to ingress_upstream_80880 yields the expected metric names such as envoy_http_no_route. Note that we don't encode the destination's name/ns/dc in this stat_prefix because for HTTP services ingress gateways use a single filter chain. Only cluster metrics are available on a per-upstream basis.	2021-06-15 08:52:18 -06:00
freddygv	5d7188e290	Update terminating gateway stats labeling This change makes it so that the stat prefix for terminating gateways matches that of connect proxies. By using the structure of "upstream.svc.ns.dc" we can extract labels for the destination service, namespace, and datacenter.	2021-06-15 08:52:18 -06:00
R.B. Boyer	8d5f81b460	xds: ensure that dependent xDS resources are reconfigured during primary type warming (#10381 ) Updates to a cluster will clear the associated endpoints, and updates to a listener will clear the associated routes. Update the incremental xDS logic to account for this implicit cleanup so that we can finish warming the clusters and listeners. Fixes #10379	2021-06-14 17:20:27 -05:00
Daniel Nephin	e36800cefa	Update metric name and handle the case where there is no active root CA.	2021-06-14 17:01:16 -04:00
Daniel Nephin	548796ae13	connect: emit a metric for the number of seconds until root CA expiration	2021-06-14 16:57:01 -04:00
Freddy	f399fd2add	Rename CatalogDestinationsOnly (#10397 ) CatalogDestinationsOnly is a passthrough that would enable dialing addresses outside of Consul's catalog. However, when this flag is set to true only _connect_ endpoints for services can be dialed. This flag is being renamed to signal that non-Connect endpoints can't be dialed by transparent proxies when the value is set to true.	2021-06-14 14:15:09 -06:00
Freddy	f19b1f0058	Relax validation for expose.paths config (#10394 ) Previously we would return an error if duplicate paths were specified. This could lead to problems in cases where a user has the same path, say /healthz, on two different ports. This validation was added to signal a potential misconfiguration. Instead we will only check for duplicate listener ports, since that is what would lead to ambiguity issues when generating xDS config. In the future we could look into using a single listener and creating distinct filter chains for each path/port.	2021-06-14 14:04:11 -06:00
Daniel Nephin	cbcc1a3a86	proxycfg: extract two types from state struct These two new struct types will allow us to make polymorphic handler for each kind, instad of having all the logic for each proxy kind on the state struct.	2021-06-10 17:42:17 -04:00
Daniel Nephin	b99da95e70	proxycfg: pass context around where it is needed context.Context should never be stored on a struct (as it says in the godoc) because it is easy to to end up with the wrong context when it is stored. Also see https://blog.golang.org/context-and-structs This change is also in preparation for splitting state into kind-specific handlers so that the implementation of each kind is grouped together.	2021-06-10 17:34:50 -04:00
Daniel Nephin	95315e9e06	http: add PrimaryDatacenter to the /v1/agent/self response This field is available in DebugConfig, but that field is not stable and could change at any time. The consul-k8s needs to be able to detect the primary DC for tests, so adding this field to the stable part of the API response.	2021-06-10 17:19:16 -04:00
Freddy	61ae2995b7	Add flag for transparent proxies to dial individual instances (#10329 )	2021-06-09 14:34:17 -06:00
Daniel Nephin	b5503223ae	submatview: add test cases for store.Get with timeout and no index Also set a more unique name for the serviceRequest.Type to prevent potential name conflicts in the future.	2021-06-08 18:04:38 -04:00
Daniel Nephin	450cce60a1	Merge pull request #10364 from hashicorp/dnephin/streaming-e2e-test submatview: and Store integration test with stream backend	2021-06-08 16:13:45 -04:00
Freddy	62facc1a04	Revert "Avoid adding original_dst filter when not needed" (#10365 )	2021-06-08 13:18:41 -06:00
Daniel Nephin	b8717966f1	submatview: and Store integration test with stream backend	2021-06-08 12:15:35 -04:00
Daniel Nephin	20f7a72792	stream: remove bufferItem.NextLink Both NextLink and NextNoBlock had the same logic, with slightly different return values. By adding a bool return value (similar to map lookups) we can remove the duplicate method.	2021-06-07 17:04:46 -04:00
Daniel Nephin	48f388f590	stream: fix a bug with creating a snapshot The head of the topic buffer was being ignored when creating a snapshot. This commit fixes the bug by ensuring that the head of the topic buffer is included in the snapshot before handing it off to the subscription.	2021-06-04 18:33:04 -04:00
Daniel Nephin	4fb6c5a137	submatview: fix a bug with Store.Get When info.Timeout is 0, it should have no timeout. Previously it was using a 0 duration timeout which caused it to return without waiting. This bug was masked by using a timeout in the tests. Removing the timeout caused the tests to fail.	2021-06-03 17:48:44 -04:00
Paul Ewing	e454a9aae0	usagemetrics: add cluster members to metrics API (#10340 ) This PR adds cluster members to the metrics API. The number of members per segment are reported as well as the total number of members. Tested by running a multi-node cluster locally and ensuring the numbers were correct. Also added unit test coverage to add the new expected gauges to existing test cases.	2021-06-03 08:25:53 -07:00
Daniel Nephin	0dfb7da610	grpc: fix a data race by using a static resolver We have seen test flakes caused by 'concurrent map read and map write', and the race detector reports the problem as well (prevent us from running some tests with -race). The root of the problem is the grpc expects resolvers to be registered at init time before any requests are made, but we were using a separate resolver for each test. This commit introduces a resolver registry. The registry is registered as the single resolver for the consul scheme. Each test uses the Authority section of the target (instead of the scheme) to identify the resolver that should be used for the test. The scheme is used for lookup, which is why it can no longer be used as the unique key. This allows us to use a lock around the map of resolvers, preventing the data race.	2021-06-02 11:35:38 -04:00
Daniel Nephin	2dcfe4a0d5	submatview: improve a couple comments	2021-06-01 17:49:31 -04:00
Dhia Ayachi	9f2f9ac3a5	make tests use a dummy node_name to avoid environment related failures (#10262 ) * fix tests to use a dummy nodeName and not fail when hostname is not a valid nodeName * remove conditional testing * add test when node name is invalid	2021-06-01 11:58:03 -04:00
Daniel Nephin	dcf80907a9	structs: fix cache keys So that requests are cached properly, and the cache does not return the wrong data for a request.	2021-05-31 17:22:16 -04:00
Daniel Nephin	857799cd56	structs: add two cache completeness tests types that implement cache.Request	2021-05-31 16:54:41 -04:00
Daniel Nephin	01790fbcb7	structs: improve the interface of assertCacheInfoKeyIsComplete	2021-05-31 16:54:41 -04:00
Daniel Nephin	9de439f66a	structs: Add more cache key tests	2021-05-31 16:54:40 -04:00
Dhia Ayachi	0c13f80d5a	RPC Timeout/Retries account for blocking requests (#8978 )	2021-05-27 17:29:43 -04:00
hc-github-team-consul-core	aad8acb6ad	auto-updated agent/uiserver/bindata_assetfs.go from commit 18190fb07	2021-05-27 15:00:34 +00:00
Dhia Ayachi	00f7e0772a	debug: remove the CLI check for debug_enabled (#10273 ) * debug: remove the CLI check for debug_enabled The API allows collecting profiles even debug_enabled=false as long as ACLs are enabled. Remove this check from the CLI so that users do not need to set debug_enabled=true for no reason. Also: - fix the API client to return errors on non-200 status codes for debug endpoints - improve the failure messages when pprof data can not be collected Co-Authored-By: Dhia Ayachi <dhia@hashicorp.com> * remove parallel test runs parallel runs create a race condition that fail the debug tests * Add changelog Co-authored-by: Daniel Nephin <dnephin@hashicorp.com>	2021-05-27 09:41:53 -04:00
hc-github-team-consul-core	c2dc8cf56b	auto-updated agent/uiserver/bindata_assetfs.go from commit ddee7afbb	2021-05-27 12:29:12 +00:00
Freddy	c61e2bbda7	Ensure passthrough clusters can be created (#10301 )	2021-05-26 15:05:14 -06:00
Freddy	7cfd7e9ec1	Avoid adding original_dst filter when not needed (#10302 )	2021-05-26 15:04:45 -06:00
Matt Keeler	7e4ea16149	Move some things around to allow for license updating via config reload The bulk of this commit is moving the LeaderRoutineManager from the agent/consul package into its own package: lib/gort. It also got a renaming and its Start method now requires a context. Requiring that context required updating a whole bunch of other places in the code.	2021-05-25 09:57:50 -04:00
Dhia Ayachi	6aa915db8d	upgrade golangci-lint to v1.40.1 (#10276 ) Also: fix linter issue detected with newer version	2021-05-24 22:22:37 -04:00
Matt Keeler	58b934133d	hcs-1936: Prepare for adding license auto-retrieval to auto-config in enterprise	2021-05-24 13:20:30 -04:00
Matt Keeler	82f5cb3f08	Preparation for changing where license management is done.	2021-05-24 10:19:31 -04:00
hc-github-team-consul-core	9f765b2582	auto-updated agent/uiserver/bindata_assetfs.go from commit 600f85753	2021-05-24 11:37:54 +00:00
hc-github-team-consul-core	5e3aa6c7ae	auto-updated agent/uiserver/bindata_assetfs.go from commit dd4a66808	2021-05-24 10:56:33 +00:00
Daniel Nephin	21f35ab863	Merge pull request #10272 from hashicorp/dnephin/backport-namespace-license-fix Backport some ent changes for serf tags	2021-05-21 12:31:34 -04:00
Matt Keeler	84c6c56578	Add OSS bits for supporting specifying the enterprise license via config	2021-05-20 16:11:33 -04:00
Daniel Nephin	f2cf586414	Refactor of serf feature flag tags. This refactor is to make it easier to see how serf feature flags are encoded as serf tags, and where those feature flags are read. - use constants for both the prefix and feature flag name. A constant makes it much easier for an IDE to locate the read and write location. - isolate the feature-flag encoding logic in the metadata package, so that the feature flag prefix can be unexported. Only expose a function for encoding the flags into tags. This logic is now next to the logic which reads the tags. - remove the duplicate `addEnterpriseSerfTags` functions. Both Client and Server structs had the same implementation. And neither implementation needed the method receiver.	2021-05-20 12:57:06 -04:00
Daniel Nephin	d9959ba811	Merge pull request #10200 from hashicorp/dnephin/backport-audit-log-config-changes config: backport audit log config changes from enterprise	2021-05-19 10:58:28 -04:00
hc-github-team-consul-core	dd040e13d6	auto-updated agent/uiserver/bindata_assetfs.go from commit 39302041e	2021-05-19 10:11:29 +00:00
Joshua Shanks	9e4051ec65	GH-8728 add raft default values	2021-05-18 14:51:14 -04:00
hc-github-team-consul-core	2730185ac6	auto-updated agent/uiserver/bindata_assetfs.go from commit 8301e79c5	2021-05-18 15:35:50 +00:00
hc-github-team-consul-core	2350928f7a	auto-updated agent/uiserver/bindata_assetfs.go from commit d1bbe0895	2021-05-17 12:32:31 +00:00
R.B. Boyer	7c9763d027	xds: emit a labeled gauge of connected xDS streams by version (#10243 ) Fixes #10099	2021-05-14 13:59:13 -05:00
R.B. Boyer	b90877b440	server: ensure that central service config flattening properly resets the state each time (#10239 ) The prior solution to call reply.Reset() aged poorly since newer fields were added to the reply, but not added to Reset() leading serial blocking query loops on the server to blend replies. This could manifest as a service-defaults protocol change from default=>http not reverting back to default after the config entry reponsible was deleted.	2021-05-14 10:21:44 -05:00
R.B. Boyer	c42899eafa	agent: ensure we hash the non-deprecated upstream fields on ServiceConfigRequest (#10240 )	2021-05-14 10:15:48 -05:00
hc-github-team-consul-core	2416a6ddde	auto-updated agent/uiserver/bindata_assetfs.go from commit 04bd57617	2021-05-13 10:42:23 +00:00
Iryna Shustava	7a41dbd9b6	Save exposed ports in agent's store and expose them via API (#10173 ) * Save exposed HTTP or GRPC ports to the agent's store * Add those the health checks API so we can retrieve them from the API * Change redirect-traffic command to also exclude those ports from inbound traffic redirection when expose.checks is set to true.	2021-05-12 13:51:39 -07:00

... 15 16 17 18 19 ...

4909 Commits