open-consul

Commit Graph

Author	SHA1	Message	Date
Daniel Nephin	07a33a1526	ca: accept only the cluster ID to SpiffeIDSigningForCluster To make it more obivous where ClusterID is used, and remove the need to create a struct when only one field is used.	2021-11-16 16:57:21 -05:00
Will Jordan	2e66b7a5e6	Update node info sync comment (#11465 )	2021-11-16 11:16:11 -08:00
R.B. Boyer	83bf7ab3ff	re-run gofmt on 1.17 (#11579 ) This should let freshly recompiled golangci-lint binaries using Go 1.17 pass 'make lint'	2021-11-16 12:04:01 -06:00
R.B. Boyer	086ff42b56	partitions: various refactors to support partitioning the serf LAN pool (#11568 )	2021-11-15 09:51:14 -06:00
freddygv	f33eae6fe1	Update proxycfg for ingress service partitions	2021-11-12 14:33:31 -07:00
freddygv	dc7ea2ef1e	Accept partition for ingress services	2021-11-12 14:33:14 -07:00
freddygv	5ac1ab359b	Move assertion to after config fetch	2021-11-10 10:50:08 -07:00
freddygv	2261d51515	Use ClusterID to check for readiness The TrustDomain is populated from the Host() method which includes the hard-coded "consul" domain. This means that despite having an empty cluster ID, the TrustDomain won't be empty.	2021-11-10 10:45:22 -07:00
freddygv	482d3bc610	Prevent replicating partition-exports	2021-11-09 16:42:42 -07:00
freddygv	739490df12	handle error scenario of empty local DC	2021-11-09 16:42:42 -07:00
freddygv	b9b41625b9	Restrict DC for partition-exports writes There are two restrictions: - Writes from the primary DC which explicitly target a secondary DC. - Writes to a secondary DC that do not explicitly target the primary DC. The first restriction is because the config entry is not supported in secondary datacenters. The second restriction is to prevent the scenario where a user writes the config entry to a secondary DC, the write gets forwarded to the primary, but then the config entry does not apply in the secondary. This makes the scope more explicit.	2021-11-09 16:42:42 -07:00
Freddy	eb2b40b22d	Update filter chain creation for sidecar/ingress listeners (#11245 ) The duo of `makeUpstreamFilterChainForDiscoveryChain` and `makeListenerForDiscoveryChain` were really hard to reason about, and led to concealing a bug in their branching logic. There were several issues here: - They tried to accomplish too much: determining filter name, cluster name, and whether RDS should be used. - They embedded logic to handle significantly different kinds of upstream listeners (passthrough, prepared query, typical services, and catch-all) - They needed to coalesce different data sources (Upstream and CompiledDiscoveryChain) Rather than handling all of those tasks inside of these functions, this PR pulls out the RDS/clusterName/filterName logic. This refactor also fixed a bug with the handling of [UpstreamDefaults](https://www.consul.io/docs/connect/config-entries/service-defaults#defaults). These defaults get stored as UpstreamConfig in the proxy snapshot with a DestinationName of "", since they apply to all upstreams. However, this wildcard destination name must not be used when creating the name of the associated upstream cluster. The coalescing logic in the original functions here was in some situations creating clusters with a `.` prefix, which is not a valid destination.	2021-11-09 14:43:51 -07:00
Kyle Havlovitz	14591de8d2	Merge pull request #11461 from deblasis/feature/empty_client_addr_warning config: warn the user if client_addr is empty	2021-11-09 09:37:38 -08:00
Daniel Upton	caa5b5a5a6	xds: prefer fed state gateway definitions if they're fresher (#11522 ) Fixes an issue described in #10132, where if two DCs are WAN federated over mesh gateways, and the gateway in the non-primary DC is terminated and receives a new IP address (as is commonly the case when running them on ephemeral compute instances) the primary DC is unable to re-establish its connection until the agent running on its own gateway is restarted. This was happening because we always preferred gateways discovered by the `Internal.ServiceDump` RPC (which would fail because there's no way to dial the remote DC) over those discovered in the federation state, which is replicated as long as the primary DC's gateway is reachable.	2021-11-09 16:45:36 +00:00
Freddy	0ad360fadf	Merge pull request #11514 from hashicorp/dnephin/ca-fix-secondary-init ca: properly handle the case where the secondary initializes after the primary	2021-11-08 17:16:16 -07:00
freddygv	e6622ab0ab	Avoid returning empty roots with uninitialized CA Currently getCARoots could return an empty object with an empty trust domain before the CA is initialized. This commit returns an error while there is no CA config or no trust domain. There could be a CA config and no trust domain because the CA config can be created in InitializeCA before initialization succeeds.	2021-11-08 16:51:49 -07:00
Dhia Ayachi	f61892393f	refactor session state store tables to use the new index pattern (#11525 ) * state: port KV and Tombstone tables to new pattern * go fmt'ed * handle wildcards for tombstones * Fix graveyard ent vs oss * fix oss compilation error * add partition to tombstones and kv state store indexes * refactor to use `indexWithEnterpriseIndexable` * Apply suggestions from code review Co-authored-by: Chris S. Kim <ckim@hashicorp.com> Co-authored-by: R.B. Boyer <4903+rboyer@users.noreply.github.com> * add `singleValueID` implementation assertions * partition `tableSessions` table * fix sessions to use UUID and fix prefix index * fix oss build * clean up unused functions * fix oss compilation * add a partition indexer for sessions * Fix oss to not have partition index * fix oss tests * remove unused func `prefixIndexFromServiceNameAsString` * fix test error check * remove unused operations_ent.go and operations_oss.go func * remove unused const Co-authored-by: Daniel Nephin <dnephin@hashicorp.com> Co-authored-by: Chris S. Kim <ckim@hashicorp.com> Co-authored-by: R.B. Boyer <4903+rboyer@users.noreply.github.com>	2021-11-08 16:20:50 -05:00
Dhia Ayachi	dfafd4e38c	KV refactoring, part 2 (#11512 ) * add partition to the kv get pretty print * fix failing test * add test for kvs RPC endpoint	2021-11-08 11:43:21 -05:00
Dhia Ayachi	17190c0076	KV state store refactoring and partitioning (#11510 ) * state: port KV and Tombstone tables to new pattern * go fmt'ed * handle wildcards for tombstones * Fix graveyard ent vs oss * fix oss compilation error * add partition to tombstones and kv state store indexes * refactor to use `indexWithEnterpriseIndexable` * partition kvs indexID table * add `partitionedIndexEntryName` in oss for test purpose * Apply suggestions from code review Co-authored-by: Chris S. Kim <ckim@hashicorp.com> Co-authored-by: R.B. Boyer <4903+rboyer@users.noreply.github.com> * add `singleValueID` implementation assertions * remove entmeta reference from oss Co-authored-by: R.B. Boyer <4903+rboyer@users.noreply.github.com> Co-authored-by: Daniel Nephin <dnephin@hashicorp.com> Co-authored-by: Chris S. Kim <ckim@hashicorp.com> Co-authored-by: R.B. Boyer <4903+rboyer@users.noreply.github.com>	2021-11-08 09:35:56 -05:00
Daniel Nephin	69ad7c0544	ca: Only initialize clusterID in the primary The secondary must get the clusterID from the primary	2021-11-05 18:08:44 -04:00
Daniel Nephin	3173582b75	ca: return an error when secondary fails to initialize Previously secondaryInitialize would return nil in this case, which prevented the deferred initialize from happening, and left the CA in an uninitialized state until a config update or root rotation. To fix this I extracted the common parts into the delegate implementation. However looking at this again, it seems like the handling in secondaryUpdateRoots is impossible, because that function should never be called before the secondary is initialzied. I beleive we can remove some of that logic in a follow up.	2021-11-05 18:02:51 -04:00
Daniel Nephin	db29ad346b	acl: remove id and revision from Policy constructors The fields were removed in a previous commit. Also remove an unused constructor for PolicyMerger	2021-11-05 15:45:08 -04:00
Daniel Nephin	617b11302f	acl: remove Policy.ID and Policy.Revision These two fields do not appear to be used anywhere. We use the structs.ACLPolicy ID in the ACLResolver cache, but the acl.Policy ID and revision are not used.	2021-11-05 15:43:52 -04:00
R.B. Boyer	1d8e7bb565	rename helper method to reflect the non-deprecated terminology (#11509 )	2021-11-05 13:51:50 -05:00
Connor	b3af482e09	Support Vault Namespaces explicitly in CA config (#11477 ) * Support Vault Namespaces explicitly in CA config If there is a Namespace entry included in the Vault CA configuration, set it as the Vault Namespace on the Vault client Currently the only way to support Vault namespaces in the Consul CA config is by doing one of the following: 1) Set the VAULT_NAMESPACE environment variable which will be picked up by the Vault API client 2) Prefix all Vault paths with the namespace Neither of these are super pleasant. The first requires direct access and modification to the Consul runtime environment. It's possible and expected, not super pleasant. The second requires more indepth knowledge of Vault and how it uses Namespaces and could be confusing for anyone without that context. It also infers that it is not supported * Add changelog * Remove fmt.Fprint calls * Make comment clearer * Add next consul version to website docs * Add new test for default configuration * go mod tidy * Add skip if vault not present * Tweak changelog text	2021-11-05 11:42:28 -05:00
R.B. Boyer	7fbf749bc4	segments: ensure that the serf_lan_allowed_cidrs applies to network segments (#11495 )	2021-11-04 17:17:19 -05:00
Mark Anderson	e9a0fa7d36	Remove some usage of md5 from the system (#11491 ) * Remove some usage of md5 from the system OSS side of https://github.com/hashicorp/consul-enterprise/pull/1253 This is a potential security issue because an attacker could conceivably manipulate inputs to cause persistence files to collide, effectively deleting the persistence file for one of the colliding elements. Signed-off-by: Mark Anderson <manderson@hashicorp.com>	2021-11-04 13:07:54 -07:00
FFMMM	9afecfa10c	plumb thru root cert tll to the aws ca provider (#11449 ) * plumb thru root cert ttl to the aws ca provider Signed-off-by: FFMMM <FFMMM@users.noreply.github.com> * Update .changelog/11449.txt Co-authored-by: Dhia Ayachi <dhia@hashicorp.com> Co-authored-by: Dhia Ayachi <dhia@hashicorp.com>	2021-11-04 12:19:08 -07:00
FFMMM	e7ffef54ee	fix aws pca certs (#11470 ) Signed-off-by: FFMMM <FFMMM@users.noreply.github.com>	2021-11-03 12:21:24 -07:00
Mathew Estafanous	508664440d	Convert (some) test endpoints to use ServeHTTP instead of direct calls to handlers. (#11445 )	2021-11-03 11:12:36 -04:00
FFMMM	27227c0fd2	add root_cert_ttl option for consul connect, vault ca providers (#11428 ) * add root_cert_ttl option for consul connect, vault ca providers Signed-off-by: FFMMM <FFMMM@users.noreply.github.com> * Apply suggestions from code review Co-authored-by: Chris S. Kim <ckim@hashicorp.com> * add changelog, pr feedback Signed-off-by: FFMMM <FFMMM@users.noreply.github.com> * Update .changelog/11428.txt, more docs Co-authored-by: Daniel Nephin <dnephin@hashicorp.com> * Update website/content/docs/agent/options.mdx Co-authored-by: Kyle Havlovitz <kylehav@gmail.com> Co-authored-by: Chris S. Kim <ckim@hashicorp.com> Co-authored-by: Daniel Nephin <dnephin@hashicorp.com> Co-authored-by: Kyle Havlovitz <kylehav@gmail.com>	2021-11-02 11:02:10 -07:00
Daniel Nephin	0ec2a804df	Merge pull request #10690 from tarat44/h2c-support-in-ping-checks add support for h2c in h2 ping health checks	2021-11-02 13:53:06 -04:00
Alessandro De Blasis	2b3f4efbab	config: warn the user if client_addr is empty if the provided value is empty string then the client services (DNS, HTTP, HTTPS, GRPC) are not listening and the user is not notified in any way about what's happening. Also, since a not provided client_addr defaults to 127.0.0.1, we make sure we are not getting unwanted warnings Signed-off-by: Alessandro De Blasis <alex@deblasis.net>	2021-11-01 22:47:20 +00:00
Daniel Nephin	00ed2b243f	Merge pull request #10771 from hashicorp/dnephin/emit-telemetry-metrics-immediately telemetry: improve cert expiry metrics	2021-11-01 18:31:03 -04:00
freddygv	ecccf22fd7	Exclude default partition from GatewayKey string This will behave the way we handle SNI and SPIFFE IDs, where the default partition is excluded. Excluding the default ensures that don't attempt to compare default.dc2 to dc2 in OSS.	2021-11-01 14:45:52 -06:00
freddygv	d944e6ae3a	Update GatewayKeys deduplication Federation states data is only keyed on datacenter, so it cannot be directly compared against keys for gateway groups.	2021-11-01 13:58:53 -06:00
freddygv	ce43e8cf99	Store GatewayKey in proxycfg snapshot for re-use	2021-11-01 13:58:53 -06:00
freddygv	51c888a41a	Update locality check in xds	2021-11-01 13:58:53 -06:00
freddygv	6657c88296	Update locality check in proxycfg	2021-11-01 13:58:53 -06:00
Daniel Nephin	c706bf135c	Merge pull request #11340 from hashicorp/dnephin/ca-manager-provider ca: split the Provider interface into Primary/Secondary	2021-11-01 14:11:15 -04:00
Daniel Nephin	eaaceedf31	Merge pull request #11338 from hashicorp/dnephin/ca-manager-isolate-secondary ca: clearly identify methods that are primary-only or secondary-only	2021-11-01 14:10:31 -04:00
Daniel Upton	a620b6be2e	Support Check-And-Set deletion of config entries (#11419 ) Implements #11372	2021-11-01 16:42:01 +00:00
Dhia Ayachi	4d763ef9e6	regenerate expired certs (#11462 ) * regenerate expired certs * add documentation to generate tests certificates	2021-11-01 11:40:16 -04:00
Jared Kirschner	6dfcbeceec	Merge pull request #11348 from kbabuadze/fix-answers-alt-domain Fix answers for alt domain	2021-10-29 17:09:20 -04:00
R.B. Boyer	d40d098321	agent: for various /v1/agent endpoints parse the partition parameter on the request (#11444 ) Also update the corresponding CLI commands to send the parameter appropriately. NOTE: Behavioral changes are not happening in this PR.	2021-10-28 16:44:38 -05:00
R.B. Boyer	017e9d5ae4	agent: add a clone function for duplicating the serf lan configuration (#11443 )	2021-10-28 16:11:26 -05:00
Daniel Nephin	a8d6392ab5	Add tests for cert expiry metrics	2021-10-28 14:38:57 -04:00
Daniel Nephin	503dee2d80	Merge pull request #10671 from hashicorp/dnephin/fix-subscribe-test-flake subscribe: improve TestSubscribeBackend_IntegrationWithServer_DeliversAllMessages	2021-10-28 12:57:09 -04:00
Evan Culver	b3c92f22b1	connect: Remove support for Envoy 1.16 (#11354 )	2021-10-27 18:51:35 -07:00
Evan Culver	98acbfa79c	connect: Add support for Envoy 1.20 (#11277 )	2021-10-27 18:38:10 -07:00
freddygv	3dd21023bc	Ensure partition-exports kind gets marshalled The api module has decoding functions that rely on 'kind' being present of payloads. This is so that we can decode into the appropriate api type for the config entry. This commit ensures that a static kind is marshalled in responses from Consul's api endpoints so that the api module can decode them.	2021-10-27 15:01:26 -06:00
Daniel Nephin	0a19d7fd76	agent: move agent tls metric monitor to a more appropriate place And add a test for it	2021-10-27 16:26:09 -04:00
Daniel Nephin	1b2144c982	telemetry: set cert expiry metrics to NaN on start So that followers do not report 0, which would make alerting difficult.	2021-10-27 15:19:25 -04:00
Daniel Nephin	a7fcf14c5c	telemetry: fix cert expiry metrics by removing labels These labels should be set by whatever process scrapes Consul (for prometheus), or by the agent that receives them (for datadog/statsd). We need to remove them here because the labels are part of the "metric key", so we'd have to pre-declare the metrics with the labels. We could do that, but that is extra work for labels that should be added from elsewhere. Also renames the closure to be more descriptive.	2021-10-27 15:19:25 -04:00
Daniel Nephin	4300daa2e6	telemetry: only emit leader cert expiry metrics on the servers	2021-10-27 15:19:25 -04:00
Daniel Nephin	9de725c17d	telemetry: prevent stale values from cert monitors Prometheus scrapes metrics from each process, so when leadership transfers to a different node the previous leader would still be reporting the old cached value. By setting NaN, I believe we should zero-out the value, so that prometheus should only consider the value from the new leader.	2021-10-27 15:19:25 -04:00
Daniel Nephin	616cc9b6f8	telemetry: improve cert expiry metrics Emit the metric immediately so that after restarting an agent, the new expiry time will be emitted. This is particularly important when this metric is being monitored, because we want the alert to resovle itself immediately. Also fixed a bug that was exposed in one of these metrics. The CARoot can be nil, so we have to handle that case.	2021-10-27 15:19:25 -04:00
Daniel Nephin	24951f0c7e	subscribe: attempt to fix a flaky test TestSubscribeBackend_IntegrationWithServer_DeliversAllMessages has been flaking a few times. This commit cleans up the test a bit, and improves the failure output. I don't believe this actually fixes the flake, but I'm not able to reproduce it reliably. The failure appears to be that the event with Port=0 is being sent in both the snapshot and as the first event after the EndOfSnapshot event. Hopefully the improved logging will show us if these are really duplicate events, or actually different events with different indexes.	2021-10-27 15:09:09 -04:00
Freddy	ae76144f55	Merge pull request #11435 from hashicorp/ent-authorizer-refactor [OSS] Export ACLs refactor	2021-10-27 13:04:40 -06:00
Freddy	520bda999b	Merge pull request #11432 from hashicorp/ap/exports-mgw [OSS] Update mesh gateways to handle partitions	2021-10-27 12:54:53 -06:00
freddygv	592965d61e	Rework acl exports interface	2021-10-27 12:50:39 -06:00
Freddy	9bbeea0432	Merge pull request #11433 from hashicorp/exported-service-acls [OSS] acl: Expand ServiceRead and NodeRead to account for partition exports	2021-10-27 12:48:08 -06:00
freddygv	05f91bd2b8	Update comments	2021-10-27 12:36:44 -06:00
Freddy	d8ae915160	Merge pull request #11431 from hashicorp/ap/exports-proxycfg [OSS] Update partitioned mesh gw handling for connect proxies	2021-10-27 11:27:43 -06:00
Freddy	8e23a6a0cc	Merge pull request #11416 from hashicorp/ap/exports-update Rename service-exports to partition-exports	2021-10-27 11:27:31 -06:00
freddygv	40271beb38	Fixup partitions assertion	2021-10-27 11:15:25 -06:00
freddygv	67412ac5e7	Fixup imports	2021-10-27 11:15:25 -06:00
freddygv	4de3537391	Split up locality check from hostname check	2021-10-27 11:15:25 -06:00
freddygv	9769b31641	Move the exportingpartitions constant to enterprise	2021-10-27 11:15:25 -06:00
freddygv	0391a65772	Replace default partition check	2021-10-27 11:15:25 -06:00
freddygv	ee45ac9dc5	PR comments	2021-10-27 11:15:25 -06:00
freddygv	f99946553a	Leave todo about default name	2021-10-27 11:15:25 -06:00
freddygv	9d375ad6d2	Add oss impl of registerEntCache	2021-10-27 11:15:25 -06:00
freddygv	183849416b	Register the ExportingPartitions cache type	2021-10-27 11:15:25 -06:00
freddygv	8b5a9369eb	Account for partitions in xds gen for mesh gw This commit avoids skipping gateways in remote partitions of the local DC when generating listeners/clusters/endpoints.	2021-10-27 11:15:25 -06:00
freddygv	d1d513b1b3	Account for partition in SNI for gateways	2021-10-27 11:15:25 -06:00
freddygv	4f0432be5e	Update xds pkg to account for GatewayKey	2021-10-27 09:03:56 -06:00
freddygv	f3f15640a9	Update mesh gateway proxy watches for partitions This commit updates mesh gateway watches for cross-partitions communication. * Mesh gateways are keyed by partition and datacenter. * Mesh gateways will now watch gateways in partitions that export services to their partition. * Mesh gateways in non-default partitions will not have cross-datacenter watches. They are not involved in traditional WAN federation.	2021-10-27 09:03:56 -06:00
freddygv	af662c8c1c	Avoid mixing named and unnamed params	2021-10-26 23:42:25 -06:00
freddygv	1de62bb0a2	Avoid passing nil config pointer	2021-10-26 23:42:25 -06:00
freddygv	4a2e40aa3c	Avoid panic on nil partitionAuthorizer config partitionAuthorizer.config can be nil if it wasn't provided on calls to newPartitionAuthorizer outside of the ACLResolver. This usage happens often in tests. This commit: adds a nil check when the config is going to be used, updates non-test usage of NewPolicyAuthorizerWithDefaults to pass a non-nil config, and dettaches setEnterpriseConf from the ACLResolver.	2021-10-26 23:42:25 -06:00
freddygv	015d85cd74	Update NodeRead for partition-exports When issuing cross-partition service discovery requests, ACL filtering often checks for NodeRead privileges. This is because the common return type is a CheckServiceNode, which contains node data.	2021-10-26 23:42:11 -06:00
Kyle Havlovitz	afb0976eac	acl: pass PartitionInfo through ent ACLConfig	2021-10-26 23:41:52 -06:00
Kyle Havlovitz	56d1858c4a	acl: Expand ServiceRead logic to look at service-exports for cross-partition	2021-10-26 23:41:32 -06:00
freddygv	4737ad118d	Swap in structs.EqualPartitions for cmp	2021-10-26 23:36:01 -06:00
freddygv	1bade08f91	Replace Split with SplitN	2021-10-26 23:36:01 -06:00
freddygv	3966677aaf	Finish removing useInDatacenter	2021-10-26 23:36:01 -06:00
freddygv	69476221c1	Update XDS for sidecars dialing through gateways	2021-10-26 23:35:48 -06:00
freddygv	ea311d2e47	Configure sidecars to watch gateways in partitions Previously the datacenter of the gateway was the key identifier, now it is the datacenter and partition. When dialing services in other partitions or datacenters we now watch the appropriate partition.	2021-10-26 23:35:37 -06:00
freddygv	feaebde1f1	Remove useInDatacenter from disco chain requests useInDatacenter was used to determine whether the mesh gateway mode of the upstream should be returned in the discovery chain target. This commit makes it so that the mesh gateway mode is returned every time, and it is up to the caller to decide whether mesh gateways should be watched or used.	2021-10-26 23:35:21 -06:00
R.B. Boyer	e27e58c6cc	agent: refactor the agent delegate interface to be partition friendly (#11429 )	2021-10-26 15:08:55 -05:00
Chris S. Kim	27f8a85664	agent: Ensure partition is considered in agent endpoints (#11427 )	2021-10-26 15:20:57 -04:00
Konstantine	2f9ee8e558	remove spaces	2021-10-26 12:38:13 -04:00
Konstantine	be14f6da90	fix altDomain responses for services where address is IP, added tests	2021-10-26 12:38:13 -04:00
Konstantine	eec9d66e22	fix encodeIPAsFqdn to return alt-domain when requested, added test case	2021-10-26 12:38:12 -04:00
Konstantine	9d6797a463	fixed altDomain response for NS type queries, and added test	2021-10-26 12:38:12 -04:00
Konstantine	0735e12412	edited TestDNS_AltDomains_Service to test responses for altDomains, and added TXT additional section check	2021-10-26 12:38:12 -04:00
Konstantine	8972e093d9	fixed alt-domain answer for SRV records, and TXT records in additional section	2021-10-26 12:38:12 -04:00
Chris S. Kim	3f736467e6	ui: Pass primary dc through to uiserver (#11317 ) Co-authored-by: John Cowen <johncowen@users.noreply.github.com>	2021-10-26 10:30:17 -04:00
freddygv	83d4d0e108	Remove outdated partition label from test	2021-10-25 18:47:02 -06:00

1 2 3 4 5 ...

3910 Commits