open-consul

Commit Graph

Author	SHA1	Message	Date
R.B. Boyer	5036b5e414	update changelog (#12495 )	2022-03-02 16:44:13 -06:00
R.B. Boyer	3804677570	server: suppress spurious blocking query returns where multiple config entries are involved (#12362 ) Starting from and extending the mechanism introduced in #12110 we can specially handle the 3 main special Consul RPC endpoints that react to many config entries in a single blocking query in Connect: - `DiscoveryChain.Get` - `ConfigEntry.ResolveServiceConfig` - `Intentions.Match` All of these will internally watch for many config entries, and at least one of those will likely be not found in any given query. Because these are blends of multiple reads the exact solution from #12110 isn't perfectly aligned, but we can tweak the approach slightly and regain the utility of that mechanism. ### No Config Entries Found In this case, despite looking for many config entries none may be found at all. Unlike #12110 in this scenario we do not return an empty reply to the caller, but instead synthesize a struct from default values to return. This can be handled nearly identically to #12110 with the first 1-2 replies being non-empty payloads followed by the standard spurious wakeup suppression mechanism from #12110. ### No Change Since Last Wakeup Once a blocking query loop on the server has completed and slept at least once, there is a further optimization we can make here to detect if any of the config entries that were present at specific versions for the prior execution of the loop are identical for the loop we just woke up for. In that scenario we can return a slightly different internal sentinel error and basically externally handle it similar to #12110. This would mean that even if 20 discovery chain read RPC handling goroutines wakeup due to the creation of an unrelated config entry, the only ones that will terminate and reply with a blob of data are those that genuinely have new data to report. ### Extra Endpoints Since this pattern is pretty reusable, other key config-entry-adjacent endpoints used by `agent/proxycfg` also were updated: - `ConfigEntry.List` - `Internal.IntentionUpstreams` (tproxy)	2022-02-25 15:46:34 -06:00
Chris S. Kim	aea00f10ae	Merge pull request #12442 from danieleva/12422-keyring Allows keyring operations on client agents	2022-02-25 16:28:56 -05:00
Daniele Vazzola	2cb1017e13	Adds changelog	2022-02-25 15:43:06 +00:00
Evan Culver	49a6109d96	Add changelog entry	2022-02-24 17:05:55 -08:00
Evan Culver	9f4d9f3f74	connect: Upgrade Envoy 1.20 to 1.20.2 (#12443 )	2022-02-24 16:19:39 -08:00
R.B. Boyer	a97d20cf63	catalog: compare node names case insensitively in more places (#12444 ) Many places in consul already treated node names case insensitively. The state store indexes already do it, but there are a few places that did a direct byte comparison which have now been corrected. One place of particular consideration is ensureCheckIfNodeMatches which is executed during snapshot restore (among other places). If a node check used a slightly different casing than the casing of the node during register then the snapshot restore here would deterministically fail. This has been fixed. Primary approach: git grep -i "node.[!=]=.node" -- ':!_test.go' ':!docs' git grep -i '\[[^]]member[^]]\] git grep -i '\[[^]]\(member\\|name\\|node\)[^]]\]' -- ':!_test.go' ':!website' ':!ui' ':!agent/proxycfg/testing.go:' ':!*.md'	2022-02-24 16:54:47 -06:00
R.B. Boyer	d860384731	server: partly fix config entry replication issue that prevents replication in some circumstances (#12307 ) There are some cross-config-entry relationships that are enforced during "graph validation" at persistence time that are required to be maintained. This means that config entries may form a digraph at times. Config entry replication procedes in a particular sorted order by kind and name. Occasionally there are some fixups to these digraphs that end up replicating in the wrong order and replicating the leaves (ingress-gateway) before the roots (service-defaults) leading to replication halting due to a graph validation error related to things like mismatched service protocol requirements. This PR changes replication to give each computed change (upsert/delete) a fair shot at being applied before deciding to terminate that round of replication in error. In the case where we've simply tried to do the operations in the wrong order at least ONE of the outstanding requests will complete in the right order, leading the subsequent round to have fewer operations to do, with a smaller likelihood of graph validation errors. This does not address all scenarios, but for scenarios where the edits are being applied in the wrong order this should avoid replication halting. Fixes #9319 The scenario that is NOT ADDRESSED by this PR is as follows: 1. create: service-defaults: name=new-web, protocol=http 2. create: service-defaults: name=old-web, protocol=http 3. create: service-resolver: name=old-web, redirect-to=new-web 4. delete: service-resolver: name=old-web 5. update: service-defaults: name=old-web, protocol=grpc 6. update: service-defaults: name=new-web, protocol=grpc 7. create: service-resolver: name=old-web, redirect-to=new-web If you shutdown dc2 just before (4) and turn it back on after (7) replication is impossible as there is no single edit you can make to make forward progress.	2022-02-23 17:27:48 -06:00
R.B. Boyer	ed1bc166e4	add changelog entry for enterprise only change (#12425 )	2022-02-23 14:23:48 -06:00
Daniel Nephin	3639f4b551	Merge pull request #11910 from hashicorp/dnephin/ca-provider-interface-for-ica-in-primary ca: add support for an external trusted CA	2022-02-22 13:14:52 -05:00
John Cowen	4ad8a0cfef	ui: Transition App Chrome to use new Disclosure Menus (#12334 ) * Add %panel CSS component * Deprecate old menu-panel component * Various smallish tweaks to disclosure-menu * Move all menus in the app chrome to use new DisclosureMenu * Follow up CSS to move all app chrome menus to new components * Don't prevent default any events from anchors * Add a tick to click steps	2022-02-21 12:22:59 +00:00
Evan Culver	067223337d	checks: populate interval and timeout when registering services (#11138 )	2022-02-18 12:05:33 -08:00
Kyle Havlovitz	9c03b5dc3d	Merge pull request #12385 from hashicorp/tproxy-http-upstream-fix xds: respect chain protocol on default discovery chain	2022-02-18 10:08:59 -08:00
John Cowen	b626e33f92	ui: Fixup displaying a Nspace default policy when expanding the preview pane (#12316 )	2022-02-18 17:22:05 +00:00
John Cowen	6e0eddd841	ui: Replace CollapsibleNotices with more a11y focussed Disclosure component (#12305 ) * Delete collapsible notices component and related helper * Add relative t action/helper to our Route component * Replace single use CollapsibleNotices with multi-use Disclosure	2022-02-18 17:16:03 +00:00
Evan Culver	bdb120f79d	ci: combine 'enhancement' entry type with 'improvement' (#12376 )	2022-02-17 19:21:47 -08:00
Daniel Nephin	0abaf29c10	ca: add test cases for rotating external trusted CA	2022-02-17 18:21:30 -05:00
Daniel Nephin	2d5254a73b	Merge pull request #12110 from hashicorp/dnephin/blocking-queries-not-found rpc: make blocking queries for non-existent items more efficient	2022-02-17 18:09:39 -05:00
Ashwin Venkatesh	39be071264	Parse datacenter from request (#12370 ) * Parse datacenter from request - Parse the value of the datacenter from the create/delete requests for AuthMethods and BindingRules so that they can be created in and deleted from the datacenters specified in the request.	2022-02-17 16:41:27 -05:00
Kyle Havlovitz	6b9eb17df1	Add changelog note	2022-02-17 12:17:12 -08:00
Florian Apolloner	895da50986	Support for connect native services in topology view. (#12098 )	2022-02-16 16:51:54 -05:00
Daniel Nephin	bdafa24c50	Make blockingQuery efficient with 'not found' results. By using the query results as state. Blocking queries are efficient when the query matches some results, because the ModifyIndex of those results, returned as queryMeta.Mindex, will never change unless the items themselves change. Blocking queries for non-existent items are not efficient because the queryMeta.Index can (and often does) change when other entities are written. This commit reduces the churn of these queries by using a different comparison for "has changed". Instead of using the modified index, we use the existence of the results. If the previous result was "not found" and the new result is still "not found", we know we can ignore the modified index and continue to block. This is done by setting the minQueryIndex to the returned queryMeta.Index, which prevents the query from returning before a state change is observed.	2022-02-15 18:24:33 -05:00
Daniel Nephin	5bd73fc218	debug: limit the size of the trace We've noticed that a trace that is captured over the full duration is too large to open on most machines. A trace.out captured over just the interval period (30s by default) should be a more than enough time to capture trace data.	2022-02-15 14:15:34 -05:00
R.B. Boyer	b216d52b66	server: conditionally avoid writing a config entry to raft if it was already the same (#12321 ) This will both save on unnecessary raft operations as well as unnecessarily incrementing the raft modify index of config entries subject to no-op updates.	2022-02-14 14:39:12 -06:00
R.B. Boyer	ef8cc33949	raft: update to v1.3.5 (#12325 ) This includes closing some leadership transfer gaps and adding snapshot restore progress logging.	2022-02-14 13:48:52 -06:00
R.B. Boyer	0b80f70a39	local: fixes a data race in anti-entropy sync (#12324 ) The race detector noticed this initially in `TestAgentConfigWatcherSidecarProxy` but it is not restricted to just tests. The two main changes here were: - ensure that before we mutate the internal `agent/local` representation of a Service (for tags or VIPs) we clone those fields - ensure that there's no function argument joint ownership between the caller of a function and the local state when calling `AddService`, `AddCheck`, and related using `copystructure` for now.	2022-02-14 10:41:33 -06:00
Mark Anderson	fa95afdcf6	Refactor to make ACL errors more structured. (#12308 ) * First phase of refactoring PermissionDeniedError Add extended type PermissionDeniedByACLError that captures information about the accessor, particular permission type and the object and name of the thing being checked. It may be worth folding the test and error return into a single helper function, that can happen at a later date. Signed-off-by: Mark Anderson <manderson@hashicorp.com>	2022-02-11 12:53:23 -08:00
John Cowen	ab3b765a88	ui: Make sure saving intentions from topology includes the partition (#12317 )	2022-02-11 13:58:01 +00:00
John Cowen	72a10582d0	ui: Stop ember-data overwriting SyncTimes (#12315 )	2022-02-11 13:54:46 +00:00
John Cowen	c6342969c5	ui: Exclude Service Health from Node listing page (#12248 ) This commit excludes the health of any service instances from the Node Listing page. This means that if you are viewing the Node listing page you will only see failing nodes if there are any Node Checks failing, Service Instance Health checks are no longer taken into account. Co-authored-by: Jamie White <jamie@jgwhite.co.uk>	2022-02-11 09:52:27 +00:00
Freddy	f45bec7779	Merge pull request #12223 from hashicorp/proxycfg/passthrough-cleanup	2022-02-10 17:35:51 -07:00
freddygv	88832f692a	Add changelog entry	2022-02-10 17:21:34 -07:00
Freddy	bb129384b7	Prevent xDS tight loop on cfg errors (#12195 )	2022-02-10 15:37:36 -07:00
Dhia Ayachi	de7598f064	fix race when starting a service while the agent `serviceManager` is … (#12302 ) * fix race when starting a service while the agent `serviceManager` is stopping * add changelog	2022-02-10 13:30:49 -05:00
John Cowen	d3e5d6edc4	ui: Ensure proxy instance health is taken into account in Service Instance Listings (#12279 ) We noticed that the Service Instance listing on both Node and Service views where not taking into account proxy instance health. This fixes that up so that the small health check information in each Service Instance row includes the proxy instances health checks when displaying Service Instance health (afterall if the proxy instance is unhealthy then so is the service instance that it should be proxying) * Refactor Consul::InstanceChecks with docs * Add to-hash helper, which will return an object keyed by a prop * Stop using/relying on ember-data type things, just use a hash lookup * For the moment add an equivalent "just give me proxies" model prop * Start stitching things together, this one requires an extra HTTP request ..previously we weren't even requesting proxies instances here * Finish up the stitching * Document Consul::ServiceInstance::List while I'm here * Fix up navigation mocks Name > Service	2022-02-10 15:28:26 +00:00
Daniel Nephin	db4675bd1a	Merge pull request #12277 from hashicorp/dnephin/panic-in-service-register catalog: initialize the refs map to prevent a nil panic	2022-02-09 19:48:22 -05:00
Daniel Nephin	6376141464	config-entry: fix a panic when registering a service or ingress gateway	2022-02-09 18:49:48 -05:00
R.B. Boyer	0cd0d505fa	xds: allow only one outstanding delta request at a time (#12236 ) Fixes #11876 This enforces that multiple xDS mutations are not issued on the same ADS connection at once, so that we can 100% control the order that they are applied. The original code made assumptions about the way multiple in-flight mutations were applied on the Envoy side that was incorrect.	2022-02-08 10:36:48 -06:00
claire labry	9d47cc8df1	Merge pull request #11956 from hashicorp/enable-security-scan Enable Security Scan for CRT	2022-02-04 13:13:24 -05:00
Daniel Nephin	cc2d1bc2e7	add changelog	2022-02-03 17:39:36 -05:00
Evan Culver	1908e98c66	Merge branch 'enable-security-scan' of github.com:hashicorp/consul into enable-security-scan	2022-02-02 17:32:17 -08:00
Evan Culver	0784d44a3c	Add changelog entry	2022-02-02 17:31:08 -08:00
John Cowen	404523f7ea	ui: Alias all our Structure Icons to Flight Icons (#12209 )	2022-02-02 13:24:47 +00:00
Daniel Nephin	ff64c13c3e	Merge pull request #12166 from hashicorp/dnephin/acl-resolve-token-2 acl: remove ResolveTokenToIdentity	2022-01-31 19:19:21 -05:00
Daniel Nephin	57eac90cae	acl: remove unused methods on fakes, and add changelog Also document the metric that was removed in a previous commit.	2022-01-31 17:53:53 -05:00
Dan Upton	ebdda4848f	streaming: split event buffer by key (#12080 )	2022-01-28 12:27:00 +00:00
R.B. Boyer	b999b3edfc	xds: fix for delta xDS reconnect bug in LDS/CDS (#12174 ) When a wildcard xDS type (LDS/CDS/SRDS) reconnects from a delta xDS stream, prior to envoy `1.19.0` it would populate the `ResourceNamesSubscribe` field with the full list of currently subscribed items, instead of simply omitting it to infer that it wanted everything (which is what wildcard mode means). This upstream issue was filed in envoyproxy/envoy#16063 and fixed in envoyproxy/envoy#16153 which went out in Envoy `1.19.0` and is fixed in later versions (later refactored in envoyproxy/envoy#16855). This PR conditionally forces LDS/CDS to be wildcard-only even when the connected Envoy requests a non-wildcard subscription, but only does so on versions prior to `1.19.0`, as we should not need to do this on later versions. This fixes the failure case as described here: #11833 (comment) Co-authored-by: Huan Wang <fredwanghuan@gmail.com>	2022-01-25 11:24:27 -06:00
Michele Degges	300ce7b8e7	Adding changelog entry	2022-01-24 12:32:22 -08:00
Ashwin Venkatesh	fb04a28140	Add support for 'Partition' and 'RetryJoin' (#12126 ) - Adding a 'Partition' and 'RetryJoin' command allows test cases where one would like to spin up a Consul Agent in a non-default partition to test use-cases that are common when enabling Admin Partition on Kubernetes.	2022-01-20 16:49:36 -05:00
Dan Upton	088ba2edaf	[OSS] Remove remaining references to master (#11827 )	2022-01-20 12:47:50 +00:00

1 2 3 4 5 ...

656 Commits