open-consul

Commit Graph

Author	SHA1	Message	Date
Daniel Nephin	ed1cc5f255	acl: remove ResolveTokenToIdentity By exposing the AccessorID from the primary ResolveToken method we can remove this duplication.	2022-01-22 14:47:59 -05:00
Daniel Nephin	26f0ebd96f	acl: return a resposne from ResolveToken that includes the ACLIdentity So that we can duplicate duplicate methods.	2022-01-22 14:33:09 -05:00
Daniel Nephin	314614f073	acl: remove duplicate methods Now that ACLResolver is embedded we don't need ResolveTokenToIdentity on Client and Server. Moving ResolveTokenAndDefaultMeta to ACLResolver removes the duplicate implementation.	2022-01-22 14:12:08 -05:00
Daniel Nephin	62c09b2d0a	acl: embed ACLResolver in Client and Server In preparation for removing duplicate resolve token methods.	2022-01-22 14:07:26 -05:00
R.B. Boyer	05c7373a28	bulk rewrite using this script set -euo pipefail unset CDPATH cd "$(dirname "$0")" for f in $(git grep '\brequire := require\.New(' \| cut -d':' -f1 \| sort -u); do echo "=== require: $f ===" sed -i '/require := require.New(t)/d' $f # require.XXX(blah) but not require.XXX(tblah) or require.XXX(rblah) sed -i 's/\brequire\.$[a-zA-Z0-9_]$($[^tr]$/require.\1(t,\2/g' $f # require.XXX(tblah) but not require.XXX(t, blah) sed -i 's/\brequire\.$[a-zA-Z0-9_]$($t[^,]$/require.\1(t,\2/g' $f # require.XXX(rblah) but not require.XXX(r, blah) sed -i 's/\brequire\.$[a-zA-Z0-9_]$($r[^,]$/require.\1(t,\2/g' $f gofmt -s -w $f done for f in $(git grep '\bassert := assert\.New(' \| cut -d':' -f1 \| sort -u); do echo "=== assert: $f ===" sed -i '/assert := assert.New(t)/d' $f # assert.XXX(blah) but not assert.XXX(tblah) or assert.XXX(rblah) sed -i 's/\bassert\.$[a-zA-Z0-9_]$($[^tr]$/assert.\1(t,\2/g' $f # assert.XXX(tblah) but not assert.XXX(t, blah) sed -i 's/\bassert\.$[a-zA-Z0-9_]$($t[^,]$/assert.\1(t,\2/g' $f # assert.XXX(rblah) but not assert.XXX(r, blah) sed -i 's/\bassert\.$[a-zA-Z0-9_]$($r[^,]$/assert.\1(t,\2/g' $f gofmt -s -w $f done	2022-01-20 10:46:23 -06:00
R.B. Boyer	c12b0ee3d2	test: normalize require.New and assert.New syntax	2022-01-20 10:45:56 -06:00
Dan Upton	088ba2edaf	[OSS] Remove remaining references to master (#11827 )	2022-01-20 12:47:50 +00:00
Daniel Nephin	59206e38c7	rpc: cleanup exit and blocking condition logic in blockingQuery Remove some unnecessary comments around query_blocking metric. The only line that needs any comments in the atomic decrement. Cleanup the block and return comments and logic. The old comment about AbandonCh may have been relevant before, but it is expected behaviour now. The logic was simplified by inverting the err condition.	2022-01-17 16:59:25 -05:00
Daniel Nephin	a28d1268cb	rpc: extract rpcQueryTimeout method This helps keep the logic in blockingQuery more focused. In the future we may have a separate struct for RPC queries which may allow us to move this off of Server.	2022-01-17 16:59:25 -05:00
Daniel Nephin	751bc2e7d3	rpc: move the index defaulting to setQueryMeta. This safeguard should be safe to apply in general. We are already applying it to non-blocking queries that call blockingQuery, so it should be fine to apply it to others.	2022-01-17 16:59:25 -05:00
Daniel Nephin	95e471052b	rpc: add subtests to blockingQuery test	2022-01-17 16:59:25 -05:00
Daniel Nephin	6bf8efe607	rpc: refactor blocking query To remove the TODO, and make it more readable. In general this reduces the scope of variables, making them easier to reason about. It also introduces more early returns so that we can see the flow from the structure of the function.	2022-01-17 16:58:47 -05:00
Daniel Nephin	1971a58b29	Merge pull request #11661 from hashicorp/dnephin/ca-remove-one-call-to-active-root ca: remove one call to Provider.ActiveRoot	2022-01-13 16:48:12 -05:00
Kyle Havlovitz	2ba76486d0	Add virtual IP generation for term gateway backed services	2022-01-12 12:08:49 -08:00
Daniel Nephin	262898e561	ca: remove unnecessary var, and slightly reduce cyclo complexity `newIntermediate` is always equal to `needsNewIntermediate`, so we can remove the extra variable and use the original directly. Also remove the `activeRoot.ID != newActiveRoot.ID` case from an if, because that case is already checked above, and `needsNewIntermediate` will already be true in that case. This condition now reads a lot better: > Persist a new root if we did not have one before, or if generated a new intermediate.	2022-01-06 16:56:49 -05:00
Daniel Nephin	d406f78c5c	ca: remove unused provider.ActiveRoot call In the previous commit the single use of this storedRoot was removed. In this commit the original objective is completed. The Provider.ActiveRoot is being removed because 1. the secondary should get the active root from the Consul primary DC, not the provider, so that secondary DCs do not need to communicate with a provider instance in a different DC. 2. so that the Provider.ActiveRoot interface can be changed without impacting other code paths.	2022-01-06 16:56:48 -05:00
Daniel Nephin	4d15e8a9ec	ca: extract the lookup of the active primary CA This method had only one caller, which always looked for the active root. This commit moves the lookup into the method to reduce the logic in the one caller. This is being done in preparation for a larger change. Keeping this separate so it is easier to see. The `storedRootID != primaryRoots.ActiveRootID` is being removed because these can never be different. The `storedRootID` comes from `provider.ActiveRoot`, the `primaryRoots.ActiveRootID` comes from the store `CARoot` from the primary. In both cases the source of the data is the primary DC. Technically they could be different if someone modified the provider outside of Consul, but that would break many things, so is not a supported flow. If these were out of sync because of ordering of events then the secondary will soon receive an update to `primaryRoots` and everything will be sorted out again.	2022-01-06 16:56:48 -05:00
Daniel Nephin	37b09df427	ca: update godoc To clarify what to expect from the data stored in this field, and the behaviour of this function.	2022-01-06 16:56:48 -05:00
Daniel Nephin	1f670c22f5	ca: remove one call to provider.ActiveRoot ActiveRoot should not be called from the secondary DC, because there should not be a requirement to run the same Vault instance in a secondary DC. SignIntermediate is called in a secondary DC, so it should not call ActiveRoot We would also like to change the interface of ActiveRoot so that we can support using an intermediate cert as the primary CA in Consul. In preparation for making that change I am reducing the number of calls to ActiveRoot, so that there are fewer code paths to modify when the interface changes. This change required a change to the mockCAServerDelegate we use in tests. It was returning the RootCert for SignIntermediate, but that is not an accurate fake of production. In production this would also be a separate cert.	2022-01-06 16:55:50 -05:00
Daniel Nephin	1f66120c20	ca: remove redundant append of an intermediate cert Immediately above this line we are already appending the full list of intermediates. The `provider.ActiveIntermediate` MUST be in this list of intermediates because it must be available to all the other non-leader Servers. If it was not in this list of intermediates then any proxy that received data from a non-leader would have the wrong certs. This is being removed now because we are planning on changing the `Provider.ActiveIntermediate` interface, and removing these extra calls ahead of time helps make that change easier.	2022-01-06 16:55:50 -05:00
Daniel Nephin	b66d259c1a	ca: only generate a single private key for the whole test case Using tracing and cpu profiling I found that the majority of the time in these test cases is spent generating a private key. We really don't need separate private keys, so we can generate only one and use it for all cases. With this change the test runs much faster.	2022-01-06 16:55:50 -05:00
Daniel Nephin	92a054cfa6	ca: cleanup a test Fix the name to match the function it is testing Remove unused code Fix the signature, instead of returning (error, string) which should be (string, error) accept a testing.T to emit errors. Handle the error from encode.	2022-01-06 16:55:49 -05:00
Daniel Nephin	9ec7e07db4	ca: use the new leaf signing lookup func in leader metrics	2022-01-06 16:55:49 -05:00
Daniel Nephin	4983c27703	snapshot: return the error from replyFn The only function passed to SnapshotRPC today always returns a nil error, so there's no way to exercise this bug in practice. This change is being made for correctness so that it doesn't become a problem in the future, if we ever pass a different function to SnapshotRPC.	2022-01-05 17:51:03 -05:00
Jared Kirschner	a9371f18e5	Clarify service and check error messages (use ID) Error messages related to service and check operations previously included the following substrings: - service %q - check %q From this error message, it isn't clear that the expected field is the ID for the entity, not the name. For example, if the user has a service named test, the error message would read 'Unknown service "test"'. This is misleading - a service with that name does exist, but not with that ID. The substrings above have been modified to make it clear that ID is needed, not name: - service with ID %q - check with ID %q	2022-01-04 11:42:37 -08:00
Chris S. Kim	d87fe70a82	testing: Revert assertion for virtual IP flag (#11932 )	2022-01-04 11:24:56 -05:00
Daniel Nephin	48d123e241	Merge pull request #11796 from hashicorp/dnephin/cleanup-test-server testing: stop using an old version in testServer	2021-12-22 16:04:04 -05:00
Freddy	f7eeffb98d	Use anonymousToken when querying by secret ID (#11813 ) Co-authored-by: Chris S. Kim <ckim@hashicorp.com> Co-authored-by: Dan Upton <daniel@floppy.co> This query has been incorrectly querying by accessor ID since New ACLs were added. However, the legacy token compat allowed this to continue to work, since it made a fallback query for the anonymousToken ID. PR #11184 removed this legacy token query, which means that the query by accessor ID is now the only check for the anonymous token's existence. This PR updates the GetBySecret call to use the secret ID of the token.	2021-12-13 10:56:09 -07:00
R.B. Boyer	a0156785dd	various partition related todos (#11822 )	2021-12-13 11:43:33 -06:00
Kyle Havlovitz	9187070a93	Merge pull request #11798 from hashicorp/vip-goroutine-check leader: move the virtual IP version check into a goroutine	2021-12-10 15:59:35 -08:00
Kyle Havlovitz	45402dad63	state: fix freed VIP table id index	2021-12-10 14:41:45 -08:00
Kyle Havlovitz	ccc119c549	Exit before starting the vip check routine if possible	2021-12-10 14:30:50 -08:00
Daniel Nephin	6444d1d4b3	testing: Deprecate functions for creating a server. These helper functions actually end up hiding important setup details that should be visible from the test case. We already have a convenient way of setting this config when calling newTestServerWithConfig.	2021-12-09 20:09:29 -05:00
Daniel Nephin	74e92316de	testing: remove old config.Build version DefaultConfig already sets the version to version.Version, so by removing this our tests will run with the version that matches the code.	2021-12-09 20:09:29 -05:00
Kyle Havlovitz	2a52630067	leader: move the virtual IP version check into a goroutine	2021-12-09 17:00:33 -08:00
Daniel Nephin	15c4de0c15	ca: prune some unnecessary lookups in the tests	2021-12-08 18:42:52 -05:00
Daniel Nephin	bf798094d5	ca: remove duplicate WaitFor function	2021-12-08 18:42:52 -05:00
Daniel Nephin	984986f007	ca: fix flakes in RenewIntermediate tests I suspect one problem was that we set structs.IntermediateCertRenewInterval to 1ms, which meant that in some cases the intermediate could renew before we stored the original value. Another problem was that the 'wait for intermediate' loop was calling the provider.ActiveIntermediate, but the comparison needs to use the RPC endpoint to accurately represent a user request. So changing the 'wait for' to use the state store ensures we don't race. Also moves the patching into a separate function. Removes the addition of ca.CertificateTimeDriftBuffer as part of calculating halfTime. This was added in a previous commit to attempt to fix the flake, but it did not appear to fix the problem. Adding the time here was making the tests fail when using the shared patch function. It's not clear to me why, but there's no reason we should be including this time in the halfTime calculation.	2021-12-08 18:42:52 -05:00
Daniel Nephin	bc7ec4455f	ca: improve RenewIntermediate tests Use the new verifyLearfCert to show the cert verifies with intermediates from both sources. This required using the RPC interface so that the leaf pem was constructed correctly. Add IndexedCARoots.Active since that is a common operation we see in a few places.	2021-12-08 18:42:52 -05:00
Daniel Nephin	0784073d5e	ca: add a test for Vault in secondary DC	2021-12-08 18:42:51 -05:00
Daniel Nephin	373f445db5	ca: Add CARoots.Active method Which will be used in the next commit.	2021-12-08 18:41:51 -05:00
R.B. Boyer	2f345cca33	acl: ensure that the agent recovery token is properly partitioned (#11782 )	2021-12-08 17:11:55 -06:00
Daniel Nephin	0f95a2c3b1	Merge pull request #11721 from hashicorp/dnephin/ca-export-fsm-operation ca: use the real FSM operation in tests	2021-12-08 17:49:00 -05:00
Daniel Nephin	be1ddc5942	ca: use the real FSM operation in tests Previously we had a couple copies that reproduced the FSM operation. These copies introduce risk that the test does not accurately match production. This PR removes the test versions of the FSM operation, and exports the real production FSM operation so that it can be used in tests. The consul provider tests did need to change because of this. Previously we would return a hardcoded value of 2, but in production this value is always incremented.	2021-12-08 17:29:44 -05:00
R.B. Boyer	957758cb61	test: test server should auto cleanup (#11779 )	2021-12-08 13:26:06 -06:00
Evan Culver	32a04317bf	rpc: Unset partition before forwarding to remote datacenter (#11758 )	2021-12-08 11:02:14 -08:00
Daniel Nephin	52c8b4994b	Merge remote-tracking branch 'origin/main' into serve-panic-recovery	2021-12-07 16:30:41 -05:00
Chris S. Kim	b74ddd7b70	Godocs updates for catalog endpoints (#11716 )	2021-12-07 10:18:28 -05:00
Dan Upton	8bc11b08dc	Rename `ACLMasterToken` => `ACLInitialManagementToken` (#11746 )	2021-12-07 12:39:28 +00:00
Dan Upton	0230ebb4ef	agent/token: rename `agent_master` to `agent_recovery` (internally) (#11744 )	2021-12-07 12:12:47 +00:00
R.B. Boyer	89e90d1ffc	return the max	2021-12-06 15:36:52 -06:00
freddygv	65875a7c69	Remove support for failover to partition Failing over to a partition is more siimilar to failing over to another datacenter than it is to failing over to a namespace. In a future release we should update how localities for failover are specified. We should be able to accept a list of localities which can include both partition and datacenter.	2021-12-06 12:32:24 -07:00
freddygv	a1c1e36be7	Allow cross-partition references in disco chain * Add partition fields to targets like service route destinations * Update validation to prevent cross-DC + cross-partition references * Handle partitions when reading config entries for disco chain * Encode partition in compiled targets	2021-12-06 12:32:19 -07:00
R.B. Boyer	5ea4b82940	light refactors to support making partitions and serf-based wan federation are mutually exclusive (#11755 )	2021-12-06 13:18:02 -06:00
Freddy	d86b98c503	Merge pull request #11739 from hashicorp/ap/exports-rename	2021-12-06 08:20:50 -07:00
freddygv	a2fd30e514	Clean up additional refs to partition exports	2021-12-04 15:16:40 -07:00
freddygv	02fb323652	Rename partition-exports to exported-services Using a name less tied to partitions gives us more flexibility to use this config entry in OSS for exports between datacenters/meshes.	2021-12-03 17:47:31 -07:00
freddygv	fcfed67246	Update intention topology to use new table	2021-12-03 17:28:31 -07:00
freddygv	4acbdc4618	Avoid updating default decision from wildcard ixn Given that we do not allow wildcard partitions in intentions, no one ixn can override the DefaultAllow setting. Only the default ACL policy applies across all partitions.	2021-12-03 17:28:12 -07:00
freddygv	142d8193e5	Add a new table to query service names by kind This table purposefully does not index by partition/namespace. It's a global view into all service names. This table is intended to replace the current serviceListTxn watch in intentionTopologyTxn. For cross-partition transparent proxying we need to be able to calculate upstreams from intentions in any partition. This means that the existing serviceListTxn function is insufficient since it's scoped to a partition. Moving away from that function is also beneficial because it watches the main "services" table, so watchers will wake up when any instance is registered or deregistered.	2021-12-03 17:28:12 -07:00
Freddy	3eddf98e62	Merge pull request #11680 from hashicorp/ap/partition-exports-oss	2021-12-03 16:57:50 -07:00
Dan Upton	2f4b8d7a7d	internal: support `ResultsFilteredByACLs` flag/header (#11643 )	2021-12-03 23:04:24 +00:00
Dan Upton	43e28a3af6	query: support `ResultsFilteredByACLs` in query list endpoint (#11620 )	2021-12-03 23:04:09 +00:00
Dhia Ayachi	e38ccf0a22	port oss changes (#11736 )	2021-12-03 17:23:55 -05:00
Freddy	3791d6d7da	Merge pull request #11720 from hashicorp/bbolt	2021-12-03 14:44:36 -07:00
Dan Upton	1d694df02b	fedstate: support `ResultsFilteredByACLs` in `ListMeshGateways` endpoint (#11644 )	2021-12-03 20:56:55 +00:00
Dan Upton	0489ea187d	catalog: support `ResultsFilteredByACLs` flag/header (#11594 )	2021-12-03 20:56:14 +00:00
Dan Upton	8bb1b89554	coordinate: support `ResultsFilteredByACLs` flag/header (#11617 )	2021-12-03 20:51:02 +00:00
Dan Upton	a62aa3847d	sessions: support `ResultsFilteredByACLs` flag/header (#11606 )	2021-12-03 20:43:43 +00:00
Dan Upton	0a7ba5162e	txn: support `ResultsFilteredByACLs` flag in `Read` endpoint (#11632 )	2021-12-03 20:41:03 +00:00
Dhia Ayachi	a8874c65f7	sessions partitioning tests (#11734 ) * state: port KV and Tombstone tables to new pattern * go fmt'ed * handle wildcards for tombstones * Fix graveyard ent vs oss * fix oss compilation error * add partition to tombstones and kv state store indexes * refactor to use `indexWithEnterpriseIndexable` * Apply suggestions from code review Co-authored-by: Chris S. Kim <ckim@hashicorp.com> Co-authored-by: R.B. Boyer <4903+rboyer@users.noreply.github.com> * add `singleValueID` implementation assertions * partition `tableSessions` table * fix sessions to use UUID and fix prefix index * fix oss build * clean up unused functions * fix oss compilation * add a partition indexer for sessions * Fix oss to not have partition index * fix oss tests * remove unused operations_ent.go and operations_oss.go func * remove unused const * convert `IndexID` of `session_checks` table * convert `indexSession` of `session_checks` table * convert `indexNodeCheck` of `session_checks` table * partition `indexID` and `indexSession` of `tableSessionChecks` * fix oss linter * fix review comments * remove partition for Checks as it's always use the session partition * fix tests * fix tests * do not namespace nodeChecks index Co-authored-by: Daniel Nephin <dnephin@hashicorp.com> Co-authored-by: Chris S. Kim <ckim@hashicorp.com> Co-authored-by: R.B. Boyer <4903+rboyer@users.noreply.github.com>	2021-12-03 15:36:07 -05:00
Dan Upton	b10e69ffda	intention: support `ResultsFilteredByACLs` flag/header (#11612 )	2021-12-03 20:35:54 +00:00
Mark Anderson	e8f542030e	Cross port of ent #1383 (#11726 ) Cross port of ent #1383 "Reject non-default datacenter when making partitioned ACLs" On the OSS side this is a minor refactor to add some more checks that are only applicable to enterprise code. Signed-off-by: Mark Anderson <manderson@hashicorp.com>	2021-12-03 10:20:25 -08:00
Dan Upton	1d571bb503	config: support `ResultsFilteredByACLs` in list/list all endpoints (#11621 )	2021-12-03 17:39:47 +00:00
Dan Upton	44bc833318	kv: support `ResultsFilteredByACLs` in list/list keys (#11593 )	2021-12-03 17:31:48 +00:00
Dan Upton	3ad8540d23	health: support `ResultsFilteredByACLs` flag/header (#11602 )	2021-12-03 17:31:32 +00:00
Dan Upton	0efe478044	Groundwork for exposing when queries are filtered by ACLs (#11569 )	2021-12-03 17:11:26 +00:00
Kyle Havlovitz	dbb58b726a	Merge pull request #11724 from hashicorp/service-virtual-ips oss: add virtual IP generation for connect services	2021-12-02 16:16:57 -08:00
Kyle Havlovitz	db88f95fbe	consul: add virtual IP generation for connect services	2021-12-02 15:42:47 -08:00
R.B. Boyer	6ec84cfbe2	agent: add variation of force-leave that exclusively works on the WAN (#11722 ) Fixes #6548	2021-12-02 17:15:10 -06:00
Matt Keeler	68e629a476	Emit raft-boltdb metrics	2021-12-02 16:56:15 -05:00
Daniel Nephin	8e2c71528f	config: add NoFreelistSync option # Conflicts: # agent/config/testdata/TestRuntimeConfig_Sanitize-enterprise.golden # agent/consul/server.go	2021-12-02 16:56:15 -05:00
Matt Keeler	1f49738167	Use raft-boltdb/v2	2021-12-02 16:56:15 -05:00
Daniel Nephin	fa32c78429	ca: set the correct SigningKeyID after config update with Vault provider The test added in this commit shows the problem. Previously the SigningKeyID was set to the RootCert not the local leaf signing cert. This same bug was fixed in two other places back in 2019, but this last one was missed. While fixing this bug I noticed I had the same few lines of code in 3 places, so I extracted a new function for them. There would be 4 places, but currently the InitializeCA flow sets this SigningKeyID in a different way, so I've left that alone for now.	2021-12-02 16:07:11 -05:00
Daniel Nephin	a0014e13fd	Merge pull request #11713 from hashicorp/dnephin/ca-test-names ca: make test naming consistent	2021-12-02 16:05:42 -05:00
Daniel Nephin	720d782225	Merge pull request #11671 from hashicorp/dnephin/ca-fix-storing-vault-intermediate ca: fix storing the leaf signing cert with Vault provider	2021-12-02 16:02:24 -05:00
Daniel Nephin	a0160f7426	Merge pull request #11677 from hashicorp/dnephin/freeport-interface sdk: use t.Cleanup in freeport and remove unnecessary calls	2021-12-02 15:58:41 -05:00
Daniel Nephin	c1cb77b829	ca: make test naming consistent While working on the CA system it is important to be able to run all the tests related to the system, without having to wait for unrelated tests. There are many slow and unrelated tests in agent/consul, so we need some way to filter to only the relevant tests. This PR renames all the CA system related tests to start with either `TestCAMananger` for tests of internal operations that don't have RPC endpoint, or `TestConnectCA` for tests of RPC endpoints. This allows us to run all the test with: go test -run 'TestCAMananger\|TestConnectCA' ./agent/consul The test naming follows an undocumented convention of naming tests as follows: Test[<struct name>_]<function name>[_<test case description>] I tried to always keep Primary/Secondary at the end of the description, and _Vault_ has to be in the middle because of our regex to run those tests as a separate CI job. You may notice some of the test names changed quite a bit. I did my best to identify the underlying method being tested, but I may have been slightly off in some cases.	2021-12-02 14:57:09 -05:00
Daniel Nephin	460f8919c9	ca: make getLeafSigningCertFromRoot safer As a method on the struct type this would not be safe to call without first checking c.isIntermediateUsedToSignLeaf. So for now, move this logic to the CAMananger, so that it is always correct.	2021-12-02 12:42:49 -05:00
Daniel Nephin	64532ef636	ca: fix stored CARoot representation with Vault provider We were not adding the local signing cert to the CARoot. This commit fixes that bug, and also adds support for fixing existing CARoot on upgrade. Also update the tests for both primary and secondary to be more strict. Check the SigningKeyID is correct after initialization and rotation.	2021-12-02 12:42:49 -05:00
Chris S. Kim	67eacee31e	ENT to OSS sync (#11703 )	2021-12-01 14:56:10 -05:00
R.B. Boyer	70b143ddc5	auto-config: ensure the feature works properly with partitions (#11699 )	2021-12-01 13:32:34 -06:00
Daniel Nephin	963a9819d0	ca: add some godoc and func for finding leaf signing cert This will be used in a follow up commit.	2021-11-30 18:36:41 -05:00
Daniel Nephin	056a52ba64	sdk/freeport: rename Port to GetOne For better consistency with GetN	2021-11-30 17:32:41 -05:00
Chris S. Kim	e9c661db7f	Refactor test helper (#11689 ) Allow custom ACL root tokens to be passed	2021-11-30 13:22:07 -05:00
Chris S. Kim	0ec67cc2d1	acl: Fill authzContext from token in Coordinate endpoints (#11688 )	2021-11-30 13:17:41 -05:00
freddygv	76146dfc5b	Move ent config test to ent file	2021-11-29 12:15:17 -07:00
Daniel Nephin	4f0d092c95	testing: remove unnecessary calls to freeport Previously we believe it was necessary for all code that required ports to use freeport to prevent conflicts. https://github.com/dnephin/freeport-test shows that it is actually save to use port 0 (`127.0.0.1:0`) as long as it is passed directly to `net.Listen`, and the listener holds the port for as long as it is needed. This works because freeport explicitly avoids the ephemeral port range, and port 0 always uses that range. As you can see from the test output of https://github.com/dnephin/freeport-test, the two systems never use overlapping ports. This commit converts all uses of freeport that were being passed directly to a net.Listen to use port 0 instead. This allows us to remove a bit of wrapping we had around httptest, in a couple places.	2021-11-29 12:19:43 -05:00
Daniel Nephin	20a8e11bf2	testing: use the new freeport interfaces	2021-11-27 15:39:46 -05:00
Daniel Nephin	2cf41e4dc8	go-sso: remove returnFunc now that freeport handles return	2021-11-27 15:29:38 -05:00
Daniel Nephin	772d8f7381	ca: clean up unnecessary raft.Apply response checking In d2ab767fef21244e9fe3b9887ea70fc177912381 raftApply was changed to handle this check in a single place, instad of having every caller check it. It looks like these few places were missed when I did that clean up. This commit removes the remaining resp.(error) checks, since they are all no-ops now.	2021-11-26 17:57:55 -05:00
Daniel Nephin	48954adfdc	Merge pull request #11339 from hashicorp/dnephin/ca-manager-isolate-secondary-2 ca: reduce use of state in the secondary	2021-11-26 14:41:45 -05:00
Daniel Nephin	8240286956	ca: remove state check in secondarySetPrimaryRoots This function is only ever called from operations that have already acquired the state lock, so checking the value of state can never fail. This change is being made in preparation for splitting out a separate type for the secondary logic. The state can't easily be shared, so really only the expored top-level functions should acquire the 'state lock'.	2021-11-26 14:14:47 -05:00
Daniel Nephin	877094e2fa	ca: remove actingSecondaryCA This commit removes the actingSecondaryCA field, and removes the stateLock around it. This field was acting as a proxy for providerRoot != nil, so replace it with that check instead. The two methods which called secondarySetCAConfigured already set the state, so checking the state again at this point will not catch runtime errors (only programming errors, which we can catch with tests). In general, handling state transitions should be done on the "entrypoint" methods where execution starts, not in every internal method. This is being done to remove some unnecessary references to c.state, in preparations for extracting types for primary/secondary.	2021-11-26 14:14:47 -05:00
Daniel Nephin	cd5f6b2dfb	ca: reduce consul provider backend interface a bit This makes it easier to fake, which will allow me to use the ConsulProvider as an 'external PKI' to test a customer setup where the actual root CA is not the root we use for the Consul CA. Replaces a call to the state store to fetch the clusterID with the clusterID field already available on the built-in provider.	2021-11-25 11:46:06 -05:00
Dhia Ayachi	f605689154	Partition/kv indexid sessions (#11639 ) * state: port KV and Tombstone tables to new pattern * go fmt'ed * handle wildcards for tombstones * Fix graveyard ent vs oss * fix oss compilation error * add partition to tombstones and kv state store indexes * refactor to use `indexWithEnterpriseIndexable` * Apply suggestions from code review Co-authored-by: Chris S. Kim <ckim@hashicorp.com> Co-authored-by: R.B. Boyer <4903+rboyer@users.noreply.github.com> * partition `tableSessions` table * fix sessions to use UUID and fix prefix index * fix oss build * clean up unused functions * fix oss compilation * add a partition indexer for sessions * Fix oss to not have partition index * fix oss tests * remove unused operations_ent.go and operations_oss.go func * convert `indexNodeCheck` of `session_checks` table * partition `indexID` and `indexSession` of `tableSessionChecks` * remove partition for Checks as it's always use the session partition * partition sessions index id table * fix rebase issues Co-authored-by: Daniel Nephin <dnephin@hashicorp.com> Co-authored-by: Chris S. Kim <ckim@hashicorp.com> Co-authored-by: R.B. Boyer <4903+rboyer@users.noreply.github.com>	2021-11-24 11:34:36 -05:00
Dhia Ayachi	b1c4be3da0	Partition session checks store (#11638 ) * state: port KV and Tombstone tables to new pattern * go fmt'ed * handle wildcards for tombstones * Fix graveyard ent vs oss * fix oss compilation error * add partition to tombstones and kv state store indexes * refactor to use `indexWithEnterpriseIndexable` * Apply suggestions from code review Co-authored-by: Chris S. Kim <ckim@hashicorp.com> Co-authored-by: R.B. Boyer <4903+rboyer@users.noreply.github.com> * add `singleValueID` implementation assertions * partition `tableSessions` table * fix sessions to use UUID and fix prefix index * fix oss build * clean up unused functions * fix oss compilation * add a partition indexer for sessions * Fix oss to not have partition index * fix oss tests * remove unused operations_ent.go and operations_oss.go func * remove unused const * convert `IndexID` of `session_checks` table * convert `indexSession` of `session_checks` table * convert `indexNodeCheck` of `session_checks` table * partition `indexID` and `indexSession` of `tableSessionChecks` * fix oss linter * fix review comments * remove partition for Checks as it's always use the session partition Co-authored-by: Daniel Nephin <dnephin@hashicorp.com> Co-authored-by: Chris S. Kim <ckim@hashicorp.com> Co-authored-by: R.B. Boyer <4903+rboyer@users.noreply.github.com>	2021-11-24 09:10:38 -05:00
Daniel Nephin	07a33a1526	ca: accept only the cluster ID to SpiffeIDSigningForCluster To make it more obivous where ClusterID is used, and remove the need to create a struct when only one field is used.	2021-11-16 16:57:21 -05:00
R.B. Boyer	83bf7ab3ff	re-run gofmt on 1.17 (#11579 ) This should let freshly recompiled golangci-lint binaries using Go 1.17 pass 'make lint'	2021-11-16 12:04:01 -06:00
R.B. Boyer	086ff42b56	partitions: various refactors to support partitioning the serf LAN pool (#11568 )	2021-11-15 09:51:14 -06:00
freddygv	5ac1ab359b	Move assertion to after config fetch	2021-11-10 10:50:08 -07:00
freddygv	2261d51515	Use ClusterID to check for readiness The TrustDomain is populated from the Host() method which includes the hard-coded "consul" domain. This means that despite having an empty cluster ID, the TrustDomain won't be empty.	2021-11-10 10:45:22 -07:00
freddygv	482d3bc610	Prevent replicating partition-exports	2021-11-09 16:42:42 -07:00
freddygv	739490df12	handle error scenario of empty local DC	2021-11-09 16:42:42 -07:00
freddygv	b9b41625b9	Restrict DC for partition-exports writes There are two restrictions: - Writes from the primary DC which explicitly target a secondary DC. - Writes to a secondary DC that do not explicitly target the primary DC. The first restriction is because the config entry is not supported in secondary datacenters. The second restriction is to prevent the scenario where a user writes the config entry to a secondary DC, the write gets forwarded to the primary, but then the config entry does not apply in the secondary. This makes the scope more explicit.	2021-11-09 16:42:42 -07:00
Freddy	0ad360fadf	Merge pull request #11514 from hashicorp/dnephin/ca-fix-secondary-init ca: properly handle the case where the secondary initializes after the primary	2021-11-08 17:16:16 -07:00
freddygv	e6622ab0ab	Avoid returning empty roots with uninitialized CA Currently getCARoots could return an empty object with an empty trust domain before the CA is initialized. This commit returns an error while there is no CA config or no trust domain. There could be a CA config and no trust domain because the CA config can be created in InitializeCA before initialization succeeds.	2021-11-08 16:51:49 -07:00
Dhia Ayachi	f61892393f	refactor session state store tables to use the new index pattern (#11525 ) * state: port KV and Tombstone tables to new pattern * go fmt'ed * handle wildcards for tombstones * Fix graveyard ent vs oss * fix oss compilation error * add partition to tombstones and kv state store indexes * refactor to use `indexWithEnterpriseIndexable` * Apply suggestions from code review Co-authored-by: Chris S. Kim <ckim@hashicorp.com> Co-authored-by: R.B. Boyer <4903+rboyer@users.noreply.github.com> * add `singleValueID` implementation assertions * partition `tableSessions` table * fix sessions to use UUID and fix prefix index * fix oss build * clean up unused functions * fix oss compilation * add a partition indexer for sessions * Fix oss to not have partition index * fix oss tests * remove unused func `prefixIndexFromServiceNameAsString` * fix test error check * remove unused operations_ent.go and operations_oss.go func * remove unused const Co-authored-by: Daniel Nephin <dnephin@hashicorp.com> Co-authored-by: Chris S. Kim <ckim@hashicorp.com> Co-authored-by: R.B. Boyer <4903+rboyer@users.noreply.github.com>	2021-11-08 16:20:50 -05:00
Dhia Ayachi	dfafd4e38c	KV refactoring, part 2 (#11512 ) * add partition to the kv get pretty print * fix failing test * add test for kvs RPC endpoint	2021-11-08 11:43:21 -05:00
Dhia Ayachi	17190c0076	KV state store refactoring and partitioning (#11510 ) * state: port KV and Tombstone tables to new pattern * go fmt'ed * handle wildcards for tombstones * Fix graveyard ent vs oss * fix oss compilation error * add partition to tombstones and kv state store indexes * refactor to use `indexWithEnterpriseIndexable` * partition kvs indexID table * add `partitionedIndexEntryName` in oss for test purpose * Apply suggestions from code review Co-authored-by: Chris S. Kim <ckim@hashicorp.com> Co-authored-by: R.B. Boyer <4903+rboyer@users.noreply.github.com> * add `singleValueID` implementation assertions * remove entmeta reference from oss Co-authored-by: R.B. Boyer <4903+rboyer@users.noreply.github.com> Co-authored-by: Daniel Nephin <dnephin@hashicorp.com> Co-authored-by: Chris S. Kim <ckim@hashicorp.com> Co-authored-by: R.B. Boyer <4903+rboyer@users.noreply.github.com>	2021-11-08 09:35:56 -05:00
Giulio Micheloni	10cdc0a5c8	Merge branch 'main' into serve-panic-recovery	2021-11-06 16:12:06 +01:00
Daniel Nephin	69ad7c0544	ca: Only initialize clusterID in the primary The secondary must get the clusterID from the primary	2021-11-05 18:08:44 -04:00
Daniel Nephin	3173582b75	ca: return an error when secondary fails to initialize Previously secondaryInitialize would return nil in this case, which prevented the deferred initialize from happening, and left the CA in an uninitialized state until a config update or root rotation. To fix this I extracted the common parts into the delegate implementation. However looking at this again, it seems like the handling in secondaryUpdateRoots is impossible, because that function should never be called before the secondary is initialzied. I beleive we can remove some of that logic in a follow up.	2021-11-05 18:02:51 -04:00
Daniel Nephin	db29ad346b	acl: remove id and revision from Policy constructors The fields were removed in a previous commit. Also remove an unused constructor for PolicyMerger	2021-11-05 15:45:08 -04:00
R.B. Boyer	1d8e7bb565	rename helper method to reflect the non-deprecated terminology (#11509 )	2021-11-05 13:51:50 -05:00
FFMMM	27227c0fd2	add root_cert_ttl option for consul connect, vault ca providers (#11428 ) * add root_cert_ttl option for consul connect, vault ca providers Signed-off-by: FFMMM <FFMMM@users.noreply.github.com> * Apply suggestions from code review Co-authored-by: Chris S. Kim <ckim@hashicorp.com> * add changelog, pr feedback Signed-off-by: FFMMM <FFMMM@users.noreply.github.com> * Update .changelog/11428.txt, more docs Co-authored-by: Daniel Nephin <dnephin@hashicorp.com> * Update website/content/docs/agent/options.mdx Co-authored-by: Kyle Havlovitz <kylehav@gmail.com> Co-authored-by: Chris S. Kim <ckim@hashicorp.com> Co-authored-by: Daniel Nephin <dnephin@hashicorp.com> Co-authored-by: Kyle Havlovitz <kylehav@gmail.com>	2021-11-02 11:02:10 -07:00
Daniel Nephin	00ed2b243f	Merge pull request #10771 from hashicorp/dnephin/emit-telemetry-metrics-immediately telemetry: improve cert expiry metrics	2021-11-01 18:31:03 -04:00
Daniel Nephin	eaaceedf31	Merge pull request #11338 from hashicorp/dnephin/ca-manager-isolate-secondary ca: clearly identify methods that are primary-only or secondary-only	2021-11-01 14:10:31 -04:00
Daniel Upton	a620b6be2e	Support Check-And-Set deletion of config entries (#11419 ) Implements #11372	2021-11-01 16:42:01 +00:00
Dhia Ayachi	4d763ef9e6	regenerate expired certs (#11462 ) * regenerate expired certs * add documentation to generate tests certificates	2021-11-01 11:40:16 -04:00
R.B. Boyer	017e9d5ae4	agent: add a clone function for duplicating the serf lan configuration (#11443 )	2021-10-28 16:11:26 -05:00
Daniel Nephin	0a19d7fd76	agent: move agent tls metric monitor to a more appropriate place And add a test for it	2021-10-27 16:26:09 -04:00
Daniel Nephin	1b2144c982	telemetry: set cert expiry metrics to NaN on start So that followers do not report 0, which would make alerting difficult.	2021-10-27 15:19:25 -04:00
Daniel Nephin	a7fcf14c5c	telemetry: fix cert expiry metrics by removing labels These labels should be set by whatever process scrapes Consul (for prometheus), or by the agent that receives them (for datadog/statsd). We need to remove them here because the labels are part of the "metric key", so we'd have to pre-declare the metrics with the labels. We could do that, but that is extra work for labels that should be added from elsewhere. Also renames the closure to be more descriptive.	2021-10-27 15:19:25 -04:00
Daniel Nephin	4300daa2e6	telemetry: only emit leader cert expiry metrics on the servers	2021-10-27 15:19:25 -04:00
Daniel Nephin	9de725c17d	telemetry: prevent stale values from cert monitors Prometheus scrapes metrics from each process, so when leadership transfers to a different node the previous leader would still be reporting the old cached value. By setting NaN, I believe we should zero-out the value, so that prometheus should only consider the value from the new leader.	2021-10-27 15:19:25 -04:00
Daniel Nephin	616cc9b6f8	telemetry: improve cert expiry metrics Emit the metric immediately so that after restarting an agent, the new expiry time will be emitted. This is particularly important when this metric is being monitored, because we want the alert to resovle itself immediately. Also fixed a bug that was exposed in one of these metrics. The CARoot can be nil, so we have to handle that case.	2021-10-27 15:19:25 -04:00
Daniel Nephin	24951f0c7e	subscribe: attempt to fix a flaky test TestSubscribeBackend_IntegrationWithServer_DeliversAllMessages has been flaking a few times. This commit cleans up the test a bit, and improves the failure output. I don't believe this actually fixes the flake, but I'm not able to reproduce it reliably. The failure appears to be that the event with Port=0 is being sent in both the snapshot and as the first event after the EndOfSnapshot event. Hopefully the improved logging will show us if these are really duplicate events, or actually different events with different indexes.	2021-10-27 15:09:09 -04:00
freddygv	592965d61e	Rework acl exports interface	2021-10-27 12:50:39 -06:00
Freddy	9bbeea0432	Merge pull request #11433 from hashicorp/exported-service-acls [OSS] acl: Expand ServiceRead and NodeRead to account for partition exports	2021-10-27 12:48:08 -06:00
Freddy	d8ae915160	Merge pull request #11431 from hashicorp/ap/exports-proxycfg [OSS] Update partitioned mesh gw handling for connect proxies	2021-10-27 11:27:43 -06:00
Freddy	8e23a6a0cc	Merge pull request #11416 from hashicorp/ap/exports-update Rename service-exports to partition-exports	2021-10-27 11:27:31 -06:00
freddygv	af662c8c1c	Avoid mixing named and unnamed params	2021-10-26 23:42:25 -06:00
freddygv	1de62bb0a2	Avoid passing nil config pointer	2021-10-26 23:42:25 -06:00
freddygv	4a2e40aa3c	Avoid panic on nil partitionAuthorizer config partitionAuthorizer.config can be nil if it wasn't provided on calls to newPartitionAuthorizer outside of the ACLResolver. This usage happens often in tests. This commit: adds a nil check when the config is going to be used, updates non-test usage of NewPolicyAuthorizerWithDefaults to pass a non-nil config, and dettaches setEnterpriseConf from the ACLResolver.	2021-10-26 23:42:25 -06:00
freddygv	015d85cd74	Update NodeRead for partition-exports When issuing cross-partition service discovery requests, ACL filtering often checks for NodeRead privileges. This is because the common return type is a CheckServiceNode, which contains node data.	2021-10-26 23:42:11 -06:00
Kyle Havlovitz	afb0976eac	acl: pass PartitionInfo through ent ACLConfig	2021-10-26 23:41:52 -06:00
Kyle Havlovitz	56d1858c4a	acl: Expand ServiceRead logic to look at service-exports for cross-partition	2021-10-26 23:41:32 -06:00
freddygv	3966677aaf	Finish removing useInDatacenter	2021-10-26 23:36:01 -06:00
freddygv	feaebde1f1	Remove useInDatacenter from disco chain requests useInDatacenter was used to determine whether the mesh gateway mode of the upstream should be returned in the discovery chain target. This commit makes it so that the mesh gateway mode is returned every time, and it is up to the caller to decide whether mesh gateways should be watched or used.	2021-10-26 23:35:21 -06:00
R.B. Boyer	e27e58c6cc	agent: refactor the agent delegate interface to be partition friendly (#11429 )	2021-10-26 15:08:55 -05:00
Chris S. Kim	27f8a85664	agent: Ensure partition is considered in agent endpoints (#11427 )	2021-10-26 15:20:57 -04:00
freddygv	c3e381b4c1	Rename service-exports to partition-exports Existing config entries prefixed by service- are specific to individual services. Since this config entry applies to partitions it is being renamed. Additionally, the Partition label was changed to Name because using Partition at the top-level and in the enterprise meta was leading to the enterprise meta partition being dropped by msgpack.	2021-10-25 17:58:48 -06:00
Daniel Nephin	f24bad2a52	Merge pull request #11232 from hashicorp/dnephin/acl-legacy-remove-docs acl: add docs and changelog for the removal of the legacy ACL system	2021-10-25 18:38:00 -04:00
Daniel Nephin	f7cdd210fe	Update agent/consul/acl_client.go Co-authored-by: Freddy <freddygv@users.noreply.github.com>	2021-10-25 17:25:14 -04:00
Daniel Nephin	732b841dd7	state: remove support for updating legacy ACL tokens	2021-10-25 17:25:14 -04:00
Daniel Nephin	76b007dacd	acl: remove init check for legacy anon token This token should always already be migrated from a previous version.	2021-10-25 17:25:14 -04:00
Daniel Nephin	8ae6ee4e36	acl: remove legacy parameter to ACLDatacenter It is no longer used now that legacy ACLs have been removed.	2021-10-25 17:25:14 -04:00
Daniel Nephin	d778113773	acl: remove ACLTokenTypeManagement	2021-10-25 17:25:14 -04:00
Daniel Nephin	88c6aeea34	acl: remove legacy arg to store.ACLTokenSet And remove the tests for legacy=true	2021-10-25 17:25:14 -04:00
Daniel Nephin	b31a7fc498	acl: remove EmbeddedPolicy This method is no longer. It only existed for legacy tokens, which are no longer supported.	2021-10-25 17:25:14 -04:00
Daniel Nephin	ceaa36f983	acl: remove tests for resolving legacy tokens The code for this was already removed, which suggests this is not actually testing what it claims. I'm guessing these are still resolving because the tokens are converted to non-legacy tokens?	2021-10-25 17:25:14 -04:00
Daniel Nephin	a46e3bd2fc	acl: stop replication on leadership lost It seems like this was missing. Previously this was only called by init of ACLs during an upgrade. Now that legacy ACLs are removed, nothing was calling stop. Also remove an unused method from client.	2021-10-25 17:24:12 -04:00
Daniel Nephin	15cd8c7ab8	Remove incorrect TODO	2021-10-25 17:20:06 -04:00
Daniel Nephin	589b238374	acl: move the legacy ACL struct to the one package where it is used It is now only used for restoring snapshots. We can remove it in phase 2.	2021-10-25 17:20:06 -04:00
Daniel Nephin	0ba5d0afcd	acl: remove most of the rest of structs/acl_legacy.go	2021-10-25 17:20:06 -04:00
FFMMM	6433a57d3c	fix autopilot_failure_tolerance, add autopilot metrics test case (#11399 ) Signed-off-by: FFMMM <FFMMM@users.noreply.github.com>	2021-10-25 10:55:59 -07:00
Dhia Ayachi	75f69a98a2	fix leadership transfer on leave suggestions (#11387 ) * add suggestions * set isLeader to false when leadership transfer succeed	2021-10-21 14:02:26 -04:00
Dhia Ayachi	2d1ac1f7d0	try to perform a leadership transfer when leaving (#11376 ) * try to perform a leadership transfer when leaving * add a changelog	2021-10-21 12:44:31 -04:00
Kyle Havlovitz	752a285552	Add new service-exports config entry	2021-10-20 12:24:18 -07:00
Giulio Micheloni	10814d934e	Merge branch 'main' of https://github.com/hashicorp/consul into hashicorp-main	2021-10-16 16:59:32 +01:00
R.B. Boyer	55dd52cb17	acl: small OSS refactors to help ensure that auth methods with namespace rules work with partitions (#11323 )	2021-10-14 15:38:05 -05:00
freddygv	f76fddb28e	Use stored entmeta to fill authzContext	2021-10-14 08:57:40 -06:00
freddygv	bdf3e951f8	Ensure partition is handled by auto-encrypt	2021-10-14 08:32:45 -06:00
Chris S. Kim	0a6d683c84	Update Intentions.List with partitions (#11299 )	2021-10-13 10:47:12 -04:00
Connor	2cd80e5f66	Merge pull request #11222 from hashicorp/clly/service-mesh-metrics Start tracking connect service mesh usage metrics	2021-10-11 14:35:03 -05:00
Connor Kelly	2119351f77	Replace fmt.Sprintf with function	2021-10-11 12:43:38 -05:00
Daniel Nephin	571acb872e	ca: extract primaryUpdateRootCA This function is only run when the CAManager is a primary. Extracting this function makes it clear which parts of UpdateConfiguration are run only in the primary and also makes the cleanup logic simpler. Instead of both a defer and a local var we can call the cleanup function in two places.	2021-10-10 15:26:55 -04:00
Daniel Nephin	a65594d8ec	ca: rename functions to use a primary or secondary prefix This commit renames functions to use a consistent pattern for identifying the functions that can only be called when the Manager is run as the primary or secondary. This is a step toward eventually creating separate types and moving these methods off of CAManager.	2021-10-10 15:26:55 -04:00
Daniel Nephin	20f0efd8c1	ca: make receiver variable name consistent Every other method uses c not ca	2021-10-10 15:26:55 -04:00
FFMMM	7f28301212	fix consul_autopilot_healthy metric emission (#11231 ) https://github.com/hashicorp/consul/issues/10730	2021-10-08 10:31:50 -07:00
Connor Kelly	38986d6371	Rename ConfigUsageEnterprise to EnterpriseConfigEntryUsage	2021-10-08 10:53:34 -05:00
Connor Kelly	76b3c4ed3c	Rename and prefix ConfigEntry in Usage table Rename ConfigUsage functions to ConfigEntry prefix ConfigEntry kinds with the ConfigEntry table name to prevent potential conflicts	2021-10-07 16:19:55 -05:00
Connor Kelly	0e39a7a333	Add connect specific prefix to Usage table Ensure that connect Kind's are separate from ConfigEntry Kind's to prevent miscounting	2021-10-07 16:16:23 -05:00
Daniel Nephin	51e498717f	docs: add notice that legacy ACLs have been removed. Add changelog Also remove a metric that is no longer emitted that was missed in a previous step.	2021-10-05 18:30:22 -04:00
Connor Kelly	f9ba7c39b5	Add changelog, website and metric docs Add changelog to document what changed. Add entry to telemetry section of the website to document what changed Add docs to the usagemetric endpoint to help document the metrics in code	2021-10-05 13:34:24 -05:00
Daniel Nephin	e03b7e4c68	Merge pull request #11182 from hashicorp/dnephin/acl-legacy-remove-upgrade acl: remove upgrade from legacy, start in non-legacy mode	2021-10-04 17:25:39 -04:00
Daniel Nephin	b9f0014d70	acl: remove updateEnterpriseSerfTags The only remaining caller is a test helper, and the tests don't use the enterprise gossip pools.	2021-10-04 17:01:51 -04:00
Daniel Nephin	5ac360b22d	Merge pull request #11126 from hashicorp/dnephin/acl-legacy-remove-resolve-and-get-policy acl: remove ACL.GetPolicy RPC endpoint and ACLResolver.resolveTokenLegacy	2021-10-04 16:29:51 -04:00
Connor Kelly	ed5693b537	Add metrics to count the number of service-mesh config entries	2021-10-04 14:50:17 -05:00
Connor Kelly	9c487389cf	Add metrics to count connect native service mesh instances This will add the counts of the service mesh instances tagged by whether or not it is connect native	2021-10-04 14:37:05 -05:00
Connor Kelly	8000ea45ca	Add metrics to count service mesh Kind instance counts This will add the counts of service mesh instances tagged by the different ServiceKind's.	2021-10-04 14:36:59 -05:00
Daniel Nephin	b6435259c3	acl: fix test failures caused by remocving legacy ACLs This commit two test failures: 1. Remove check for "in legacy ACL mode", the actual upgrade will be removed in a following commit. 2. Remove the early WaitForLeader in dc2, because with it the test was failing with ACL not found.	2021-10-01 18:03:10 -04:00
Dhia Ayachi	8bd52995d1	fix token list by auth method (#11196 ) * add tests to OIDC authmethod and fix entMeta when retrieving auth-methods * fix oss compilation error	2021-10-01 12:00:43 -04:00
Daniel Nephin	ec935a2486	acl: call stop for the upgrade goroutine when done TestAgentLeaks_Server was reporting a goroutine leak without this. Not sure if it would actually be a leak in production or if this is due to the test setup, but seems easy enough to call it this way until we remove legacyACLTokenUpgrade.	2021-09-29 17:36:43 -04:00
Daniel Nephin	0c077d0527	acl: only run startACLUpgrade once Since legacy ACL tokens can no longer be created we only need to run this upgrade a single time when leadership is estalbished.	2021-09-29 16:22:01 -04:00
Daniel Nephin	f21097beda	acl: remove reading of serf acl tags We no long need to read the acl serf tag, because servers are always either ACL enabled or ACL disabled. We continue to write the tag so that during an upgarde older servers will see the tag.	2021-09-29 15:45:11 -04:00
Daniel Nephin	b866e3c4f4	acl: fix test failure For some reason removing legacy ACL upgrade requires using an ACL token now for this WaitForLeader.	2021-09-29 15:21:30 -04:00
Daniel Nephin	ebb2388605	acl: remove legacy ACL upgrades from Server As part of removing the legacy ACL system	2021-09-29 15:19:23 -04:00
Daniel Nephin	41a97360ca	acl: fix test failures caused by remocving legacy ACLs This commit two test failures: 1. Remove check for "in legacy ACL mode", the actual upgrade will be removed in a following commit. 2. Use the root token in WaitForLeader, because without it the test was failing with ACL not found.	2021-09-29 15:15:50 -04:00
Daniel Nephin	b73b68d696	acl: remove ACL.GetPolicy endpoint and resolve legacy acls And all code that was no longer used once those two were removed.	2021-09-29 14:33:19 -04:00
Daniel Nephin	b8da06a34d	acl: remove ACL upgrading from Clients As part of removing the legacy ACL system ACL upgrading and the flag for legacy ACLs is removed from Clients. This commit also removes the 'acls' serf tag from client nodes. The tag is only ever read from server nodes. This commit also introduces a constant for the acl serf tag, to make it easier to track where it is used.	2021-09-29 14:02:38 -04:00
Daniel Nephin	33a5448604	Merge pull request #11136 from hashicorp/dnephin/acl-resolver-fix-default-authz acl: fix default Authorizer for down_policy extend-cache/async-cache	2021-09-29 13:45:12 -04:00
Daniel Nephin	2995ac61f2	acl: remove the last of the legacy FSM Replace it with an implementation that returns an error, and rename some symbols to use a Deprecated suffix to make it clear. Also remove the ACLRequest struct, which is no longer referenced.	2021-09-29 12:42:23 -04:00
Daniel Nephin	a8358f7575	acl: remove bootstrap-init FSM operation	2021-09-29 12:42:23 -04:00
Daniel Nephin	ea2e0ad2ec	acl: remove initializeLegacyACL from leader init	2021-09-29 12:42:23 -04:00
Daniel Nephin	4e36442583	acl: remove ACLDelete FSM command, and state store function These are no longer used now that ACL.Apply has been removed.	2021-09-29 12:42:23 -04:00
Daniel Nephin	7e37c9a765	acl: remove legacy field to ACLBoostrap	2021-09-29 12:42:23 -04:00
Daniel Nephin	d4c48a3f23	Merge pull request #11101 from hashicorp/dnephin/acl-legacy-remove-rpc-2 acl: remove legacy ACL.Apply RPC	2021-09-29 12:23:55 -04:00
Daniel Nephin	69a83aefcf	Merge pull request #11177 from hashicorp/dnephin/remove-entmeta-methods structs: remove EnterpriseMeta helper methods	2021-09-29 12:08:07 -04:00
Daniel Nephin	acb62aa896	Merge pull request #10986 from hashicorp/dnephin/acl-legacy-remove-rpc acl: remove legacy ACL RPC - part 1	2021-09-29 12:04:09 -04:00
Daniel Nephin	1bc07c5166	structs: rename the last helper method. This one gets used a bunch, but we can rename it to make the behaviour more obvious.	2021-09-29 11:48:38 -04:00
Daniel Nephin	93b3e110b6	structs: remove another helper We already have a helper funtion.	2021-09-29 11:48:03 -04:00
Chris S. Kim	3f79aaf509	Cleanup unnecessary normalizing method (#11169 )	2021-09-28 15:31:12 -04:00
Daniel Nephin	4ed9476a61	Merge pull request #11084 from krastin/krastin-autopilot-loggingtypo Fix a tiny typo in logging in autopilot.go	2021-09-28 15:11:11 -04:00
Daniel Nephin	30fe14eed3	acl: fix default authorizer for down_policy This was causing a nil panic because a nil authorizer is no longer valid after the cleanup done in https://github.com/hashicorp/consul/pull/10632.	2021-09-23 18:12:22 -04:00
Daniel Nephin	a6a7069ecf	Remove t.Parallel from TestACLResolver_DownPolicy These tests run in under 10ms, t.Parallel does nothing but slow them down and make failures harder to debug when one panics.	2021-09-23 18:12:22 -04:00
Dhia Ayachi	4505cb2920	Refactor table index acl phase 2 (#11133 ) * extract common methods from oss and ent * remove unreachable code * add missing normalize for binding rules * fix oss to use Query	2021-09-23 15:26:09 -04:00
Dhia Ayachi	ebe333b947	Refactor table index (#11131 ) * convert tableIndex to use the new pattern * make `indexFromString` available for oss as well * refactor `indexUpdateMaxTxn`	2021-09-23 11:06:23 -04:00
Daniel Nephin	3e6dc2a843	acl: remove ACL.Apply As part of removing the legacy ACL system.	2021-09-22 18:28:08 -04:00
Daniel Nephin	2ce64e2837	acl: made acl rules in tests slightly more specific When converting these tests from the legacy ACL system to the new RPC endpoints I initially changed most things to use _prefix rules, because that was equivalent to the old legacy rules. This commit modifies a few of those rules to be a bit more specific by replacing the _prefix rule with a non-prefix one where possible.	2021-09-22 18:24:56 -04:00
Mark Anderson	c87d57bfeb	partitions/authmethod-index work from enterprise (#11056 ) * partitions/authmethod-index work from enterprise Signed-off-by: Mark Anderson <manderson@hashicorp.com>	2021-09-22 13:19:20 -07:00
R.B. Boyer	ba13416b57	grpc: strip local ACL tokens from RPCs during forwarding if crossing datacenters (#11099 ) Fixes #11086	2021-09-22 13:14:26 -05:00
Connor	bc04a155fb	Merge pull request #11090 from hashicorp/clly/kv-usage-metrics Add KVUsage to consul state usage metrics	2021-09-22 11:26:56 -05:00
Connor Kelly	bfe6b64ca7	Strip out go 1.17 bits	2021-09-22 11:04:48 -05:00
Daniel Nephin	b40bdc9e98	acl: remove remaining tests that use ACL.Apply In preparation for removing ACL.Apply. Tests for ACL.Apply, ACL.GetPolicy, and ACL upgrades were removed because all 3 of those will be removed shortly. The forth test appears to be for the ACLResolver cache, so the test was moved to the correct test file, and the name was updated to make it obvious what is being tested.	2021-09-21 19:35:26 -04:00
Daniel Nephin	ab91d254a3	fsm: restore the legacy commands and emit a helpful error message.	2021-09-21 18:35:12 -04:00
Daniel Nephin	0180dd67ff	Convert tests to the new ACL system In preparation for removing ACL.Apply	2021-09-21 18:35:12 -04:00
Daniel Nephin	b639f47e3c	config: use the new ACL system in tests In preparation for removing ACL.Apply	2021-09-21 17:57:29 -04:00
Daniel Nephin	2702aecc27	catalog: use the new ACL system in tests In preparation for removing ACL.Apply	2021-09-21 17:57:29 -04:00
Daniel Nephin	ad9748adc3	acl: remove two commented out tests for legacy ACL replication They were commented out in 2018.	2021-09-21 17:57:29 -04:00
Daniel Nephin	5a31a2e167	acl: replace legacy Get and List RPCs with an error impl These endpoints are being removed as part of the legacy ACL system.	2021-09-21 17:57:29 -04:00
Daniel Nephin	26f3380688	acl: remove a couple legacy ACL operation constants structs.ACLForceSet was deprecated 4 years ago, it should be safe to remove now. ACLBootstrapNow was removed in a recent commit. While it is technically possible that a cluster with mixed version could still attempt a legacy boostrap, we documented that the legacy system was deprecated in 1.4, so no clusters that are being upgraded should be attempting a legacy boostrap.	2021-09-21 17:57:29 -04:00
Daniel Nephin	5493ff06cc	Merge pull request #10985 from hashicorp/dnephin/acl-legacy-remove-replication acl: remove legacy ACL replication	2021-09-21 17:56:54 -04:00
Connor	64852cd3e5	Apply suggestions from code review Co-authored-by: Matt Keeler <mkeeler@users.noreply.github.com>	2021-09-21 10:52:46 -05:00
Connor Kelly	973b7b5c78	Fix test	2021-09-20 13:44:43 -05:00
Connor Kelly	698fc291a9	Add KVUsage to consul state usage metrics This change will add the number of entries in the consul KV store to the already existing usage metrics.	2021-09-20 12:41:54 -05:00
Krastin Krastev	ba13dbf24c	Update autopilot.go Fixing a minuscule typo in logging	2021-09-20 14:40:58 +02:00
Freddy	f1b2ef30d1	Merge pull request #11071 from hashicorp/partitions/ixn-decisions	2021-09-16 15:18:23 -06:00
R.B. Boyer	7fa8f19077	acl: ensure the global management policy grants all necessary partition privileges (#11072 )	2021-09-16 15:53:10 -05:00
freddygv	b5a8935bb8	Default the partition in ixn check	2021-09-16 14:39:01 -06:00
freddygv	caafc1905e	Fixup test	2021-09-16 14:39:01 -06:00
freddygv	8a9bf3748c	Account for partitions in ixn match/decision	2021-09-16 14:39:01 -06:00
Jeff Widman	a8f396c55f	Bump `go-discover` to fix broken dep tree (#10898 )	2021-09-16 15:31:22 -04:00
R.B. Boyer	4e7b6888e3	acl: fix intention:*:write checks (#11061 ) This is a partial revert of #10793	2021-09-16 11:08:45 -05:00
Freddy	88627700d0	Merge pull request #11051 from hashicorp/partitions/fixes	2021-09-16 09:29:00 -06:00
Freddy	494764ee2d	acl: small resolver changes to account for partitions (#11052 ) Also refactoring the enterprise side of a test to make it easier to reason about.	2021-09-16 09:17:02 -05:00
freddygv	dc549eca30	Default partition in match endpoint	2021-09-15 17:23:52 -06:00
Mark Anderson	08b222cfc3	ACL Binding Rules table partitioning (#11044 ) * ACL Binding Rules table partitioning Signed-off-by: Mark Anderson <manderson@hashicorp.com>	2021-09-15 13:26:08 -07:00
Dhia Ayachi	25ea1a9276	use const instead of literals for `tableIndex` (#11039 )	2021-09-15 10:24:04 -04:00
Mark Anderson	ffe3806aaf	Refactor `indexAuthMethod` in `tableACLBindingRules` (#11029 ) * Port consul-enterprise #1123 to OSS Signed-off-by: Mark Anderson <manderson@hashicorp.com> * Fixup missing query field Signed-off-by: Mark Anderson <manderson@hashicorp.com> * change to re-trigger ci system Signed-off-by: Mark Anderson <manderson@hashicorp.com>	2021-09-15 09:34:19 -04:00
Dhia Ayachi	4992218676	convert expiration indexed in ACLToken table to use `indexerSingle` (#11018 ) * move intFromBool to be available for oss * add expiry indexes * remove dead code: `TokenExpirationIndex` * fix remove indexer `TokenExpirationIndex` * fix rebase issue	2021-09-13 14:37:16 -04:00
Dhia Ayachi	1f23bdf388	add locality indexer partitioning (#11016 ) * convert `Roles` index to use `indexerSingle` * split authmethod write indexer to oss and ent * add index locality * add locality unit tests * move intFromBool to be available for oss * use Bool func * refactor `aclTokenList` to merge func	2021-09-13 11:53:00 -04:00
Dhia Ayachi	3638825db8	convert `indexAuthMethod` index to use `indexerSingle` (#11014 ) * convert `Roles` index to use `indexerSingle` * fix oss build * split authmethod write indexer to oss and ent * add auth method unit tests	2021-09-10 16:56:56 -04:00
Paul Banks	3484d77b18	Fix enterprise discovery chain tests; Fix multi-level split merging	2021-09-10 21:11:00 +01:00
Paul Banks	5c6d27555b	Fix discovery chain test fixtures	2021-09-10 21:09:24 +01:00
Paul Banks	1dd1683ed9	Header manip for split legs plumbing	2021-09-10 21:09:24 +01:00
Dhia Ayachi	82b30f8020	convert `Roles` index to use `indexerMulti` (#11013 ) * convert `Roles` index to use `indexerMulti` * add role test in oss * fix oss to use the right index func * preallocate slice	2021-09-10 16:04:33 -04:00
Dhia Ayachi	569e18d002	convert indexPolicies in ACLTokens table to the new index (#11011 )	2021-09-10 14:57:37 -04:00
Dhia Ayachi	0d0edeec27	convert indexSecret to the new index (#11007 )	2021-09-10 09:10:11 -04:00
Dhia Ayachi	f0cbe25ca6	convert indexAccessor to the new index (#11002 )	2021-09-09 16:28:04 -04:00
Hans Hasselberg	24c6ce0be0	tls: consider presented intermediates during server connection tls handshake. (#10964 ) * use intermediates when verifying * extract connection state * remove useless import * add changelog entry * golint * better error * wording * collect errors * use SAN.DNSName instead of CommonName * Add test for unknown intermediate * improve changelog entry	2021-09-09 21:48:54 +02:00
Chris S. Kim	3fb797382b	Sync enterprise changes to oss (#10994 ) This commit updates OSS with files for enterprise-specific admin partitions feature work	2021-09-08 11:59:30 -04:00
Kyle Havlovitz	a7b5a5d1b4	Merge pull request #10984 from hashicorp/mesh-resource acl: adding a new mesh resource	2021-09-07 15:06:20 -07:00
Dhia Ayachi	96d7842118	partition dicovery chains (#10983 ) * partition dicovery chains * fix default partition for OSS	2021-09-07 16:29:32 -04:00
Daniel Nephin	4dd5bb8e3b	acl: remove legacy ACL replication	2021-09-03 12:42:06 -04:00
R.B. Boyer	4206f585f0	acl: adding a new mesh resource	2021-09-03 09:12:03 -04:00
Evan Culver	93f94ac24f	rpc: authorize raft requests (#10925 )	2021-08-26 15:04:32 -07:00
Chris S. Kim	86de20c975	ent->oss test fix (#10926 )	2021-08-26 14:06:49 -04:00
Chris S. Kim	efbdf7e117	api: expose upstream routing configurations in topology view (#10811 ) Some users are defining routing configurations that do not have associated services. This commit surfaces these configs in the topology visualization. Also fixes a minor internal bug with non-transparent proxy upstream/downstream references.	2021-08-25 15:20:32 -04:00
R.B. Boyer	6b5a58de50	acl: some acl authz refactors for nodes (#10909 )	2021-08-25 13:43:11 -05:00
R.B. Boyer	a84f5fa25d	grpc: ensure that streaming gRPC requests work over mesh gateway based wan federation (#10838 ) Fixes #10796	2021-08-24 16:28:44 -05:00
Giulio Micheloni	10b03c3f4e	Merge branch 'main' into serve-panic-recovery	2021-08-22 20:31:11 +02:00
Giulio Micheloni	465e9fecda	grpc, xds: recovery middleware to return and log error in case of panic 1) xds and grpc servers: 1.1) to use recovery middleware with callback that prints stack trace to log 1.2) callback turn the panic into a core.Internal error 2) added unit test for grpc server	2021-08-22 19:06:26 +01:00
R.B. Boyer	b6be94e7fa	fixing various bits of enterprise meta plumbing to be more correct (#10889 )	2021-08-20 14:34:23 -05:00
Dhia Ayachi	f766b6dff7	oss portion of ent #1069 (#10883 )	2021-08-20 12:57:45 -04:00
R.B. Boyer	d730298f59	state: partition the nodes.uuid and nodes.meta indexes as well (#10882 )	2021-08-19 16:17:59 -05:00
R.B. Boyer	61f1c01b83	agent: ensure that most agent behavior correctly respects partition configuration (#10880 )	2021-08-19 15:09:42 -05:00
R.B. Boyer	e565409c6a	state: partition the usage metrics subsystem (#10867 )	2021-08-18 09:27:15 -05:00
R.B. Boyer	1cef3c99c2	state: adjust streaming event generation to account for partitioned nodes (#10860 ) Also re-enabled some tests that had to be disabled in the prior PR.	2021-08-17 16:49:26 -05:00
R.B. Boyer	e50e13d2ab	state: partition nodes and coordinates in the state store (#10859 ) Additionally: - partitioned the catalog indexes appropriately for partitioning - removed a stray reference to a non-existent index named "node.checks"	2021-08-17 13:29:39 -05:00
Daniel Nephin	5a82859ee1	acl: small improvements to ACLResolver disable due to RPC error Remove the error return, so that not handling is not reported as an error by errcheck. It was returning the error passed as an arg unmodified so there is no reason to return the same value that was passed in. Remove the term upstreams to remove any confusion with the term used in service mesh. Remove the AutoDisable field, and replace it with the TTL value, using 0 to indicate the setting is turned off. Replace "not Before" with "After". Add some test coverage to show the behaviour is still correct.	2021-08-17 13:34:18 -04:00
Daniel Nephin	09ae0ab94a	acl: make ACLDisabledTTL a constant This field was never user-configurable. We always overwrote the value with 120s from NonUserSource. However, we also never copied the value from RuntimeConfig to consul.Config, So the value in NonUserSource was always ignored, and we used the default value of 30s set by consul.DefaultConfig. All of this code is an unnecessary distraction because a user can not actually configure this value. This commit removes the fields and uses a constant value instad. Someone attempting to set acl.disabled_ttl in their config will now get an error about an unknown field, but previously the value was completely ignored, so the new behaviour seems more correct. We have to keep this field in the AutoConfig response for backwards compatibility, but the value will be ignored by the client, so it doesn't really matter what value we set.	2021-08-17 13:34:18 -04:00
Daniel Nephin	a8bc964241	Fix test failures Tests only specified one of the fields, but in production we copy the value from a single place, so we can do the same in tests. The AutoConfig test broke because of the problem noticed in a previous commit. The DisabledTTL is not wired up properly so it reports 0s here. Changed the test to use an explicit value.	2021-08-17 13:32:52 -04:00
Daniel Nephin	0d69b49f41	config: remove ACLResolver settings from RuntimeConfig	2021-08-17 13:32:52 -04:00
Daniel Nephin	75baa22e64	acl: remove ACLResolver config fields from consul.Config	2021-08-17 13:32:52 -04:00
Daniel Nephin	454f62eacc	acl: replace ACLResolver.Config with its own struct This is step toward decoupling ACLResolver from the agent/consul package.	2021-08-17 13:32:52 -04:00
Daniel Nephin	be0358df02	acl: remove legacy bootstrap Return an explicit error from the RPC, and remove the flag from the HTTP API.	2021-08-17 13:10:00 -04:00
Daniel Nephin	4f54d9708c	acl: add some notes about removing legacy ACL system	2021-08-17 13:08:29 -04:00
Daniel Nephin	e4c6bee7e6	Merge pull request #10792 from hashicorp/dnephin/rename-authz-vars acl: use authz consistently as the variable name for an acl.Authorizer	2021-08-17 13:07:17 -04:00
Daniel Nephin	7f71a672f3	Merge pull request #10807 from hashicorp/dnephin/remove-acl-datacenter config: remove ACLDatacenter	2021-08-17 13:07:09 -04:00
Daniel Nephin	608b291565	acl: use authz consistently as the variable name for an acl.Authorizer Follow up to https://github.com/hashicorp/consul/pull/10737#discussion_r682147950 Renames all variables for acl.Authorizer to use `authz`. Previously some places used `rule` which I believe was an old name carried over from the legacy ACL system. A couple places also used authorizer. This commit also removes another couple of authorizer nil checks that are no longer necessary.	2021-08-17 12:14:10 -04:00
Daniel Nephin	364ef3d052	server: remove defaulting of PrimaryDatacenter The constructor for Server is not at all the appropriate place to be setting default values for a config struct that was passed in. In production this value is always set from agent/config. In tests we should set the default in a test helper.	2021-08-06 18:45:24 -04:00
Daniel Nephin	87fb26fd65	Merge pull request #10612 from bigmikes/acl-replication-fix acl: acl replication routine to report the last error message	2021-08-06 18:29:51 -04:00
Daniel Nephin	047abdd73c	acl: remove ACLDatacenter This field has been unnecessary for a while now. It was always set to the same value as PrimaryDatacenter. So we can remove the duplicate field and use PrimaryDatacenter directly. This change was made by GoLand refactor, which did most of the work for me.	2021-08-06 18:27:00 -04:00
Giulio Micheloni	5c34a48d45	String type instead of error type and changelog.	2021-08-06 22:35:27 +01:00
Daniel Nephin	9435118179	acl: remove Server.ResolveTokenIdentityAndDefaultMeta This method suffered from similar naming to a couple other methods on Server, and had not great re-use (2 callers). By copying a few of the lines into one of the callers we can move the implementation into the second caller. Once moved, we can see that ResolveTokenAndDefaultMeta is identical in both Client and Server, and likely should be further refactored, possibly into ACLResolver. This change is being made to make ACL resolution easier to trace.	2021-08-05 15:20:13 -04:00
Daniel Nephin	25f40de163	acl: remove Server.ResolveTokenToIdentityAndAuthorizer This method was an alias for ACLResolver.ResolveTokenToIdentityAndAuthorizer. By removing the method that does nothing the code becomes easier to trace.	2021-08-05 15:20:13 -04:00
Daniel Nephin	695963acb7	acl: recouple acl filtering from ACLResolver ACL filtering only needs an authorizer and a logger. We can decouple filtering from the ACLResolver by passing in the necessary logger. This change is being made in preparation for moving the ACLResolver into an acl package	2021-08-05 15:20:13 -04:00
Daniel Nephin	ba2f9a65d1	acl: remove unused error return filterACLWithAuthorizer could never return an error. This change moves us a little bit closer to being able to enable errcheck and catch problems caused by unhandled error return values.	2021-08-05 15:20:13 -04:00
Daniel Nephin	c80b9565e2	acl: rename acl.Authorizer vars to authz For consistency	2021-08-05 15:19:47 -04:00
Daniel Nephin	37c67cb280	acl: move vet functions These functions are moved to the one place they are called to improve code locality. They are being moved out of agent/consul/acl.go in preparation for moving ACLResolver to an acl package.	2021-08-05 15:19:24 -04:00
Daniel Nephin	c8eedabc7c	acl: move vetRegisterWithACL and vetDeregisterWithACL These functions are used in only one place. Move the functions next to their one caller to improve code locality. This change is being made in preparation for moving the ACLResolver into an acl package. The moved functions were previously in the same file as the ACLResolver. By moving them out of that file we may be able to move the entire file with fewer modifications.	2021-08-05 15:17:54 -04:00
Daniel Nephin	b223c2bc25	Merge pull request #10770 from hashicorp/dnephin/log-cert-expiration telemetry: add log message when certs are about to expire	2021-08-05 15:17:20 -04:00
Daniel Nephin	c866f1041a	Merge pull request #10793 from hashicorp/dnephin/acl-intentions acl: small cleanup of a couple Authorization flows	2021-08-05 15:16:49 -04:00
Dhia Ayachi	40baf98159	defer setting the state before returning to avoid stuck in `INITIALIZING` state (#10630 ) * defer setting the state before returning to avoid being stuck in `INITIALIZING` state * add changelog * move comment with the right if statement * ca: report state transition error from setSTate * update comment to reflect state transition Co-authored-by: Daniel Nephin <dnephin@hashicorp.com>	2021-08-05 14:51:19 -04:00
Daniel Nephin	79ab48ef81	Merge pull request #10768 from hashicorp/dnephin/agent-tls-cert-expiration-metric telemetry: add Agent TLS Certificate expiration metric	2021-08-04 18:42:02 -04:00
Daniel Nephin	0ca9e875e2	acl: remove special handling of services in txn_endpoint Follow up to: https://github.com/hashicorp/consul/pull/10738#discussion_r680190210 Previously we were passing an Authorizer that would always allow the operation, then later checking the authorization using vetServiceTxnOp. On the surface this seemed strange, but I think it was actually masking a bug as well. Over time `servicePreApply` was changed to add additional authorization for `service.Proxy.DestinationServiceName`, but because we were passing a nil Authorizer, that authorization was not handled on the txn_endpoint. `TxnServiceOp.FillAuthzContext` has some special handling in enterprise, so we need to make sure to continue to use that from the Txn endpoint. This commit removes the `vetServiceTxnOp` function, and passes in the `FillAuthzContext` function so that `servicePreApply` can be used by both the catalog and txn endpoints. This should be much less error prone and prevent bugs like this in the future.	2021-08-04 18:32:20 -04:00
Daniel Nephin	3dc113ada6	Merge pull request #10738 from hashicorp/dnephin/remove-authorizer-nil-checks-2 acl: remove the last of the authz == nil checks	2021-08-04 17:41:40 -04:00
Daniel Nephin	2e9aa91256	Merge pull request #10737 from hashicorp/dnephin/remove-authorizer-nil-checks acl: remove authz == nil checks	2021-08-04 17:39:34 -04:00
Daniel Nephin	210a850353	telemetry: add log message when certs are about to expire	2021-08-04 14:18:59 -04:00
Daniel Nephin	13aa7b70d5	telemetry: fix a couple bugs in cert expiry metrics 1. do not emit the metric if Query fails 2. properly check for PrimaryUsersIntermediate, the logic was inverted Also improve the logging by including the metric name in the log message	2021-08-04 13:51:44 -04:00
Daniel Nephin	1673b3a68c	telemetry: add a metric for agent TLS cert expiry	2021-08-04 13:51:44 -04:00
Dhia Ayachi	6ed6966a1f	fix state index for `CAOpSetRootsAndConfig` op (#10675 ) * fix state index for `CAOpSetRootsAndConfig` op * add changelog * Update changelog Co-authored-by: Daniel Nephin <dnephin@hashicorp.com> * remove the change log as it's not needed Co-authored-by: Daniel Nephin <dnephin@hashicorp.com>	2021-08-04 13:07:49 -04:00
Daniel Nephin	953c9bee4f	acl: Remove the remaining authz == nil checks These checks were a bit more involved. They were previously skipping some code paths when the authorizer was nil. After looking through these it seems correct to remove the authz == nil check, since it will never evaluate to true.	2021-07-30 14:55:35 -04:00
Daniel Nephin	e4821a58ee	acl: remove acl == nil checks	2021-07-30 14:28:19 -04:00
Daniel Nephin	fbaeac9ecf	acl: remove authz == nil checks These case are already impossible conditions, because most of these functions already start with a check for ACLs being disabled. So the code path being removed could never be reached. The one other case (ConnectAuthorized) was already changed in a previous commit. This commit removes an impossible branch because authz == nil can never be true.	2021-07-30 13:58:35 -04:00
Daniel Nephin	b6d9d0d9f7	acl: remove many instances of authz == nil	2021-07-30 13:58:35 -04:00
Daniel Nephin	2503f27a36	acl: remove rule == nil checks	2021-07-30 13:58:35 -04:00
Daniel Nephin	9b41e7287f	acl: use acl.ManangeAll when ACLs are disabled Instead of returning nil and checking for nilness Removes a bunch of nil checks, and fixes one test failures.	2021-07-30 12:58:24 -04:00
Freddy	b136b1795a	Reset root prune interval after TestLeader_CARootPruning completes #10645 Co-authored-by: Daniel Nephin <dnephin@hashicorp.com>	2021-07-26 15:43:40 -06:00
R.B. Boyer	c271976445	state: refactor some node/coordinate state store functions to take an EnterpriseMeta (#10687 ) Note the field is not used yet.	2021-07-23 13:42:23 -05:00
R.B. Boyer	254557a1f6	sync changes to oss files made in enterprise (#10670 )	2021-07-22 13:58:08 -05:00
R.B. Boyer	62ac98b564	agent/structs: add a bunch more EnterpriseMeta helper functions to help with partitioning (#10669 )	2021-07-22 13:20:45 -05:00
Dhia Ayachi	b725605fe4	config raft apply silent error (#10657 ) * return an error when the index is not valid * check response as bool when applying `CAOpSetConfig` * remove check for bool response * fix error message and add check to test * fix comment * add changelog	2021-07-22 10:32:27 -04:00
Daniel Nephin	db29c51cd2	acl: use SetHash consistently in testPolicyForID A previous commit used SetHash on two of the cases to fix a data race. This commit applies that change to all cases. Using SetHash in this test helper should ensure that the test helper behaves closer to production.	2021-07-16 17:59:56 -04:00
Giulio Micheloni	3a1afd8f57	acl: fix error type into a string type for serialization issue acl_endpoint_test.go:507: Error Trace: acl_endpoint_test.go:507 retry.go:148 retry.go:149 retry.go:103 acl_endpoint_test.go:504 Error: Received unexpected error: codec.decoder: decodeValue: Cannot decode non-nil codec value into nil error (1 methods) Test: TestACLEndpoint_ReplicationStatus	2021-07-15 11:31:44 +02:00
Daniel Nephin	27871498f0	Fix a data race in TestACLResolver_Client By setting the hash when we create the policy. ``` WARNING: DATA RACE Read at 0x00c0028b4b10 by goroutine 1182: github.com/hashicorp/consul/agent/structs.(ACLPolicy).SetHash() /home/daniel/pers/code/consul/agent/structs/acl.go:701 +0x40d github.com/hashicorp/consul/agent/structs.ACLPolicies.resolveWithCache() /home/daniel/pers/code/consul/agent/structs/acl.go:779 +0xfe github.com/hashicorp/consul/agent/structs.ACLPolicies.Compile() /home/daniel/pers/code/consul/agent/structs/acl.go:809 +0xf1 github.com/hashicorp/consul/agent/consul.(ACLResolver).ResolveTokenToIdentityAndAuthorizer() /home/daniel/pers/code/consul/agent/consul/acl.go:1226 +0x6ef github.com/hashicorp/consul/agent/consul.resolveTokenAsync() /home/daniel/pers/code/consul/agent/consul/acl_test.go:66 +0x5c Previous write at 0x00c0028b4b10 by goroutine 1509: github.com/hashicorp/consul/agent/structs.(ACLPolicy).SetHash() /home/daniel/pers/code/consul/agent/structs/acl.go:730 +0x3a8 github.com/hashicorp/consul/agent/structs.ACLPolicies.resolveWithCache() /home/daniel/pers/code/consul/agent/structs/acl.go:779 +0xfe github.com/hashicorp/consul/agent/structs.ACLPolicies.Compile() /home/daniel/pers/code/consul/agent/structs/acl.go:809 +0xf1 github.com/hashicorp/consul/agent/consul.(ACLResolver).ResolveTokenToIdentityAndAuthorizer() /home/daniel/pers/code/consul/agent/consul/acl.go:1226 +0x6ef github.com/hashicorp/consul/agent/consul.resolveTokenAsync() /home/daniel/pers/code/consul/agent/consul/acl_test.go:66 +0x5c Goroutine 1182 (running) created at: github.com/hashicorp/consul/agent/consul.TestACLResolver_Client.func4() /home/daniel/pers/code/consul/agent/consul/acl_test.go:1669 +0x459 testing.tRunner() /usr/lib/go/src/testing/testing.go:1193 +0x202 Goroutine 1509 (running) created at: github.com/hashicorp/consul/agent/consul.TestACLResolver_Client.func4() /home/daniel/pers/code/consul/agent/consul/acl_test.go:1668 +0x415 testing.tRunner() /usr/lib/go/src/testing/testing.go:1193 +0x202 ```	2021-07-14 18:58:16 -04:00
Daniel Nephin	ff26294d63	consul: fix data race in leader CA tests Some global variables are patched to shorter values in these tests. But the goroutines that read them can outlive the test because nothing waited for them to exit. This commit adds a Wait() method to the routine manager, so that tests can wait for the goroutines to exit. This prevents the data race because the 'reset to original value' can happen after all other goroutines have stopped.	2021-07-14 18:58:15 -04:00
Giulio Micheloni	96fe1f4078	acl: acl replication routine to report the last error message	2021-07-14 11:50:23 +02:00
Dhia Ayachi	53b45a8441	check expiry date of the root/intermediate before using it to sign a leaf (#10500 ) * ca: move provider creation into CAManager This further decouples the CAManager from Server. It reduces the interface between them and removes the need for the SetLogger method on providers. * ca: move SignCertificate to CAManager To reduce the scope of Server, and keep all the CA logic together * ca: move SignCertificate to the file where it is used * auto-config: move autoConfigBackend impl off of Server Most of these methods are used exclusively for the AutoConfig RPC endpoint. This PR uses a pattern that we've used in other places as an incremental step to reducing the scope of Server. * fix linter issues * check error when `raftApplyMsgpack` * ca: move SignCertificate to CAManager To reduce the scope of Server, and keep all the CA logic together * check expiry date of the intermediate before using it to sign a leaf * fix typo in comment Co-authored-by: Kyle Havlovitz <kylehav@gmail.com> * Fix test name * do not check cert start date * wrap error to mention it is the intermediate expired * Fix failing test * update comment Co-authored-by: Daniel Nephin <dnephin@hashicorp.com> * use shim to avoid sleep in test * add root cert validation * remove duplicate code * Revert "fix linter issues" This reverts commit 6356302b54f06c8f2dee8e59740409d49e84ef24. * fix import issue * gofmt leader_connect_ca * add changelog entry * update error message Co-authored-by: Freddy <freddygv@users.noreply.github.com> * fix error message in test Co-authored-by: Daniel Nephin <dnephin@hashicorp.com> Co-authored-by: Kyle Havlovitz <kylehav@gmail.com> Co-authored-by: Freddy <freddygv@users.noreply.github.com>	2021-07-13 12:15:06 -04:00
R.B. Boyer	ae8b526be8	connect/ca: ensure edits to the key type/bits for the connect builtin CA will regenerate the roots (#10330 ) progress on #9572	2021-07-13 11:12:07 -05:00
R.B. Boyer	0537922c6c	connect/ca: require new vault mount points when updating the key type/bits for the vault connect CA provider (#10331 ) progress on #9572	2021-07-13 11:11:46 -05:00
Daniel Nephin	58cf5767a8	Merge pull request #10479 from hashicorp/dnephin/ca-provider-explore-2 ca: move Server.SignIntermediate to CAManager	2021-07-12 19:03:43 -04:00
Daniel Nephin	a22bdb2ac9	Merge pull request #10445 from hashicorp/dnephin/ca-provider-explore ca: isolate more of the CA logic in CAManager	2021-07-12 15:26:23 -04:00
Daniel Nephin	fdb0ba8041	ca: use provider constructors to be more consistent Adds a contructor for the one provider that did not have one.	2021-07-12 14:04:34 -04:00
Dhia Ayachi	3eac4ffda4	check error when `raftApplyMsgpack`	2021-07-12 13:42:51 -04:00
Daniel Nephin	34c8585b29	auto-config: move autoConfigBackend impl off of Server Most of these methods are used exclusively for the AutoConfig RPC endpoint. This PR uses a pattern that we've used in other places as an incremental step to reducing the scope of Server.	2021-07-12 13:42:40 -04:00
Daniel Nephin	605275b4dc	ca: move SignCertificate to the file where it is used	2021-07-12 13:42:39 -04:00
Daniel Nephin	c2e85f25d4	ca: move SignCertificate to CAManager To reduce the scope of Server, and keep all the CA logic together	2021-07-12 13:42:39 -04:00
Dhia Ayachi	a0320169fe	add missing state reset when stopping ca manager	2021-07-12 09:32:36 -04:00
Daniel Nephin	68d5f7769a	ca: fix mockCAServerDelegate to work with the new interface raftApply was removed so ApplyCARequest needs to handle all the possible operations Also set the providerShim to use the mock provider. other changes are small test improvements that were necessary to debug the failures.	2021-07-12 09:32:36 -04:00
Daniel Nephin	6d4b0ce194	ca: remove unused method and small refactor to getCAProvider so that GoLand is less confused about what it is doing. Previously it was reporting that the for condition was always true, which was not the case.	2021-07-12 09:32:35 -04:00
Daniel Nephin	4330122d9a	ca: remove raftApply from delegate interface After moving ca.ConsulProviderStateDelegate into the interface we now have the ApplyCARequest method which does the same thing. Use this more specific method instead of raftApply.	2021-07-12 09:32:35 -04:00
Daniel Nephin	fae0a8f851	ca: move generateCASignRequest to the delegate This method on Server was only used by the caDelegateWithState, so move it there until we can move it entirely into CAManager.	2021-07-12 09:32:35 -04:00
Daniel Nephin	d4bb9fd97a	ca: move provider creation into CAManager This further decouples the CAManager from Server. It reduces the interface between them and removes the need for the SetLogger method on providers.	2021-07-12 09:32:33 -04:00
Daniel Nephin	fc629d9eaa	ca-manager: move provider shutdown into CAManager Reducing the coupling between Server and CAManager	2021-07-12 09:27:28 -04:00
Daniel Nephin	1e23d181b5	config: remove misleading UseTLS field This field was documented as enabling TLS for outgoing RPC, but that was not the case. All this field did was set the use_tls serf tag. Instead of setting this field in a place far from where it is used, move the logic to where the serf tag is set, so that the code is much more obvious.	2021-07-09 19:01:45 -04:00
Daniel Nephin	3c60a46376	config: remove duplicate TLSConfig fields from agent/consul.Config tlsutil.Config already presents an excellent structure for this configuration. Copying the runtime config fields to agent/consul.Config makes code harder to trace, and provides no advantage. Instead of copying the fields around, use the tlsutil.Config struct directly instead. This is one small step in removing the many layers of duplicate configuration.	2021-07-09 18:49:42 -04:00
Evan Culver	5ff191ad99	Add support for returning ACL secret IDs for accessors with acl:write (#10546 )	2021-07-08 15:13:08 -07:00

... 5 6 7 8 9 ...

1952 Commits