open-consul

Commit Graph

Author	SHA1	Message	Date
Daniel Nephin	d2f5b4d335	debug: improve a couple of the test cases Use gotest.tools/v3/fs to make better assertions about the files Remove the TestAgent from TestDebugCommand_Prepare_ValidateTiming, since we can test that validation without making any API calls.	2021-08-18 12:29:34 -04:00
Roopak Venkatakrishnan	d4dacd0e2e	Update x/sys to support go 1.17	2021-08-18 03:00:22 +00:00
Mike Morris	86d76cb099	deps: upgrade gogo-protobuf to v1.3.2 (#10813) * deps: upgrade gogo-protobuf to v1.3.2 * go mod tidy using go 1.16 * proto: regen protobufs after upgrading gogo/protobuf Co-authored-by: Daniel Nephin <dnephin@hashicorp.com>	2021-08-12 14:05:46 -04:00
Daniel Nephin	5da6c51ae4	Update armon/go-metrics To pickup new InMemSink.Stream method	2021-07-26 15:58:17 -04:00
Daniel Nephin	291315e39f	Update serf To pick up data race fixes	2021-07-14 18:58:16 -04:00
Dhia Ayachi	c3eacac764	upgrade golang crypto from 0.0.0-20200930160638-afb6bcd081ae => v0.0.0-20210513164829-c07d793c2f9a (#10390)	2021-06-14 12:38:42 -04:00
Dhia Ayachi	e3dd0f9a44	generate a single debug file for a long duration capture (#10279) * debug: remove the CLI check for debug_enabled The API allows collecting profiles even debug_enabled=false as long as ACLs are enabled. Remove this check from the CLI so that users do not need to set debug_enabled=true for no reason. Also: - fix the API client to return errors on non-200 status codes for debug endpoints - improve the failure messages when pprof data can not be collected Co-Authored-By: Dhia Ayachi <dhia@hashicorp.com> * remove parallel test runs parallel runs create a race condition that fail the debug tests * snapshot the timestamp at the beginning of the capture - timestamp used to create the capture sub folder is snapshot only at the beginning of the capture and reused for subsequent captures - capture append to the file if it already exist * Revert "snapshot the timestamp at the beginning of the capture" This reverts commit c2d03346 * Refactor captureDynamic to extract capture logic for each item in a different func * snapshot the timestamp at the beginning of the capture - timestamp used to create the capture sub folder is snapshot only at the beginning of the capture and reused for subsequent captures - capture append to the file if it already exist * Revert "snapshot the timestamp at the beginning of the capture" This reverts commit c2d03346 * Refactor captureDynamic to extract capture logic for each item in a different func * extract wait group outside the go routine to avoid a race condition * capture pprof in a separate go routine * perform a single capture for pprof data for the whole duration * add missing vendor dependency * add a change log and fix documentation to reflect the change * create function for timestamp dir creation and simplify error handling * use error groups and ticker to simplify interval capture loop * Logs, profile and traces are captured for the full duration. Metrics, Heap and Go routines are captured every interval * refactor Logs capture routine and add log capture specific test * improve error reporting when log test fail * change test duration to 1s * make time parsing in log line more robust * refactor log time format in a const * test on log line empty the earliest possible and return Co-authored-by: Freddy <freddygv@users.noreply.github.com> * rename function to captureShortLived * more specific changelog Co-authored-by: Paul Banks <banks@banksco.de> * update documentation to reflect current implementation * add test for behavior when invalid param is passed to the command * fix argument line in test * a more detailed description of the new behaviour Co-authored-by: Paul Banks <banks@banksco.de> * print success right after the capture is done * remove an unnecessary error check Co-authored-by: Daniel Nephin <dnephin@hashicorp.com> * upgraded github.com/google/pprof v0.0.0-20181206194817-3ea8567a2e57 => v0.0.0-20210601050228-01bbb1931b22 Co-authored-by: Daniel Nephin <dnephin@hashicorp.com> Co-authored-by: Freddy <freddygv@users.noreply.github.com> Co-authored-by: Paul Banks <banks@banksco.de>	2021-06-07 13:00:51 -04:00
Matt Keeler	b45dd03b8f	Bump raft-autopilot version to the latest. (#10306)	2021-05-27 12:59:14 -04:00
Daniel Nephin	71d6a2bf4b	Fix some test flakes - return errors in TestAgent.Start so that the retry works correctly - remove duplicate logging, the error is returned already - add a missing t.Helper() to retry.Run - properly set a.Agent to nil so that subsequent retry attempts will actually try to start	2021-05-10 13:20:45 -04:00
Daniel Nephin	203c752ee8	Update a couple dependencies To pickup bug fixes	2021-05-04 14:09:10 -04:00
Paul Banks	d47eea3a3f	Make Raft trailing logs and snapshot timing reloadable (#10129) * WIP reloadable raft config * Pre-define new raft gauges * Update go-metrics to change gauge reset behaviour * Update raft to pull in new metric and reloadable config * Add snapshot persistance timing and installSnapshot to our 'protected' list as they can be infrequent but are important * Update telemetry docs * Update config and telemetry docs * Add note to oldestLogAge on when it is visible * Add changelog entry * Update website/content/docs/agent/options.mdx Co-authored-by: Matt Keeler <mkeeler@users.noreply.github.com> Co-authored-by: Matt Keeler <mkeeler@users.noreply.github.com>	2021-05-04 15:36:53 +01:00
R.B. Boyer	91bee6246f	Support Incremental xDS mode (#9855) This adds support for the Incremental xDS protocol when using xDS v3. This is best reviewed commit-by-commit and will not be squashed when merged. Union of all commit messages follows to give an overarching summary: xds: exclusively support incremental xDS when using xDS v3 Attempts to use SoTW via v3 will fail, much like attempts to use incremental via v2 will fail. Work around a strange older envoy behavior involving empty CDS responses over incremental xDS. xds: various cleanups and refactors that don't strictly concern the addition of incremental xDS support Dissolve the connectionInfo struct in favor of per-connection ResourceGenerators instead. Do a better job of ensuring the xds code uses a well configured logger that accurately describes the connected client. xds: pull out checkStreamACLs method in advance of a later commit xds: rewrite SoTW xDS protocol tests to use protobufs rather than hand-rolled json strings In the test we very lightly reuse some of the more boring protobuf construction helper code that is also technically under test. The important thing of the protocol tests is testing the protocol. The actual inputs and outputs are largely already handled by the xds golden output tests now so these protocol tests don't have to do double-duty. This also updates the SoTW protocol test to exclusively use xDS v2 which is the only variant of SoTW that will be supported in Consul 1.10. xds: default xds.Server.AuthCheckFrequency at use-time instead of construction-time	2021-04-29 13:54:05 -05:00
R.B. Boyer	1ae772ff99	mod: bump to github.com/hashicorp/mdns v1.0.4 (#10018)	2021-04-14 14:17:52 -05:00
Daniel Nephin	78cb867c8e	Update memberlist to v0.2.3 To pickup data race fixes	2021-03-24 18:20:19 -04:00
Daniel Nephin	66c3c76aa6	Update go-memdb To use a version that will not panic when an iterator is used with modifications.	2021-01-28 17:19:55 -05:00
Daniel Nephin	2eea58bcc4	Merge pull request #9302 from hashicorp/dnephin/add-service-3 agent: remove ServiceManager.Start goroutine	2021-01-28 16:59:41 -05:00
Matt Keeler	1379b5f7d6	Upgrade raft-autopilot and wait for autopilot it to stop when revoking leadership (#9644) Fixes: 9626	2021-01-27 11:14:52 -05:00
Daniel Nephin	3685f39970	lib/mutex: add mutex with TryLock and update vendor	2021-01-25 18:01:47 -05:00
Daniel Nephin	90bf8460a1	Update mapstructure	2021-01-12 12:24:56 -05:00
Pierre Souchay	4f8b0b307c	[bugfix] Prometheus metrics without warnings go-metrics is updated to 0.3.6 to properly handle help in prometheus metrics This fixes https://github.com/hashicorp/consul/issues/9303 and https://github.com/hashicorp/consul/issues/9471	2021-01-06 13:54:05 +01:00
Mike Morris	67a11e4d16	Merge pull request #9270 from hashicorp/release/1.9.0 merge: release/1.9.0 back into 1.9.x	2020-11-24 17:36:47 -05:00
Matt Keeler	755fb72994	Switch to using the external autopilot module	2020-11-09 09:22:11 -05:00
Mike Morris	9ccb340893	chore: upgrade to gopsutil/v3 (#9118) * deps: update golang.org/x/sys * deps: update imports to gopsutil/v3 * chore: make update-vendor	2020-11-06 20:48:38 -05:00
Kit Patella	b668592326	rollback golang.org/x/sys version to fix distro-build	2020-11-05 12:09:07 -08:00
Kit Patella	fbe61ad16c	upgrade go-metrics to latest	2020-11-04 14:02:13 -08:00
Kyle Havlovitz	95f7b354c2	vendor: Update github.com/hashicorp/yamux	2020-10-09 05:05:46 -07:00
Kyle Havlovitz	8e0ea86754	vendor: Update github.com/hashicorp/mdns	2020-10-09 04:43:27 -07:00
Kyle Havlovitz	3cd60e1d72	vendor: Update github.com/hashicorp/hil	2020-10-09 04:43:27 -07:00
Kyle Havlovitz	02e282a7ab	vendor: Update github.com/hashicorp/go-version	2020-10-09 04:43:27 -07:00
Kyle Havlovitz	bc6ffb59b8	vendor: Update github.com/hashicorp/go-memdb	2020-10-09 04:43:27 -07:00
Kyle Havlovitz	b5bb29f938	vendor: Update github.com/hashicorp/go-checkpoint	2020-10-09 04:43:27 -07:00
Mike Morris	4ae98cde2b	chore: update raft to v1.2.0 (#8822)	2020-10-08 15:07:10 -04:00
Matt Keeler	141eb60f06	Add per-agent reconnect timeouts (#8781) This allows for client agent to be run in a more stateless manner where they may be abruptly terminated and not expected to come back. If advertising a per-agent reconnect timeout using the advertise_reconnect_timeout configuration when that agent leaves, other agents will wait only that amount of time for the agent to come back before reaping it. This has the advantageous side effect of causing servers to deregister the node/services/checks for that agent sooner than if the global reconnect_timeout was used.	2020-10-08 15:02:19 -04:00
Mike Morris	1d4f3166fb	chore(deps): update gopsutil to v2.20.9 (#8843) * core(deps): bump golang.org/x/sys To resolve /go/pkg/mod/github.com/shirou/gopsutil@v2.20.9+incompatible/host/host_bsd.go:20:13: undefined: unix.SysctlTimeval * chore(deps): make update-vendor	2020-10-07 12:57:18 -04:00
Daniel Nephin	b9bf0b527c	Vendor gofuzz and google/go-cmp	2020-09-28 18:28:37 -04:00
Kyle Havlovitz	c8fd61abc7	Merge branch 'master' into vault-ca-renew-token	2020-09-15 14:39:04 -07:00
Kyle Havlovitz	316600a685	Update vault CA for latest api client	2020-09-15 13:33:55 -07:00
Kyle Havlovitz	c3bd917650	vendor: Update vault api package	2020-09-15 12:45:29 -07:00
Daniel Nephin	beb125f053	Update go-metrics dependencies, to use metrics.Default()	2020-09-14 19:05:22 -04:00
Mike Morris	e08272ce8b	vendor: bump consul/api to v1.7.0	2020-09-10 21:40:41 -04:00
R.B. Boyer	f2b8bf109c	xds: use envoy's rbac filter to handle intentions entirely within envoy (#8569)	2020-08-27 12:20:58 -05:00
Hans Hasselberg	02de4c8b76	add primary keys to list keyring (#8522 ) During gossip encryption key rotation it would be nice to be able to see if all nodes are using the same key. This PR adds another field to the json response from `GET v1/operator/keyring` which lists the primary keys in use per dc. That way an operator can tell when a key was successfully setup as primary key. Based on https://github.com/hashicorp/serf/pull/611 to add primary key to list keyring output: ```json [ { "WAN": true, "Datacenter": "dc2", "Segment": "", "Keys": { "0OuM4oC3Os18OblWiBbZUaHA7Hk+tNs/6nhNYtaNduM=": 6, "SINm887hKTzmMWeBNKTJReaTLX3mBEJKriDyt88Ad+g=": 6 }, "PrimaryKeys": { "SINm887hKTzmMWeBNKTJReaTLX3mBEJKriDyt88Ad+g=": 6 }, "NumNodes": 6 }, { "WAN": false, "Datacenter": "dc2", "Segment": "", "Keys": { "0OuM4oC3Os18OblWiBbZUaHA7Hk+tNs/6nhNYtaNduM=": 8, "SINm887hKTzmMWeBNKTJReaTLX3mBEJKriDyt88Ad+g=": 8 }, "PrimaryKeys": { "SINm887hKTzmMWeBNKTJReaTLX3mBEJKriDyt88Ad+g=": 8 }, "NumNodes": 8 }, { "WAN": false, "Datacenter": "dc1", "Segment": "", "Keys": { "0OuM4oC3Os18OblWiBbZUaHA7Hk+tNs/6nhNYtaNduM=": 3, "SINm887hKTzmMWeBNKTJReaTLX3mBEJKriDyt88Ad+g=": 8 }, "PrimaryKeys": { "SINm887hKTzmMWeBNKTJReaTLX3mBEJKriDyt88Ad+g=": 8 }, "NumNodes": 8 } ] ``` I intentionally did not change the CLI output because I didn't find a good way of displaying this information. There are a couple of options that we could implement later: * add a flag to show the primary keys * add a flag to show json output Fixes #3393.	2020-08-18 09:50:24 +02:00
s-christoff	efcda70b85	Update Go-Metrics 0.3.4 (#8478)	2020-08-11 11:17:43 -05:00
Mike Morris	68389410d6	api: bump consul/api to v1.6.0 and consul/sdk to v0.6.0 (#8460) * api: bump consul/sdk dependency to v0.6.0 * api: bump dependency to v1.6.0	2020-08-07 17:26:05 -04:00
Kyle Havlovitz	22721d56c8	vendor: Update github.com/armon/go-metrics to v0.3.3	2020-07-23 11:37:33 -07:00
Matt Keeler	2f68d5972a	Update mapstructure to v1.3.3 (#8361) This was done in preparation for another PR where I was running into https://github.com/mitchellh/mapstructure/issues/202 and implemented a fix for the library.	2020-07-22 15:13:21 -04:00
R.B. Boyer	33f3436e94	gossip: Avoid issue where two unique leave events for the same node could lead to infinite rebroadcast storms (#8343) bump serf to v0.9.3 to include fix for https://github.com/hashicorp/serf/pull/606	2020-07-21 15:48:10 -05:00
Pierre Souchay	f77182aa51	Upgrade go-connlimit to v0.3.0 / return http 429 on too many connections (#8221) Fixes #7527 I want to highlight this and explain what I think the implications are and make sure we are aware: * `HTTPConnStateFunc` closes the connection when it is beyond the limit. `Close` does not block. * `HTTPConnStateFuncWithDefault429Handler(10 * time.Millisecond)` blocks until the following is done (worst case): 1) `conn.SetDeadline(10*time.Millisecond)` so that 2) `conn.Write(429error)` is guaranteed to timeout after 10ms, so that the http 429 can be written and 3) `conn.Close` can happen The implication of this change is that accepting any new connection is worst case delayed by 10ms. But only after a client reached the limit already.	2020-07-03 09:25:07 +02:00
Hans Hasselberg	9a38e4f766	Update gopsutil (#8208) https://github.com/shirou/gopsutil/pull/895 is merged and fixes our problem. Time to update. Since there is no new version just yet, updating to the sha.	2020-07-01 14:47:56 +02:00
Matt Keeler	2ab8af4093	Add a test for go routine leaks This is in its own separate package so that it will be a separate test binary that runs thus isolating the go runtime from other tests and allowing accurate go routine leak checking. This test would ideally use goleak.VerifyTestMain but that will fail 100% of the time due to some architectural things (blocking queries and net/rpc uncancellability). This test is not comprehensive. We should enable/exercise more features and more cluster configurations. However its a start.	2020-06-24 17:09:50 -04:00


113 Commits