open-consul

Commit Graph

Author	SHA1	Message	Date
Paul Banks	d6c0557e86	Connect: allow configuring Envoy for L7 Observability (#5558 ) * Add support for HTTP proxy listeners * Add customizable bootstrap configuration options * Debug logging for xDS AuthZ * Add Envoy Integration test suite with basic test coverage * Add envoy command tests to cover new cases * Add tracing integration test * Add gRPC support WIP * Merged changes from master Docker. get CI integration to work with same Dockerfile now * Make docker build optional for integration * Enable integration tests again! * http2 and grpc integration tests and fixes * Fix up command config tests * Store all container logs as artifacts in circle on fail * Add retries to outer part of stats measurements as we keep missing them in CI * Only dump logs on failing cases * Fix typos from code review * Review tidying and make tests pass again * Add debug logs to exec test. * Fix legit test failure caused by upstream rename in envoy config * Attempt to reduce cases of bad TLS handshake in CI integration tests * bring up the right service * Add prometheus integration test * Add test for denied AuthZ both HTTP and TCP * Try ANSI term for Circle	2019-04-29 17:27:57 +01:00
R.B. Boyer	5a505c5b3a	acl: adding support for kubernetes auth provider login (#5600 ) * auth providers * binding rules * auth provider for kubernetes * login/logout	2019-04-26 14:49:25 -05:00
Matt Keeler	2831c8993d	Move the watch package into the api module (#5664 ) * Move the watch package into the api module It was already just a thin wrapper around the API anyways. The biggest change was to the testing. Instead of using a test agent directly from the agent package it now uses the binary on the PATH just like the other API tests. The other big changes were to fix up the connect based watch tests so that we didn’t need to pull in the connect package (and therefore all of Consul)	2019-04-26 12:33:01 -04:00
Matt Keeler	913c82ec9f	Update go-msgpack version (#5683) Fixes #4673 Supersedes: #5677 There was an error decoding `map[string]string` values due to Go strings being immutable. This was fixed in our go-msgpack fork.	2019-04-18 15:10:34 -04:00
Matt Keeler	ac78c23021	Implement data filtering of some endpoints (#5579 ) Fixes: #4222 # Data Filtering This PR will implement filtering for the following endpoints: ## Supported HTTP Endpoints - `/agent/checks` - `/agent/services` - `/catalog/nodes` - `/catalog/service/:service` - `/catalog/connect/:service` - `/catalog/node/:node` - `/health/node/:node` - `/health/checks/:service` - `/health/service/:service` - `/health/connect/:service` - `/health/state/:state` - `/internal/ui/nodes` - `/internal/ui/services` More can be added going forward and any endpoint which is used to list some data is a good candidate. ## Usage When using the HTTP API a `filter` query parameter can be used to pass a filter expression to Consul. Filter Expressions take the general form of: ``` <selector> == <value> <selector> != <value> <value> in <selector> <value> not in <selector> <selector> contains <value> <selector> not contains <value> <selector> is empty <selector> is not empty not <other expression> <expression 1> and <expression 2> <expression 1> or <expression 2> ``` Normal boolean logic and precedence is supported. All of the actual filtering and evaluation logic is coming from the [go-bexpr](https://github.com/hashicorp/go-bexpr) library ## Other changes Adding the `Internal.ServiceDump` RPC endpoint. This will allow the UI to filter services better.	2019-04-16 12:00:15 -04:00
Freddy	4fa4cffd41	Add additional raft metrics (#5628 ) * Add documentation for new raft metrics * Revendor raft from master	2019-04-09 16:09:22 -06:00
Paul Banks	869387323f	Pull go-discover to fix Sirupsen/logrus (#5598 ) * Pull go-discover to fix Sirupsen/logrus * Actually rename Sirupsen -> sirupsen in vendor (despite macOS) * Actually _actually_ rename Sirupsen -> sirupsen in vendor (despite macOS)	2019-04-03 20:07:00 +01:00
Hans Hasselberg	cf4eb2474a	fix remaining CI failures after Go 1.12.1 Upgrade (#5576 )	2019-03-29 16:29:27 +01:00
Jeff Mitchell	ae509858ab	Bump vendor to take in new sdk/api versions (#5574 )	2019-03-27 09:03:07 -04:00
Jeff Mitchell	d3c7d57209	Move internal/ to sdk/ (#5568 ) * Move internal/ to sdk/ * Add a readme to the SDK folder	2019-03-27 08:54:56 -04:00
Jeff Mitchell	b43800125c	Update vendoring from go mod. (#5566 )	2019-03-26 17:50:42 -04:00
R.B. Boyer	d65008700a	acl: reduce complexity of token resolution process with alternative singleflighting (#5480 ) acl: reduce complexity of token resolution process with alternative singleflighting Switches acl resolution to use golang.org/x/sync/singleflight. For the identity/legacy lookups this is a drop-in replacement with the same overall approach to request coalescing. For policies this is technically a change in behavior, but when considered holistically is approximately performance neutral (with the benefit of less code). There are two goals with this blob of code (speaking specifically of policy resolution here): 1) Minimize cross-DC requests. 2) Minimize client-to-server LAN requests. The previous iteration of this code was optimizing for the case of many possibly different tokens being resolved concurrently that have a significant overlap in linked policies such that deduplication would be worth the complexity. While this is laudable there are some things to consider that can help to adjust expectations: 1) For v1.4+ policies are always replicated, and once a single policy shows up in a secondary DC the replicated data is considered authoritative for requests made in that DC. This means that our earlier concerns about minimizing cross-DC requests are irrelevant because there will be no cross-DC policy reads that occur. 2) For Server nodes the in-memory ACL policy cache is capped at zero, meaning it has no caching. Only Client nodes run with a cache. This means that instead of having an entire DC's worth of tokens (what a Server might see) that can have policy resolutions coalesced these nodes will only ever be seeing node-local token resolutions. In a reasonable worst-case scenario where a scheduler like Kubernetes has "filled" a node with Connect services, even that will only schedule ~100 connect services per node. 
If every service has a unique token there will only be 100 tokens to coalesce and even then those requests have to occur concurrently AND be hitting an empty consul cache. Instead of seeing a great coalescing opportunity for cutting down on redundant Policy resolutions, in practice it's far more likely given node densities that you'd see requests for the same token concurrently than you would for two tokens sharing a policy concurrently (to a degree that would warrant the overhead of the current variation of singleflighting). Given that, this patch switches the Policy resolution process to only singleflight by requesting token (but keeps the cache as by-policy).	2019-03-14 09:35:34 -05:00
petems	e9b7569759	Update go-discover vendor * Adds note about use of ENV variables for auto-join on Azure	2019-03-08 22:57:48 +00:00
Pierre Souchay	2ed7fddb06	Revendor memberlist to Fix #3217 Upgrade leads to protocol version (2) is incompatible: [1, 0] (#5313) This is fixed in https://github.com/hashicorp/memberlist/pull/178, bump memberlist to fix possible split brain in Consul.	2019-02-05 10:20:14 -05:00
Matt Keeler	a2fb5eafdd	Revendor serf to pull in keyring list truncation changes. (#5251 )	2019-01-22 16:07:04 -05:00
Pierre Souchay	8dd3476921	Allow `"disable_host_node_id": false` to work on Linux as non-root. (#4926) Bump `shirou/gopsutil` to include https://github.com/shirou/gopsutil/pull/603 This allows a consistent node-id even when the machine is reinstalled when using `"disable_host_node_id": false` It will fix https://github.com/hashicorp/consul/issues/4914 and allow having the same node-id even when reinstalling a node from scratch. However, it is only compatible with a single OS (installing to Windows will change the node-id, but it seems acceptable).	2019-01-10 10:50:14 -05:00
R.B. Boyer	7f30950060	update github.com/hashicorp/{serf,memberlist,go-sockaddr} (#5189 ) This activates large-cluster improvements in the gossip layer from https://github.com/hashicorp/memberlist/pull/167	2019-01-07 15:00:47 -06:00
Jack Pearkes	5951f842d3	vendor: upgrade to latest version of gopsutil	2018-10-19 11:33:23 -07:00
Matt Keeler	99e0a124cb	New ACLs (#4791) This PR is almost a complete rewrite of the ACL system within Consul. It brings the features more in line with other HashiCorp products. Obviously there is quite a bit left to do here but most of it is related to docs, testing and finishing the last few commands in the CLI. I will update the PR description and check off the todos as I finish them over the next few days/week. Description At a high level this PR is mainly to split ACL tokens from Policies and to split the concepts of Authorization from Identities. A lot of this PR is mostly just to support CRUD operations on ACLTokens and ACLPolicies. These in and of themselves are not particularly interesting. The bigger conceptual changes are in how tokens get resolved, how backwards compatibility is handled and the separation of policy from identity which could lead the way to allowing for alternative identity providers. On the surface and with a new cluster the ACL system will look very similar to that of Nomad. Both have tokens and policies. Both have local tokens. The ACL management APIs for both are very similar. I even ripped off Nomad's ACL bootstrap resetting procedure. There are a few key differences though. Nomad requires token and policy replication where Consul only requires policy replication with token replication being opt-in. In Consul local tokens only work with token replication being enabled though. All policies in Nomad are globally applicable. In Consul all policies are stored and replicated globally but can be scoped to a subset of the datacenters. This allows for more granular access management. Unlike Nomad, Consul has legacy baggage in the form of the original ACL system. The ramifications of this are: A server running the new system must still support other clients using the legacy system. A client running the new system must be able to use the legacy RPCs when the servers in its datacenter are running the legacy system. 
The primary ACL DC's servers running in legacy mode need to act as a gate that keeps everything else in the entire multi-DC cluster running in legacy mode. So not only does this PR implement the new ACL system but has a legacy mode built in for when the cluster isn't ready for new ACLs. Also detecting that new ACLs can be used is automatic and requires no configuration on the part of administrators. This process is detailed more in the "Transitioning from Legacy to New ACL Mode" section below.	2018-10-19 12:04:07 -04:00
Jack Pearkes	197d62c6ca	New command: consul debug (#4754 ) * agent/debug: add package for debugging, host info * api: add v1/agent/host endpoint * agent: add v1/agent/host endpoint * command/debug: implementation of static capture * command/debug: tests and only configured targets * agent/debug: add basic test for host metrics * command/debug: add methods for dynamic data capture * api: add debug/pprof endpoints * command/debug: add pprof * command/debug: timing, wg, logs to disk * vendor: add gopsutil/disk * command/debug: add a usage section * website: add docs for consul debug * agent/host: require operator:read * api/host: improve docs and no retry timing * command/debug: fail on extra arguments * command/debug: fixup file permissions to 0644 * command/debug: remove server flags * command/debug: improve clarity of usage section * api/debug: add Trace for profiling, fix profile * command/debug: capture profile and trace at the same time * command/debug: add index document * command/debug: use "clusters" in place of members * command/debug: remove address in output * command/debug: improve comment on metrics sleep * command/debug: clarify usage * agent: always register pprof handlers and protect This will allow us to avoid a restart of a target agent for profiling by always registering the pprof handlers. Given this is a potentially sensitive path, it is protected with an operator:read ACL and enable debug being set to true on the target agent. enable_debug still requires a restart. If ACLs are disabled, enable_debug is sufficient. * command/debug: use trace.out instead of .prof More in line with golang docs. * agent: fix comment wording * agent: wrap table driven tests in t.run()	2018-10-19 08:41:03 -07:00
Paul Banks	251da1077f	xDS Server Implementation (#4731 ) * Vendor updates for gRPC and xDS server * xDS server implementation for serving Envoy as a Connect proxy * Address initial review comments * consistent envoy package aliases; typos fixed; override TLS and authz for custom listeners * Moar Typos * Moar typos	2018-10-10 16:55:34 +01:00
Mitchell Hashimoto	9846999505	vendor: update mapstructure to v1.1.0 We require this change to support struct to struct decoding.	2018-09-30 19:15:40 -07:00
Matt Keeler	ba4f912b25	Update Raft Vendoring (#4539 ) Pulls in a fix for a potential memory leak regarding consistent reads that invoke VerifyLeader.	2018-09-06 15:07:42 -04:00
Mitchell Hashimoto	f7a95e1a28	vendor k8s client lib	2018-09-05 14:59:02 -07:00
Mitchell Hashimoto	144b7efa51	Update go-discover vendor	2018-09-05 13:31:10 -07:00
Shubheksha	1afcabb0a2	replace old fork of text package (#4501 )	2018-08-14 12:23:18 -07:00
Paul Banks	3adfe86f03	Update Serf and memberlist (#4511) This includes fixes that improve gossip scalability on very large (> 10k node) clusters. The Serf changes: - take snapshot disk IO out of the critical path for handling messages hashicorp/serf#524 - make snapshot compaction much less aggressive - the old fixed threshold caused snapshots to be constantly compacted (synchronously with request handling) on clusters larger than about 2000 nodes! hashicorp/serf#525 Memberlist changes: - prioritize handling alive messages over suspect/dead to improve stability, and handle queue in LIFO order to avoid acting on info that's already stale in the queue by the time we handle it. hashicorp/memberlist#159 - limit the number of concurrent pushPull requests being handled at once to 128. In one test scenario with tens of thousands of servers we saw channel and lock blocking cause over 3000 pushPulls at once which ballooned the memory of the server because each push pull contained a de-serialised list of all known 10k+ nodes and their tags for a total of about 60 million objects and 7GB of memory stuck. While the rest of the fixes here should prevent the same root cause from blocking in the same way, this prevents any other bug or source of contention from allowing pushPull messages to stack up and eat resources. hashicorp/memberlist#158	2018-08-09 13:16:13 -04:00
Siva Prasad	a5ebab63e7	Vendoring update for go-discover. (#4412 ) * New Providers added and updated vendoring for go-discover * Vendor.json formatted using make vendorfmt * Docs/Agent/auto-join: Added documentation for the new providers introduced in this PR * Updated the golang.org/x/sys/unix in the vendor directory * Agent: TestGoDiscoverRegistration updated to reflect the addition of new providers * Deleted terraform.tfstate from vendor. * Deleted terraform.tfstate.backup Deleted terraform state file artifacts from unknown runs. * Updated x/sys/windows vendor for Windows binary compilation	2018-07-25 16:21:04 -07:00
Matt Keeler	9757a6fb62	Vendor golang.org/x/sys/windows/svc	2018-07-12 11:29:57 -04:00
mkeeler	1da3c42867	Merge remote-tracking branch 'connect/f-connect'	2018-06-25 19:42:51 +00:00
Matt Keeler	bc7e9b6fd4	Remove build tags from vendored vault file to allow for this to merge properly into enterprise	2018-06-25 12:26:10 -07:00
Matt Keeler	2f90768662	Vendor the vault api	2018-06-25 12:26:10 -07:00
Paul Banks	86a55892fd	Remove go-diff vendor as assert.JSONEq output is way better for our case	2018-06-25 12:25:39 -07:00
Leo Zhang	b498816e80	Fix invalid vendor.json syntax for go-discover	2018-06-15 02:02:12 -07:00
Kyle Havlovitz	80b6d0a6cf	Add missing vendor dep github.com/stretchr/objx	2018-06-14 09:42:13 -07:00
Matt Keeler	33148f482d	Remove bogus second yamux vendoring	2018-06-04 16:28:33 -04:00
Matt Keeler	1e485ed727	Update yamux vendoring Pulls in logging fixes.	2018-06-04 16:02:50 -04:00
Jack Pearkes	c4112f2b9a	Merge pull request #4013 from sethvargo/sethvargo/user_agent Add a helper for generating Consul's user-agent string	2018-06-01 09:13:38 -07:00
Matt Keeler	1c577b2012	Merge pull request #4131 from pierresouchay/enable_full_dns_compression Enable full dns compression	2018-06-01 10:42:03 -04:00
Seth Vargo	5911fd5344	Update vendor for go-discover	2018-05-25 15:52:05 -04:00
Wim	e8d0474a8e	Add github.com/coredns/coredns/plugin/pkg/dnsutil files	2018-05-21 22:25:16 +02:00
Wim	9565b5415b	Add github.com/coredns/coredns/plugin/pkg/dnsutil to vendor.json	2018-05-21 22:18:19 +02:00
Pierre Souchay	61e7d06174	Bump DNS lib to 1.0.7 with 14bits Len() fix	2018-05-16 10:52:51 +02:00
Matt Keeler	4d2a0308e8	Fix vendoring of two missed libs	2018-05-11 11:31:42 -04:00
Matt Keeler	586c91e8ea	Update prometheus indirect deps	2018-05-11 11:18:15 -04:00
Matt Keeler	ba376bcd2b	Update the various deps of miekg/dns in our vendor.json	2018-05-11 10:52:05 -04:00
Matt Keeler	7928af61f2	Pull in miekg/dns deps on the golang crypto ed25519 packages	2018-05-11 10:31:27 -04:00
Kyle Havlovitz	7cd7f4acd7	vendor: pull in latest version of go-discover	2018-05-10 15:40:16 -07:00
Preetha Appan	98a04a0af9	Update serf to pick up clean leave fix	2018-05-04 15:51:55 -05:00
Paul Banks	06e1a62653	Merge pull request #4016 from pierresouchay/support_for_prometheus Support for prometheus for metrics endpoint	2018-04-24 16:14:43 +01:00

283 Commits
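
Commit ac78c23021 above describes passing a filter expression to the HTTP API through a `filter` query parameter. A minimal sketch of building such a request URL follows. The agent address `http://127.0.0.1:8500`, the `/v1` path prefix, the `Meta.env == prod` expression, and the helper name `buildFilterURL` are all assumptions chosen for illustration; they do not come from the commit message.

```go
package main

import (
	"fmt"
	"net/url"
)

// buildFilterURL is a hypothetical helper: it attaches a filter expression
// to an endpoint as the `filter` query parameter named in the commit message.
func buildFilterURL(agentAddr, endpoint, expr string) (string, error) {
	u, err := url.Parse(agentAddr)
	if err != nil {
		return "", err
	}
	u.Path = endpoint

	// url.Values handles escaping of spaces and operators in the expression.
	q := url.Values{}
	q.Set("filter", expr)
	u.RawQuery = q.Encode()
	return u.String(), nil
}

func main() {
	// The address, endpoint prefix, and selector below are assumed values.
	u, err := buildFilterURL("http://127.0.0.1:8500", "/v1/catalog/nodes", "Meta.env == prod")
	if err != nil {
		fmt.Println("error:", err)
		return
	}
	fmt.Println(u)
}
```

Building the query with `url.Values` rather than string concatenation keeps expressions containing spaces, quotes, or `==` correctly escaped.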