open-consul

Commit Graph

Author	SHA1	Message	Date
Matt Keeler	42f02e80c3	Enable filtering language support for the v1/connect/intentions… (#7593 ) * Enable filtering language support for the v1/connect/intentions listing API * Update website for filtering of Intentions * Update website/source/api/connect/intentions.html.md	2020-04-07 11:48:44 -04:00
Daniel Nephin	72e2695986	Merge pull request #7598 from pierresouchay/preallocation_of_dns_meta Pre-allocations of DNS meta to avoid several allocations	2020-04-06 14:00:32 -04:00
Pierre Souchay	a7fbf003c1	[LINT] Close resp.Body to avoid linter complaining (#7600 )	2020-04-06 09:11:04 -04:00
Pierre Souchay	91b3510821	Pre-allocations of DNS meta to avoid several allocations	2020-04-05 11:12:41 +02:00
Pierre Souchay	5f9f86a327	Fixed unstable test TestForwardSignals() Sometimes, in the CI, it could receive a SIGURG, producing this line: FAIL: TestForwardSignals/signal-interrupt (0.06s) util_test.go:286: expected to read line "signal: interrupt" but got "signal: urgent I/O condition" Only forward the signals we test to avoid this kind of false positive Example of such unstable errors in CI: https://circleci.com/gh/hashicorp/consul/153571	2020-04-03 14:23:03 +02:00
Pierre Souchay	984583d980	tests: more tolerance to latency for unstable test `TestCacheNotifyPolling()`. (#7574 )	2020-04-03 10:29:38 +02:00
Matt Keeler	5d0e661203	Ensure that token clone copies the roles (#7577 )	2020-04-02 12:09:35 -04:00
Chris Piraino	d7a870fd32	Fix flapping of mesh gateway connect-service watches (#7575 )	2020-04-02 10:12:13 -05:00
Pierre Souchay	2b8da952a8	agent: show warning when enable_script_checks is enabled without safety net (#7437) In order to enforce a bit of security on Consul agents, add a new method in agent to highlight possible security issues. This does not return an error for now, but might in the future. For now, it detects issues such as: https://www.hashicorp.com/blog/protecting-consul-from-rce-risk-in-specific-configurations/ This would display this kind of message: ``` 2020-03-11T18:27:49.873+0100 [ERROR] agent: [SECURITY] issue: error="using enable-script-checks without ACLs and without allow_write_http_from is DANGEROUS, use enable-local-script-checks instead see https://www.hashicorp.com/blog/protecting-consul-from-rce-risk-in-specific-configurations/" ```	2020-04-02 09:59:23 +02:00
Andy Lindeman	0d1d5d0863	agent: rewrite checks with proxy address, not local service address (#7518 ) Exposing checks is supposed to allow a Consul agent bound to a different IP address (e.g., in a different Kubernetes pod) to access healthchecks through the proxy while the underlying service binds to localhost. This is an important security feature that makes sure no external traffic reaches the service except through the proxy. However, as far as I can tell, this is subtly broken in the case where the Consul agent cannot reach the proxy over localhost. If a proxy is configured with: `{ LocalServiceAddress: "127.0.0.1", Checks: true }`, as is typical with a sidecar proxy, the Consul checks are currently rewritten to `127.0.0.1:<random port>`. A Consul agent that does not share the loopback address cannot reach this address. Just to make sure I was not misunderstanding, I tried configuring the proxy with `{ LocalServiceAddress: "<pod ip>", Checks: true }`. In this case, while the checks are rewritten as expected and the agent can reach the dynamic port, the proxy can no longer reach its backend because the traffic is no longer on the loopback interface. I think rewriting the checks to use `proxy.Address`, the proxy's own address, is more correct in this case. That is the IP where the proxy can be reached, both by other proxies and by a Consul agent running on a different IP. The local service address should continue to use `127.0.0.1` in most cases.	2020-04-02 09:35:43 +02:00
Andy Lindeman	42224fe45c	proxycfg: support path exposed with non-HTTP2 protocol (#7510 ) If a proxied service is a gRPC or HTTP2 service, but a path is exposed using the HTTP1 or TCP protocol, Envoy should not be configured with `http2ProtocolOptions` for the cluster backing the path. A situation where this comes up is a gRPC service whose healthcheck or metrics route (e.g. for Prometheus) is an HTTP1 service running on a different port. Previously, if these were exposed either using `Expose: { Checks: true }` or `Expose: { Paths: ... }`, Envoy would still be configured to communicate with the path over HTTP2, which would not work properly.	2020-04-02 09:35:04 +02:00
Pierre Souchay	3b5e72913e	config: validate system limits against limits.http_max_conns_per_client (#7434 ) I spent some time today on my local Mac to figure out why Consul 1.6.3+ was not accepting limits.http_max_conns_per_client. This adds an explicit check on number of file descriptors to be sure it might work (this is no guarantee as if many clients are reaching the agent, it might consume even more file descriptors) Anyway, many users are fighting with RLIMIT_NOFILE, having a clear message would allow them to figure out what to fix. Example of message (reload or start): ``` 2020-03-11T16:38:37.062+0100 [ERROR] agent: Error starting agent: error="system allows a max of 512 file descriptors, but limits.http_max_conns_per_client: 8192 needs at least 8212" ```	2020-04-02 09:22:17 +02:00
Shaker Islam	d8ac493395	docs: document exported functions in agent.go (closes #7101 ) (#7366 ) and fix one linter error	2020-04-01 22:52:23 +02:00
Pierre Souchay	96d7229bd9	[FIX BUILD] fix build due to merge of #7562 Due to merge #7562, upstream does not compile anymore. Error is: ERRO Running error: gofmt: analysis skipped: errors in package: [/Users/p.souchay/go/src/github.com/hashicorp/consul/agent/config_endpoint_test.go:188:33: too many arguments]	2020-04-01 18:29:45 +02:00
Daniel Nephin	bc02b8fbe6	Merge pull request #7562 from hashicorp/dnephin/remove-tname-from-name testing: Remove old default value from NewTestAgent() calls	2020-04-01 11:48:45 -04:00
Daniel Nephin	8d7c21b255	Merge pull request #7533 from hashicorp/dnephin/xds-server-1 agent/xds: small cleanup	2020-04-01 11:24:50 -04:00
Emre Savcı	7a99f29adc	agent: add len, cap while initializing arrays	2020-04-01 10:54:51 +02:00
Daniel Nephin	09c6ac8b92	Rename NewTestAgentWithFields to StartTestAgent This function now only starts the agent. Using: git grep -l 'StartTestAgent(t, true,' | xargs sed -i -e 's/StartTestAgent(t, true,/StartTestAgent(t,/g'	2020-03-31 17:14:55 -04:00
Daniel Nephin	d623dcbd01	Convert the remaining calls to NewTestAgentWithFields After removing the t.Name() parameter with sed, convert the last few tests which use a custom name to call NewTestAgentWithFields instead.	2020-03-31 17:14:55 -04:00
Daniel Nephin	428dd566b9	Merge pull request #7470 from hashicorp/dnephin/dns-unused-params dns: Remove a few unused function parameters	2020-03-31 16:56:19 -04:00
Pierre Souchay	5a6abf4d68	config: allow running `consul agent -dev -ui-dir=some_path` (#7525) When run with `-dev` in DevMode, it is not possible to replace the embedded UI with another one because `-dev` implies `-ui`. This commit allows this and slightly changes the error message about Consul 0.7.0, which is very old and does not apply to the current version anyway.	2020-03-31 22:36:20 +02:00
Daniel Nephin	8b6877febd	Remove name from NewTestAgent Using: git grep -l 'NewTestAgent(t, t.Name(),' | xargs sed -i -e 's/NewTestAgent(t, t.Name(),/NewTestAgent(t,/g'	2020-03-31 16:13:44 -04:00
Freddy	8a1e53754e	Add config entry for terminating gateways (#7545 ) This config entry will be used to configure terminating gateways. It accepts the name of the gateway and a list of services the gateway will represent. For each service users will be able to specify: its name, namespace, and additional options for TLS origination. Co-authored-by: Kyle Havlovitz <kylehav@gmail.com> Co-authored-by: Chris Piraino <cpiraino@hashicorp.com>	2020-03-31 13:27:32 -06:00
Kyle Havlovitz	01a23b8eb4	Add config entry/state for Ingress Gateways (#7483 ) * Add Ingress gateway config entry and other relevant structs * Add api package tests for ingress gateways * Embed EnterpriseMeta into ingress service struct * Add namespace fields to api module and test consul config write decoding * Don't require a port for ingress gateways * Add snakeJSON and camelJSON cases in command test * Run Normalize on service's ent metadata Sadly cannot think of a way to test this in OSS. * Every protocol requires at least 1 service * Validate ingress protocols * Update agent/structs/config_entry_gateways.go Co-authored-by: Chris Piraino <cpiraino@hashicorp.com> Co-authored-by: Freddy <freddygv@users.noreply.github.com>	2020-03-31 11:59:10 -05:00
Daniel Nephin	db1fb95f21	Merge pull request #7485 from hashicorp/dnephin/do-not-skip-tests-on-ci ci: Make it harder to accidentally skip tests on CI, and doc why some are skipped	2020-03-31 11:15:44 -04:00
Daniel Nephin	96c4a35de7	Remove t.Name() from TestAgent.Name And re-add the name to the logger so that log messages from different agents in a single test can be identified.	2020-03-30 16:47:24 -04:00
Daniel Nephin	fe027ac766	Document Agent.LogOutput	2020-03-30 14:32:13 -04:00
Daniel Nephin	823295fe2a	testing: reduce verbosity of output log Previously the log output included the test name twice and a long date format. The test output is already grouped by test, so adding the test name did not add any new information. The date and time are only useful to understand elapsed time, so using a short format should provide sufficient detail. Also fixed a bug in NewTestAgentWithFields where nil was returned instead of the test agent.	2020-03-30 13:23:13 -04:00
Daniel Nephin	6d612abbde	Remove unused token parameter	2020-03-27 17:57:16 -04:00
Daniel Nephin	d29c47c420	A little less 'just'	2020-03-27 16:08:25 -04:00
Daniel Nephin	1e59c6b03e	Remove unused customEDSClusterJSON	2020-03-27 15:38:16 -04:00
Matt Keeler	35c8e996c3	Ensure server requirements checks are done against ALL known se… (#7491 ) Co-authored-by: Paul Banks <banks@banksco.de>	2020-03-27 12:31:43 -04:00
Matt Keeler	ac78be97f4	Add information about which services are proxied to ui services… (#7417 )	2020-03-27 10:57:46 -04:00
Daniel Nephin	a2eb66963c	Merge pull request #7516 from hashicorp/dnephin/remove-unused-method agent: Remove unused method Encrypted from delegate interface	2020-03-26 14:17:58 -04:00
Daniel Nephin	ebb851f32d	agent: Remove unused Encrypted from interface It appears to be unused. It looks like it has been around a while; I guess at some point we stopped using this method.	2020-03-26 12:34:31 -04:00
Freddy	cb55fa3742	Enable CLI to register terminating gateways (#7500 ) * Enable CLI to register terminating gateways * Centralize gateway proxy configuration	2020-03-26 10:20:56 -06:00
Daniel Nephin	02cacf8128	Merge pull request #7498 from hashicorp/dnephin/small-cleanup envoy: small cleanup in cmd and server	2020-03-25 13:24:44 -04:00
Alejandro Baez	7d68d7eaa6	Add PolicyReadByName for API (#6615 )	2020-03-25 10:34:24 -04:00
Chris Piraino	0c5c97205f	Fix flakey health check reload test (#7490 ) This test would occasionally fail because we checked for a status of "critical" initially. This races with the actual healthcheck being run and declared passing. We instead use a ttl health check so that we don't rely on timing at all.	2020-03-25 09:09:13 -05:00
Daniel Nephin	f994bc9157	agent: Remove xdsServer field The field is only referenced from a single method, it can be a local var	2020-03-24 18:05:14 -04:00
Daniel Nephin	38ec02e022	dns: Remove a few unused params	2020-03-24 15:56:41 -04:00
Daniel Nephin	8b6e07d960	ci: Run all connect/ca tests from the integration suite To reduce the chance of some tests not being run because it does not match the regex passed to '-run'. Also document why some tests are allowed to be skipped on CI.	2020-03-24 15:22:01 -04:00
Daniel Nephin	dc983db333	ci: Do not skip tests because of missing binaries on CI If the CI environment is not correct for running tests the tests should fail, so that we don't accidentally stop running some tests because of a change to our CI environment. Also removed a duplicate declaration from init. I believe one was overriding the other as they are both in the same package.	2020-03-24 14:34:13 -04:00
Kim Ngo	9e8eb7896f	agent/xds: Update mesh gateway to use service router timeout (#7444 ) * website/connect/proxy/envoy: specify timeout precedence for services behind mesh gateway	2020-03-17 14:50:14 -05:00
Matt Keeler	58e2969fc1	Fix ACL mode advertisement and detection (#7451 ) These changes are necessary to ensure advertisement happens correctly even when datacenters are connected via network areas in Consul enterprise. This also changes how we check if ACLs can be upgraded within the local datacenter. Previously we would iterate through all LAN members. Now we just use the ServerLookup type to iterate through all known servers in the DC.	2020-03-16 12:54:45 -04:00
Freddy	8a7ff69b19	Update MSP token and filtering (#7431 )	2020-03-11 12:08:49 -06:00
Hans Hasselberg	6a55f70fa6	tls: remove old ciphers (#7282 ) Following advice from: https://github.com/ssllabs/research/wiki/SSL-and-TLS-Deployment-Best-Practices, this PR removes old ciphers.	2020-03-10 21:44:26 +01:00
R.B. Boyer	10d3ff9a4f	server: strip local ACL tokens from RPCs during forwarding if crossing datacenters (#7419 ) Fixes #7414	2020-03-10 11:15:22 -05:00
Kyle Havlovitz	520d464c85	Merge pull request #7373 from hashicorp/acl-segments-fix Add stub methods for ACL/segment bug fix from enterprise	2020-03-09 14:25:49 -07:00
R.B. Boyer	a7fb26f50f	wan federation via mesh gateways (#6884) This is like a Möbius strip of code due to the fact that low-level components (serf/memberlist) are connected to high-level components (the catalog and mesh-gateways) in a twisty maze of references which make it hard to dive into. With that in mind here's a high level summary of what you'll find in the patch: There are several distinct chunks of code that are affected: * new flags and config options for the server * retry join WAN is slightly different * retry join code is shared to discover primary mesh gateways from secondary datacenters * because retry join logic runs in the agent and the results of that operation for primary mesh gateways are needed in the server there are some methods like `RefreshPrimaryGatewayFallbackAddresses` that must occur at multiple layers of abstraction just to pass the data down to the right layer. * new cache type `FederationStateListMeshGatewaysName` for use in `proxycfg/xds` layers * the function signature for RPC dialing picked up a new required field (the node name of the destination) * several new RPCs for manipulating a FederationState object: `FederationState:{Apply,Get,List,ListMeshGateways}` * 3 read-only internal APIs for debugging use to invoke those RPCs from curl * raft and fsm changes to persist these FederationStates * replication for FederationStates as they are canonically stored in the Primary and replicated to the Secondaries. * a special derivative of anti-entropy that runs in secondaries to snapshot their local mesh gateway `CheckServiceNodes` and sync them into their upstream FederationState in the primary (this works in conjunction with the replication to distribute addresses for all mesh gateways in all DCs to all other DCs) * a "gateway locator" convenience object to make use of this data to choose the addresses of gateways to use for any given RPC or gossip operation to a remote DC. This gets data from the "retry join" logic in the agent and also directly calls into the FSM. * RPC (`:8300`) on the server sniffs the first byte of a new connection to determine if it's actually doing native TLS. If so it checks the ALPN header for protocol determination (just like how the existing system uses the type-byte marker). * 2 new kinds of protocols are exclusively decoded via this native TLS mechanism: one for ferrying "packet" operations (udp-like) from the gossip layer and one for "stream" operations (tcp-like). The packet operations re-use sockets (using length-prefixing) to cut down on TLS re-negotiation overhead. * the server instances specially wrap the `memberlist.NetTransport` when running with gateway federation enabled (in a `wanfed.Transport`). The general gist is that if it tries to dial a node in the SAME datacenter (deduced by looking at the suffix of the node name) there is no change. If dialing a DIFFERENT datacenter it is wrapped up in a TLS+ALPN blob and sent through some mesh gateways to eventually end up in a server's :8300 port. * a new flag when launching a mesh gateway via `consul connect envoy` to indicate that the servers are to be exposed. This sets a special service meta when registering the gateway into the catalog. * `proxycfg/xds` notice this metadata blob to activate additional watches for the FederationState objects as well as the location of all of the consul servers in that datacenter. * `xds:` if the extra metadata is in place additional clusters are defined in a DC to bulk sink all traffic to another DC's gateways. For the current datacenter we listen on a wildcard name (`server.<dc>.consul`) that load balances all servers as well as one mini-cluster per node (`<node>.server.<dc>.consul`) * the `consul tls cert create` command got a new flag (`-node`) to help create an additional SAN in certs that can be used with this flavor of federation.	2020-03-09 15:59:02 -05:00


1878 Commits