open-nomad

Commit Graph

Author	SHA1	Message	Date
Michael Schurter	8c332a3757	Merge pull request #7102 from hashicorp/test-limits Fix some race conditions and flaky tests	2020-02-13 10:19:11 -08:00
Seth Hoenig	7f33b92e0b	command: use consistent CONSUL_HTTP_TOKEN name Consul CLI uses CONSUL_HTTP_TOKEN, so Nomad should use the same. Note that consul-template uses CONSUL_TOKEN, which Nomad also uses, so be careful to preserve any reference to that in the consul-template context.	2020-02-12 10:42:33 -06:00
Seth Hoenig	0e44094d1a	client: enable configuring enable_tag_override for services Consul provides a feature of Service Definitions where the tags associated with a service can be modified through the Catalog API, overriding the value(s) configured in the agent's service configuration. To enable this feature, the flag enable_tag_override must be configured in the service definition. Previously, Nomad did not allow configuring this flag, and thus the default value of false was used. Now, it is configurable. Because Nomad itself acts as a state machine around the the service definitions of the tasks it manages, it's worth describing what happens when this feature is enabled and why. Consider the basic case where there is no Nomad, and your service is provided to consul as a boring JSON file. The ultimate source of truth for the definition of that service is the file, and is stored in the agent. Later, Consul performs "anti-entropy" which synchronizes the Catalog (stored only the leaders). Then with enable_tag_override=true, the tags field is available for "external" modification through the Catalog API (rather than directly configuring the service definition file, or using the Agent API). The important observation is that if the service definition ever changes (i.e. the file is changed & config reloaded OR the Agent API is used to modify the service), those "external" tag values are thrown away, and the new service definition is once again the source of truth. In the Nomad case, Nomad itself is the source of truth over the Agent in the same way the JSON file was the source of truth in the example above. That means any time Nomad sets a new service definition, any externally configured tags are going to be replaced. When does this happen? Only on major lifecycle events, for example when a task is modified because of an updated job spec from the 'nomad job run <existing>' command. Otherwise, Nomad's periodic re-sync's with Consul will now no longer try to restore the externally modified tag values (as long as enable_tag_override=true). Fixes #2057	2020-02-10 08:00:55 -06:00
Michael Schurter	65d38d9255	test: fix flaky TestHTTP_FreshClientAllocMetrics	2020-02-07 15:50:53 -08:00
Michael Schurter	9d3093fa31	test: fix missing agent shutdowns	2020-02-07 15:50:53 -08:00
Michael Schurter	d96ceee8c5	testagent: fix case where agent would retry forever	2020-02-07 15:50:53 -08:00
Michael Schurter	e903501e65	test: improve error messages when failing	2020-02-07 15:50:53 -08:00
Michael Schurter	63032917fc	test: allow goroutine to exit even if test blocks	2020-02-07 15:50:53 -08:00
Michael Schurter	9905dec6a3	test: workaround limits race	2020-02-07 15:50:53 -08:00
Michael Schurter	19a1932bbb	test: wait longer than timeout The 1s timeout raced with the 1s deadline it was trying to detect.	2020-02-07 15:50:53 -08:00
Michael Schurter	fd81208db7	test: fix flaky health test Test set Agent.client=nil which prevented the client from being shutdown. This leaked goroutines and could cause panics due to the leaked client goroutines logging after their parent test had finished. Removed ACLs from the server test because I couldn't get it to work with the test agent, and it tested very little.	2020-02-07 15:50:53 -08:00
Michael Schurter	2896f78f77	client: fix race accessing Node.status * Call Node.Canonicalize once when Node is created. * Lock when accessing fields mutated by node update goroutine	2020-02-07 15:50:47 -08:00
Drew Bailey	d830998572	agent Profile req nil check s.agent.Server() clean up logic and tests	2020-02-03 13:20:05 -05:00
Drew Bailey	c4f45f9bde	Fix panic when monitoring a local client node Fixes a panic when accessing a.agent.Server() when agent is a client instead. This pr removes a redundant ACL check since ACLs are validated at the RPC layer. It also nil checks the agent server and uses Client() when appropriate.	2020-02-03 13:20:04 -05:00
Seth Hoenig	78a7d1e426	comments: cleanup some leftover debug comments and such	2020-01-31 19:04:35 -06:00
Seth Hoenig	076cb4754e	agent: re-enable the server in dev mode	2020-01-31 19:04:19 -06:00
Seth Hoenig	8219c78667	nomad: handle SI token revocations concurrently Be able to revoke SI token accessors concurrently, and also ratelimit the requests being made to Consul for the various ACL API uses.	2020-01-31 19:04:14 -06:00
Seth Hoenig	2c7ac9a80d	nomad: fixup token policy validation	2020-01-31 19:04:08 -06:00
Seth Hoenig	9df33f622f	nomad: proxy requests for Service Identity tokens between Clients and Consul Nomad jobs may be configured with a TaskGroup which contains a Service definition that is Consul Connect enabled. These service definitions end up establishing a Consul Connect Proxy Task (e.g. envoy, by default). In the case where Consul ACLs are enabled, a Service Identity token is required for these tasks to run & connect, etc. This changeset enables the Nomad Server to recieve RPC requests for the derivation of SI tokens on behalf of instances of Consul Connect using Tasks. Those tokens are then relayed back to the requesting Client, which then injects the tokens in the secrets directory of the Task.	2020-01-31 19:03:53 -06:00
Seth Hoenig	f030a22c7c	command, docs: create and document consul token configuration for connect acls (gh-6716) This change provides an initial pass at setting up the configuration necessary to enable use of Connect with Consul ACLs. Operators will be able to pass in a Consul Token through `-consul-token` or `$CONSUL_TOKEN` in the `job run` and `job revert` commands (similar to Vault tokens). These values are not actually used yet in this changeset.	2020-01-31 19:02:53 -06:00
Michael Schurter	c82b14b0c4	core: add limits to unauthorized connections Introduce limits to prevent unauthorized users from exhausting all ephemeral ports on agents: * `{https,rpc}_handshake_timeout` * `{http,rpc}_max_conns_per_client` The handshake timeout closes connections that have not completed the TLS handshake by the deadline (5s by default). For RPC connections this timeout also separately applies to first byte being read so RPC connections with TLS enabled have `rpc_handshake_time * 2` as their deadline. The connection limit per client prevents a single remote TCP peer from exhausting all ephemeral ports. The default is 100, but can be lowered to a minimum of 26. Since streaming RPC connections create a new TCP connection (until MultiplexV2 is used), 20 connections are reserved for Raft and non-streaming RPCs to prevent connection exhaustion due to streaming RPCs. All limits are configurable and may be disabled by setting them to `0`. This also includes a fix that closes connections that attempt to create TLS RPC connections recursively. While only users with valid mTLS certificates could perform such an operation, it was added as a safeguard to prevent programming errors before they could cause resource exhaustion.	2020-01-30 10:38:25 -08:00
Mahmood Ali	9611324654	Merge pull request #6922 from hashicorp/b-alloc-canoncalize Handle Upgrades and Alloc.TaskResources modification	2020-01-28 15:12:41 -05:00
Mahmood Ali	90cae566e5	Merge pull request #6935 from hashicorp/b-default-preemption-flag scheduler: allow configuring default preemption for system scheduler	2020-01-28 15:11:06 -05:00
Mahmood Ali	af17b4afc7	Support customizing full scheduler config	2020-01-28 14:51:42 -05:00
Nick Ethier	5636203d4e	consul: fix var name from rebase	2020-01-27 14:00:19 -05:00
Nick Ethier	0ae99b3c9c	consul: fix var name from rebase	2020-01-27 12:55:52 -05:00
Nick Ethier	5cbb94e16e	consul: add support for canary meta	2020-01-27 09:53:30 -05:00
Danielle	5fd52171aa	cli: add system command and subcmds to interact with system API. (#6924 ) cli: add system command and subcmds to interact with system API.	2020-01-13 16:16:08 +01:00
Mahmood Ali	1ab682f622	scheduler: allow configuring default preemption for system scheduler Some operators want a greater control over when preemption is enabled, especially during an upgrade to limit potential side-effects.	2020-01-13 08:30:49 -05:00
James Rasell	4e48217a4e	cli: add system command and subcmds to interact with system API. The system command includes gc and reconcile-summaries subcommands which covers all currently available system API calls. The help information is largely pulled from the current Nomad website API documentation.	2020-01-13 11:34:46 +01:00
Drew Bailey	f97d2e96c1	refactor api profile methods comment why we ignore errors parsing params	2020-01-09 15:15:12 -05:00
Drew Bailey	b702dede49	adds qc param, address pr feedback	2020-01-09 15:15:11 -05:00
Drew Bailey	085659f6ff	condense table test	2020-01-09 15:15:10 -05:00
Drew Bailey	45210ed901	Rename profile package to pprof Address pr feedback, rename profile package to pprof to more accurately describe its purpose. Adds gc param for heap lookup profiles.	2020-01-09 15:15:10 -05:00
Drew Bailey	1b8af920f3	address pr feedback	2020-01-09 15:15:09 -05:00
Drew Bailey	4ced73875b	leave acl checking to rpc endpoints fix test expectation test wrapNonJSON	2020-01-09 15:15:08 -05:00
Drew Bailey	279512c7f8	provide helpful error, cleanup logic	2020-01-09 15:15:08 -05:00
Drew Bailey	7bbba613a5	prevent doubly wrapping with rpc error	2020-01-09 15:15:07 -05:00
Drew Bailey	fd42020ad6	RPC server EnableDebug option Passes in agent enable_debug config to nomad server and client configs. This allows for rpc endpoints to have more granular control if they should be enabled or not in combination with ACLs. enable debug on client test	2020-01-09 15:15:07 -05:00
Drew Bailey	9a80938fb1	region forwarding; prevent recursive forwards for impossible requests prevent region forwarding loop, backfill tests fix failing test	2020-01-09 15:15:06 -05:00
Drew Bailey	46121fe3fd	move shared structs out of client and into nomad	2020-01-09 15:15:05 -05:00
Drew Bailey	3672414888	test pprof headers and profile methods tidy up, add comments clean up seconds param assignment	2020-01-09 15:15:04 -05:00
Drew Bailey	fc37448683	warn when enabled debug is on when registering m -> a receiver name return codederrors, fix query	2020-01-09 15:15:04 -05:00
Drew Bailey	62eb2d76a6	acl and debug test table rename implementation method	2020-01-09 15:15:03 -05:00
Drew Bailey	50288461c9	Server request forwarding for Agent.Profile Return rpc errors for profile requests, set up remote forwarding to target leader or server id for profile requests. server forwarding, endpoint tests	2020-01-09 15:15:03 -05:00
Drew Bailey	901f362858	test for known pprof endpoints	2020-01-09 15:15:02 -05:00
Drew Bailey	49ad5fbc85	agent pprof endpoints wip, agent endpoint and client endpoint for pprof profiles agent endpoint test	2020-01-09 15:15:02 -05:00
Mahmood Ali	a2e181dd45	CLI: protect against AllocatedResources being nil	2020-01-08 17:22:05 -05:00
Charlie Voiselle	5298fee5d6	Typo fix Synopsis needs to start with uppercase to match other commands	2020-01-08 10:44:00 -05:00
James Rasell	f2d1e45135	cli: include namespace in output when querying job stauts. (#6912 )	2020-01-08 08:24:03 -05:00
Michael Schurter	571ed261c8	Merge pull request #6898 from hashicorp/hicks/fix-typo Fix typo, Ethier -> Either	2020-01-02 14:52:18 -08:00
Kris Hicks	7fef7508cb	Fix typo, Ethier -> Either	2020-01-02 14:42:27 -08:00
Charlie Voiselle	fd3bf5f971	cli: Allow user to specify dest filename for nomad init (#6520 ) * Allow user to specify dest filename for nomad init * Create changelog entry for GH-6520	2019-12-19 14:59:12 -05:00
Drew Bailey	8e59e91991	Merge pull request #6746 from hashicorp/f-shutdown-delay-tg Group shutdown_delay	2019-12-18 16:01:30 -05:00
Lang Martin	06f441f562	test: quota: relax multierror message matching to Contains	2019-12-17 13:20:14 -05:00
Lang Martin	fb6c27b828	test: build quota_apply_test, remove the tests that require ent	2019-12-17 13:20:14 -05:00
Drew Bailey	d9e41d2880	docs for shutdown delay update docs, address pr comments ensure pointer is not nil use pointer for diff tests, set vs unset	2019-12-16 11:38:35 -05:00
Drew Bailey	24929776a2	shutdown delay for task groups copy struct values ensure groupserviceHook implements RunnerPreKillhook run deregister first test that shutdown times are delayed move magic number into variable	2019-12-16 11:38:16 -05:00
Mahmood Ali	76be9b4afb	cli: sequence cli.Ui operations Fixes a bug where if a command flag parsing errors, the resulting error and help usage messages get interleaved in unexpected and non-user friendly way. The reason is that we have flag parsing library effectively writes to ui.Error in a goroutine. This is problematic: first, we lose the sequencing between help usage and error message; second, cli.Ui methods are not concurrent safe. Here, we introduce a custom error writer that buffers result and calls ui.Error() in the write method and in the same goroutine. For context, we need to wrap ui.Error because it's line-oriented, while flags library expects a io.Writer which is bytes oriented.	2019-12-16 10:08:17 -05:00
Danielle	246a4e898b	Merge pull request #6828 from hashicorp/b/nomad-monitor-panic command: error when no node is found for `monitor`	2019-12-10 14:29:32 +01:00
Danielle Lancashire	cd764ab0e9	command: error when no node is found for `monitor` Currently `nomad monitor -node-id` will panic when a node-id does not match any nodes, as there is no empty result bounds checking. Here we return an error to the user when no nodes are found.	2019-12-10 13:10:47 +01:00
Seth Hoenig	f0c3dca49c	tests: swap lib/freeport for tweaked helper/freeport Copy the updated version of freeport (sdk/freeport), and tweak it for use in Nomad tests. This means staying below port 10000 to avoid conflicts with the lib/freeport that is still transitively used by the old version of consul that we vendor. Also provide implementations to find ephemeral ports of macOS and Windows environments. Ports acquired through freeport are supposed to be returned to freeport, which this change now also introduces. Many tests are modified to include calls to a cleanup function for Server objects. This should help quite a bit with some flakey tests, but not all of them. Our port problems will not go away completely until we upgrade our vendor version of consul. With Go modules, we'll probably do a 'replace' to swap out other copies of freeport with the one now in 'nomad/helper/freeport'.	2019-12-09 08:37:32 -06:00
Michael Schurter	3008473f9b	Merge branch 'master' into release-0102	2019-12-04 14:13:34 -08:00
Mahmood Ali	7b8cfee162	tests: deflake TestHTTP_FreshClientAllocMetrics The test asserts that alloc counts get reported accurately in metrics by inspecting the metrics endpoint directly. Sadly, the metrics as collected by `armon/go-metrics` seem to be stateful and may contain info from other tests. This means that the test can fail depending on the order of returned metrics. Inspecting the metrics output of one failing run, you can see the duplicate guage entries but for different node_ids: ``` { "Name": "service-name.default-0a3ba4b6-2109-485e-be74-6864228aed3d.client.allocations.terminal", "Value": 10, "Labels": { "datacenter": "dc1", "node_class": "none", "node_id": "67402bf4-00f3-bd8d-9fa8-f4d1924a892a" } }, { "Name": "service-name.default-0a3ba4b6-2109-485e-be74-6864228aed3d.client.allocations.terminal", "Value": 0, "Labels": { "datacenter": "dc1", "node_class": "none", "node_id": "a2945b48-7e66-68e2-c922-49b20dd4e20c" } }, ```	2019-11-22 18:41:21 -05:00
Nomad Release bot	db6420367d	Generate files for 0.10.2-rc1 release	2019-11-22 18:42:49 +00:00
Drew Bailey	b45ce9e997	add server-id to -h output	2019-11-21 16:04:28 -05:00
Drew Bailey	b3765b06ea	add server-id to -h output	2019-11-21 16:01:09 -05:00
Drew Bailey	6d5156bbba	Allows a node uuid prefix to be passed in	2019-11-21 15:15:41 -05:00
Drew Bailey	7ca6dbe61e	Allows a node uuid prefix to be passed in	2019-11-21 14:51:48 -05:00
Lang Martin	069e9a624b	command: quota init writes files with a network limit	2019-11-20 17:59:55 -06:00
Lang Martin	d2fc279af4	command: quota status reports network usage	2019-11-20 17:59:34 -06:00
Lang Martin	f45bebdb66	command: quota init writes files with a network limit	2019-11-20 18:44:06 -05:00
Lang Martin	2e2c662977	command: quota status reports network usage	2019-11-20 18:44:06 -05:00
Michael Schurter	48239d7f2e	Merge pull request #6017 from hashicorp/f-policy-json api: Add parsed rules to policy response	2019-11-20 15:31:03 -08:00
Mahmood Ali	be6c60455e	Merge pull request #6669 from hashicorp/b-cors-allow-credentials Allow UI to query client directly for task logs/state	2019-11-20 15:14:01 -05:00
Buck Doyle	db77a24ed3	Merge branch 'master' into f-policy-json	2019-11-20 11:20:07 -06:00
Michael Schurter	ecf970b5a5	Merge pull request #6370 from pmcatominey/tls-server-name command: add -tls-server-name flag	2019-11-20 08:44:54 -08:00
Preetha	42c1c85285	Merge pull request #6421 from hashicorp/b-acl-bootstrap-codes api: acl bootstrap errors aren't 500	2019-11-20 10:36:08 -06:00
Preetha	be4a51d5b8	Merge pull request #6349 from hashicorp/b-host-stats client: Return empty values when host stats fail	2019-11-20 10:13:02 -06:00
Buck Doyle	7e3188a4ea	CLI: Remove duplicated error output (#6738 )	2019-11-19 16:05:53 -06:00
Mahmood Ali	97974c4b13	Merge pull request #6684 from hashicorp/b-nomad-exec-stdout-tty nomad exec: check stdout for tty as well	2019-11-19 15:55:21 -05:00
Mahmood Ali	6f8bb5e90b	api: acl bootstrap errors aren't 500 Noticed that ACL endpoints return 500 status code for user errors. This is confusing and can lead to false monitoring alerts. Here, I introduce a concept of RPCCoded errors to be returned by RPC that signal a code in addition to error message. Codes for now match HTTP codes to ease reasoning. ``` $ nomad acl bootstrap Error bootstrapping: Unexpected response code: 500 (ACL bootstrap already done (reset index: 9)) $ nomad acl bootstrap Error bootstrapping: Unexpected response code: 400 (ACL bootstrap already done (reset index: 9)) ```	2019-11-19 15:51:57 -05:00
Tim Gross	1210261fe2	hclfmt nomad jobspecs (#6724 )	2019-11-19 10:36:41 -05:00
Nick Ethier	bd454a4c6f	client: improve group service stanza interpolation and check_re… (#6586 ) * client: improve group service stanza interpolation and check_restart support Interpolation can now be done on group service stanzas. Note that some task runtime specific information that was previously available when the service was registered poststart of a task is no longer available. The check_restart stanza for checks defined on group services will now properly restart the allocation upon check failures if configured.	2019-11-18 13:04:01 -05:00
Drew Bailey	9b63828658	serverID to target remote leader or server handle the case where we request a server-id which is this current server update docs, error on node and server id params more accurate names for tests use shared no leader err, formatting rm bad comment remove redundant variable	2019-11-14 10:07:35 -05:00
Drew Bailey	b644e1f47d	add server-id to monitor specific server	2019-11-14 09:53:41 -05:00
Drew Bailey	acd97d0731	Merge pull request #6670 from hashicorp/api/fallthrough-test test rootfallthrough handler	2019-11-13 10:51:31 -05:00
Lars Lehtonen	1dbf44bc40	command/agent: Prune Dead Code (#6682 ) * remove unused MockPeriodicJob() from tests * remove unused getIndex() from tests * remove unused checkIndex() from tests * remove unused assertIndex() from tests * remove unused Agent.findLoopbackDevice()	2019-11-13 08:20:01 -05:00
Lars Lehtonen	e85509c466	command: error handling before file close (#6681 )	2019-11-13 08:18:20 -05:00
Drew Bailey	f5310ff63f	fix so assertions are test case driven	2019-11-12 14:28:21 -05:00
Mahmood Ali	591cb75ee4	nomad exec: check stdout for tty as well When inferring whether to use TTY, check both stdin and stdout are terminals. Otherwise, we get failures like the following: ``` $ nomad alloc exec --job example echo hi hi $ echo \| nomad alloc exec --job example echo hi hi $ nomad alloc exec --job example echo hi \| head -n1 failed to exec into task: not a terminal ```	2019-11-12 11:39:06 -05:00
Lars Lehtonen	98d3e47b32	command: fix TestHelpers_LineLimitReader_TimeLimit() goroutine (#6678 )	2019-11-12 08:35:11 -05:00
Charlie Voiselle	835831a3d8	Added service wrapper code (#6220 ) This is the basic code to add the Windows Service Manager hooks to Nomad. Includes vendoring golang.org/x/sys/windows/svc and added Docs: * guide for installing as a windows service. * configuration for logging to file from PR #6429	2019-11-11 15:16:07 -05:00
Drew Bailey	f989f38594	test /ui/ path	2019-11-11 12:12:42 -05:00
Drew Bailey	a0548824f3	test rootfallthrough handler	2019-11-11 12:08:44 -05:00
Mahmood Ali	b2145f2d02	Allow UI to query client directly Nomad web UI currently fails when querying client nodes for allocation state end endpoints, due to CORS policy. The issue is that CORS requests that are marked `withCredentials` need the http server to include a `Access-Control-Allow-Credentials` [1]. But Nomad Task Logs and filesystem requests include authenticating information and thus marked with `credentials=true`[2][3]. It's worth noting that the browser currently sends credentials and authentication token to servers anyway; it's just that the response is not made available to caller nomad ui javascript. For task logs specifically, nomad ui retries again by querying the web ui address (typically pointing to a nomad server) which will forward the request to the nomad client agent appropriately. [1] https://developer.mozilla.org/en-US/docs/Web/HTTP/Headers/Access-Control-Allow-Credentials [2] `101d0373ee/ui/app/components/task-log.js (L50)` [3] `101d0373ee/ui/app/services/token.js (L25-L39)`	2019-11-11 15:13:30 +00:00
Lars Lehtonen	08d5342812	command/agent: TestAgent_ServerConfig() fix dropped errors (#6659 )	2019-11-11 09:46:46 -05:00
Drew Bailey	04439a5a78	better func name, swap conditional	2019-11-11 08:35:56 -05:00
Drew Bailey	c85df2dac7	returns a 404 if not found instead of redirect to ui	2019-11-08 15:34:35 -05:00
Tim Gross	4909adb32c	fix broken test expectation from message change (#6635 )	2019-11-06 16:33:13 -05:00
Drew Bailey	7b2ad28ef6	unlock before returning, no need for label comment, trigger build return length written	2019-11-05 11:44:29 -05:00
Drew Bailey	d3b48a3e45	simplify logch goroutine	2019-11-05 11:44:28 -05:00
Drew Bailey	df57f70a68	wireup plain=true\|false query param	2019-11-05 11:44:28 -05:00
Drew Bailey	f4a7e3dc75	coordinate closing of doneCh, use interface to simplify callers comments	2019-11-05 11:44:26 -05:00
Drew Bailey	fe542680dc	log-json -> json fix typo command/agent/monitor/monitor.go Co-Authored-By: Chris Baker <1675087+cgbaker@users.noreply.github.com> Update command/agent/monitor/monitor.go Co-Authored-By: Chris Baker <1675087+cgbaker@users.noreply.github.com> address feedback, lock to prevent send on closed channel fix lock/unlock for dropped messages	2019-11-05 09:51:59 -05:00
Drew Bailey	84c8e79f90	simplify assert message	2019-11-05 09:51:56 -05:00
Drew Bailey	8726b685de	address feedback	2019-11-05 09:51:56 -05:00
Drew Bailey	e4b3e1d7d4	allow more time for streaming message remove unused struct	2019-11-05 09:51:55 -05:00
Drew Bailey	318b6c91bf	monitor command takes no args rm extra new line fix lint errors return after close fix, simplify test	2019-11-05 09:51:55 -05:00
Drew Bailey	0e759c401c	moving endpoints over to frames	2019-11-05 09:51:54 -05:00
Drew Bailey	c7b633b6c1	lock in sub select rm redundant lock wip to use framing wip switch to stream frames	2019-11-05 09:51:54 -05:00
Drew Bailey	fb23c1325d	fix deadlock issue, switch to frames envelope	2019-11-05 09:51:54 -05:00
Drew Bailey	32f62edbb0	return 400 if invalid log_json param is given Addresses feedback around monitor implementation subselect on stopCh to prevent blocking forever. Set up a separate goroutine to check every 3 seconds for dropped messages. rename returned ch to avoid confusion	2019-11-05 09:51:53 -05:00
Drew Bailey	17d876d5ef	rename function, initialize log level better underscores instead of dashes for query params	2019-11-05 09:51:53 -05:00
Drew Bailey	8e3915c7fc	use channel instead of empty string to determine close	2019-11-05 09:51:52 -05:00
Drew Bailey	da6229d704	update go-hclog dep remove duplicate lock	2019-11-05 09:51:52 -05:00
Drew Bailey	db65b1f4a5	agent:read acl policy for monitor	2019-11-05 09:51:52 -05:00
Drew Bailey	f46fd5b3e1	only look up rpchandler for node if we have nodeid fix some comments and nomad monitor -h output	2019-11-05 09:51:51 -05:00
Drew Bailey	3b9c33a5f0	new hclog with standardlogger intercept	2019-11-05 09:51:49 -05:00
Drew Bailey	a45ae1cd58	enable json formatting, use queryoptions	2019-11-05 09:51:49 -05:00
Drew Bailey	786989dbe3	New monitor pkg for shared monitor functionality Adds new package that can be used by client and server RPC endpoints to facilitate monitoring based off of a logger clean up old code small comment about write rm old comment about minsize rename to Monitor Removes connection logic from monitor command Keep connection logic in endpoints, use a channel to send results from monitoring use new multisink logger and interfaces small test for dropped messages update go-hclogger and update sink/intercept logger interfaces	2019-11-05 09:51:49 -05:00
Drew Bailey	e076204820	get local rpc endpoint working	2019-11-05 09:51:48 -05:00
Drew Bailey	976c43157c	remove log_writer prefix output with proper spacing update gzip handler, adjust first byte flow to allow gzip handler bypass wip, first stab at wiring up rpc endpoint	2019-11-05 09:51:48 -05:00
Drew Bailey	0de94466b2	Display error when remote side ended monitor multisink logger remove usage of logwriter	2019-11-05 09:51:48 -05:00
Drew Bailey	f60e44afc7	Adds nomad monitor command Adds nomad monitor command. Like consul monitor, this command allows you to stream logs from a nomad agent in real time with a a specified log level add endpoint tests Upgrade go-hclog to latest version The current version of go-hclog pads log prefixes to equal lengths so info becomes [INFO ] and debug becomes [DEBUG]. This breaks hashicorp/logutils/level.go Check function. Upgrading to the latest version removes this padding and fixes log filtering that uses logutils Check	2019-11-05 09:51:47 -05:00
Drew Bailey	b386119d15	Add Agent Monitor to receive streaming logs Queries /v1/agent/monitor and receives streaming logs from client	2019-11-05 09:51:47 -05:00
Drew Bailey	b0184e2032	Adds AgentMonitor Endpoint AgentMonitor is an endpoint to stream logs for a given agent. It allows callers to pass in a supplied log level, which may be different than the agents config allowing for temporary debugging with lower log levels. Pass in logWriter when setting up Agent	2019-11-05 09:51:46 -05:00
Drew Bailey	3a11f1f23a	Merge pull request #6609 from hashicorp/b-alloc-status-consistency Prevent nomad alloc status output inconsistency	2019-11-04 10:12:04 -05:00
Drew Bailey	a7adc54235	Prevent nomad alloc status output inconsistency Prevent random map ordering and sort alphabetically better variable name	2019-11-01 14:01:32 -04:00
Michael Schurter	9fed8d1bed	client: fix panic from 0.8 -> 0.10 upgrade makeAllocTaskServices did not do a nil check on AllocatedResources which causes a panic when upgrading directly from 0.8 to 0.10. While skipping 0.9 is not supported we intend to fix serious crashers caused by such upgrades to prevent cluster outages. I did a quick audit of the client package and everywhere else that accesses AllocatedResources appears to be properly guarded by a nil check.	2019-11-01 07:47:03 -07:00
Mahmood Ali	3f6e50617a	Merge pull request #6047 from hashicorp/b-ignore-server-if-disabled Only warn against BootstrapExpect set in CLI flag	2019-10-29 10:55:44 -04:00
Lang Martin	aa77ea4032	quota: parse network stanza in quotas (#6511 )	2019-10-24 10:41:54 -04:00
Michael Schurter	39437a5c5b	Merge branch 'master' into release-0100	2019-10-22 08:17:57 -07:00
Nomad Release bot	3e6c9dd40e	Generate files for 0.10.0 release	2019-10-22 12:34:56 +00:00
Seth Hoenig	8b03477f46	Merge pull request #6448 from hashicorp/f-set-connect-sidecar-tags connect: enable setting tags on consul connect sidecar service in job…	2019-10-17 15:14:09 -05:00
Seth Hoenig	039fbd3f3b	connect: enable setting tags on consul connect sidecar service in jobspec (#6415 )	2019-10-17 19:25:20 +00:00
Mahmood Ali	61e66cb077	Merge pull request #6427 from hashicorp/b-fs-endpoint-errors agent: report fs log errors as http errors	2019-10-15 20:12:59 -04:00
Mahmood Ali	88f8127820	tests: avoid using unnecessary pipe	2019-10-15 17:22:03 -04:00
Mahmood Ali	e6d5635e1a	Merge pull request #6425 from hashicorp/f-cli-show-full-ids cli: show full id for single node or alloc status	2019-10-15 10:54:25 -04:00
Danielle	fee482ae6c	Merge pull request #6331 from hashicorp/dani/f-volume-mount-propagation volumes: Add support for mount propagation	2019-10-14 14:29:40 +02:00
Danielle Lancashire	4fbcc668d0	volumes: Add support for mount propagation This commit introduces support for configuring mount propagation when mounting volumes with the `volume_mount` stanza on Linux targets. Similar to Kubernetes, we expose 3 options for configuring mount propagation: - private, which is equivalent to `rprivate` on Linux, which does not allow the container to see any new nested mounts after the chroot was created. - host-to-task, which is equivalent to `rslave` on Linux, which allows new mounts that have been created _outside of the container_ to be visible inside the container after the chroot is created. - bidirectional, which is equivalent to `rshared` on Linux, which allows both the container to see new mounts created on the host, but importantly _allows the container to create mounts that are visible in other containers an don the host_ private and host-to-task are safe, but bidirectional mounts can be dangerous, as if the code inside a container creates a mount, and does not clean it up before tearing down the container, it can cause bad things to happen inside the kernel. To add a layer of safety here, we require that the user has ReadWrite permissions on the volume before allowing bidirectional mounts, as a defense in depth / validation case, although creating mounts should also require a priviliged execution environment inside the container.	2019-10-14 14:09:58 +02:00
Danielle	2640155ae5	Merge pull request #6429 from hashicorp/f-log-to-file Add support for logging to a file	2019-10-11 13:35:39 +02:00
Nomad Release bot	3007f1662e	Generate files for 0.10.0-rc1 release	2019-10-10 19:08:23 +00:00
Danielle Lancashire	5cedf6d024	logging: Correctly track number of written bytes Currently this assumes that a short write will never happen. While these are improbable in a case where rotation being off a few bytes would matter, this now correctly tracks the number of written bytes.	2019-10-10 14:02:14 +02:00
Danielle Lancashire	b67215d4f8	logging: Sort files when pruning old logs Currently this logging implementation is dependent on the order of files as returned by filepath.Glob, which although internal methods are documented to be lexographical, does not publicly document this. Here we defensively resort.	2019-10-10 13:51:16 +02:00
Mahmood Ali	4b2ba62e35	acl: check ACL against object namespace Fix a bug where a millicious user can access or manipulate an alloc in a namespace they don't have access to. The allocation endpoints perform ACL checks against the request namespace, not the allocation namespace, and performs the allocation lookup independently from namespaces. Here, we check that the requested can access the alloc namespace regardless of the declared request namespace. Ideally, we'd enforce that the declared request namespace matches the actual allocation namespace. Unfortunately, we haven't documented alloc endpoints as namespaced functions; we suspect starting to enforce this will be very disruptive and inappropriate for a nomad point release. As such, we maintain current behavior that doesn't require passing the proper namespace in request. A future major release may start enforcing checking declared namespace.	2019-10-08 12:59:22 -04:00
Mahmood Ali	3c0d8c7611	Merge pull request #6441 from hashicorp/b-agent-token Redact replication tokens in /agent/self	2019-10-08 12:55:45 -04:00
Danielle Lancashire	9eaac48f25	agent: Refactor log setup to support log-to-file	2019-10-07 14:42:32 +02:00
Danielle Lancashire	442f4888b3	agent: Introduce File Logger This commit introduces a rotating file logger for Nomad Agent Logs. The logger implementation itself is a lift and shift from Consul, with tests updated to fit with the Nomad pattern of using require, and not having a testutil for creating tempdirs cleanly.	2019-10-07 14:37:31 +02:00
Danielle Lancashire	d3614ea0a8	config: Add required configuration for logging to a file	2019-10-07 14:16:59 +02:00

1 2 3 4 5 ...

2583 Commits