open-nomad

Author	SHA1	Message	Date
Lang Martin	8fe9699e51	client_test cleanup comments from review	2019-04-11 09:56:22 -04:00
Lang Martin	63c993c8ae	fix client-test, avoid hardwired platform dependecy on lo0	2019-04-11 09:56:22 -04:00
Lang Martin	a9db848974	refactor error in client fingerprint to include the offending data	2019-04-11 09:56:22 -04:00
Lang Martin	f211500cea	add client updateNodeResources to merge but preserve manual config	2019-04-11 09:56:22 -04:00
Lang Martin	a4b59130d2	test that fingerprint resources are updated, net not clobbered	2019-04-11 09:56:21 -04:00
Danielle Lancashire	e135876493	allocs: Add nomad alloc restart This adds a `nomad alloc restart` command and api that allows a job operator with the alloc-lifecycle acl to perform an in-place restart of a Nomad allocation, or a given subtask.	2019-04-11 14:25:49 +02:00
Chris Baker	829a972693	vault client test: minor formatting vendor: using upstream circonus-gometrics	2019-04-10 10:34:10 -05:00
Chris Baker	c0a7aee610	vault e2e: pass vault version into setup instead of having to infer it from test name	2019-04-10 10:34:10 -05:00
Chris Baker	f0c184fc29	taskrunner: removed some unecessary config from a test	2019-04-10 10:34:10 -05:00
Chris Baker	a26d4fe1e5	docs: -vault-namespace, VAULT_NAMESPACE, and config agent: added VAULT_NAMESPACE env-based configuration	2019-04-10 10:34:10 -05:00
Chris Baker	170f5239c8	client: gofmt	2019-04-10 10:34:10 -05:00
Chris Baker	a1d7971b2e	taskrunner: pass configured Vault namespace into TaskTemplateConfig	2019-04-10 10:34:10 -05:00
Chris Baker	0eaeef872f	config/docs: added `namespace` to vault config server/client: process `namespace` config, setting on the instantiated vault client	2019-04-10 10:34:10 -05:00
Michael Schurter	45b4827ad7	Bump to 0.9.1-dev	2019-04-09 09:01:48 -07:00
Nomad Release bot	e307734e4a	Generate files for 0.9.0 release	2019-04-09 01:56:00 +00:00
Michael Schurter	f7d4428855	client: simplify kill logic Remove runLaunched tracking as Run is always called for killable TaskRunners. TaskRunners which fail before Run can be called (during NewTaskRunner or Restore) are not killable as they're never added to the client's alloc map.	2019-04-04 15:18:33 -07:00
Michael Schurter	3af602b633	Remove 0.9.0-rc2 generated files	2019-04-03 07:41:09 -07:00
Nomad Release bot	16b4336ccf	Generate files for 0.9.0-rc2 release	2019-04-03 01:54:29 +00:00
Michael Schurter	923cd91850	Merge pull request #5504 from hashicorp/b-exec-path executor/linux: make chroot binary paths absolute	2019-04-02 14:09:50 -07:00
Michael Schurter	1d569a27dc	Revert "executor/linux: add defensive checks to binary path" This reverts commit cb36f4537e63d53b198c2a87d1e03880895631bd.	2019-04-02 11:17:12 -07:00
Michael Schurter	fc5487dbbc	executor/linux: add defensive checks to binary path	2019-04-02 09:40:53 -07:00
Michael Schurter	7d49bc4c71	executor/linux: make chroot binary paths absolute Avoid libcontainer.Process trying to lookup the binary via $PATH as the executor has already found where the binary is located.	2019-04-01 15:45:31 -07:00
Mahmood Ali	81f4f07ed7	rename fifo methods for clarity	2019-04-01 16:52:58 -04:00
Mahmood Ali	e87afe465b	clarify closeDone blocking and field name	2019-04-01 16:10:34 -04:00
Mahmood Ali	9d647713c0	no requires in a test goroutine	2019-04-01 15:38:39 -04:00
Mahmood Ali	2b1f858e1b	log when fifo fails to open	2019-04-01 13:18:03 -04:00
Mahmood Ali	967452a3f0	fifo: Use plain fifo file in Unix This PR switches to using plain fifo files instead of golang structs managed by containerd/fifo library. The library main benefit is management of opening fifo files. In Linux, a reader `open()` request would block until a writer opens the file (and vice-versa). The library uses goroutines so that it's the first IO operation that blocks. This benefit isn't really useful for us: Given that logmon simply streams output in a separate process, blocking of opening or first read is effectively the same. The library additionally makes further complications for managing state and tracking read/write permission that seems overhead for our use, compared to using a file directly. Looking here, I made the following incidental changes: * document that we do handle if fifo files are already created, as we rely on that behavior for logmon restarts * use type system to lock read vs write: currently, fifo library returns `io.ReadWriteCloser` even if fifo is opened for writing only!	2019-04-01 13:18:03 -04:00
Michael Schurter	a4572919cd	Merge pull request #5456 from hashicorp/test-taskenv tests: port pre-0.9 task env tests	2019-03-25 10:41:38 -07:00
Michael Schurter	8efad12538	tests: port pre-0.9 task env tests I chose to make them more of integration tests since there's a lot more plumbing involved. The internal implementation details of how we craft task envs can now change and these tests will still properly assert the task runtime environment is setup properly.	2019-03-25 09:46:53 -07:00
Michael Schurter	9afbc45cff	Bump to dev post-0.9.0-rc1 release	2019-03-22 08:26:30 -07:00
Nomad Release bot	3ab3dd4105	Generate files for 0.9.0-rc1 release	2019-03-21 19:06:13 +00:00
Mahmood Ali	b08a2744f8	Merge pull request #5428 from hashicorp/b-dropped-logs-on-task-restart client/logmon: restart log collection correctly when a task is restarted	2019-03-21 14:02:08 -04:00
Mahmood Ali	729458f110	fix TestLogmon_Start_restart	2019-03-21 13:36:46 -04:00
Nick Ethier	b252d712df	logmon: fix test assertion	2019-03-20 21:37:17 -04:00
Nick Ethier	c1f5011181	logmon: remove sleeps from tests	2019-03-20 10:45:09 -04:00
Nick Ethier	e14041bdec	logmon: add tests for rotation and open/closing of fifos	2019-03-19 14:41:23 -04:00
Nick Ethier	dc18b8928a	logmon: make Start rpc idempotent and simplify hook	2019-03-19 14:02:36 -04:00
Nick Ethier	ac7fbee1b8	logmon:add static check for logmon exited hook	2019-03-18 15:59:43 -04:00
Nick Ethier	7dc3d83634	client/logmon: restart log collection correctly when a task is restarted	2019-03-15 23:59:18 -04:00
Mahmood Ali	fb55717b0c	Regenerate Proto files (#5421 ) Noticed that the protobuf files are out of sync with ones generated by 1.2.0 protoc go plugin. The cause for these files seem to be related to release processes, e.g. [0.9.0-beta1 preperation](`ecec3d38de (diff-da4da188ee496377d456025c2eab4e87)`), and [0.9.0-beta3 preperation](`b849d84f2f`). This restores the changes to that of the pinned protoc version and fails build if protobuf files are out of sync. Sample failing Travis job is that of the first commit change: https://travis-ci.org/hashicorp/nomad/jobs/506285085	2019-03-14 10:56:27 -04:00
Michael Schurter	b126e9eec4	Merge pull request #5386 from hashicorp/b-logmon-stop Fix task/logmon leak after crash	2019-03-12 15:23:02 -07:00
Michael Schurter	0ba1a5251b	client: cleanup and document context uses Some of the context uses in TR hooks are useless (Killed during Stop never seems meaningful). None of the hooks are interruptable for graceful shutdown which is unfortunate and probably needs fixing.	2019-03-12 15:03:54 -07:00
Mahmood Ali	8deb532be2	run TestAllocations_Stats in CI	2019-03-08 07:57:37 -05:00
Michael Schurter	32d31575cc	client: emit event and call exited hooks during cleanup Builds upon earlier commit that cleans up restored handles of terminal allocs by also emitting terminated events and calling exited hooks when appropriate.	2019-03-05 15:12:02 -08:00
Michael Schurter	a4bc46b6e6	test: fix NewMemDB API change	2019-03-04 13:37:20 -08:00
Michael Schurter	64e145ebdb	logmon: drop reattach log level as its expected Logged once per terminal task on agent restart.	2019-03-04 13:26:01 -08:00
Michael Schurter	c5271d3fa5	client: test logmon cleanup The test is sadly quite complicated and peeks into things (logmon's reattach config) AR doesn't normally have access to. However, I couldn't find another way of asserting logmon got cleaned up without resorting to smaller unit tests. Smaller unit tests risk re-implementing dependencies in an unrealistic way, so I opted for an ugly integration test.	2019-03-04 13:15:15 -08:00
Preetha Appan	0e547d29ad	s/mananger/manager	2019-03-04 12:25:54 -06:00
Michael Schurter	ef8d284352	client: ensure task is cleaned up when terminal This commit is a significant change. TR.Run is now always executed, even for terminal allocations. This was changed to allow TR.Run to cleanup (run stop hooks) if a handle was recovered. This is intended to handle the case of Nomad receiving a DesiredStatus=Stop allocation update, persisting it, but crashing before stopping AR/TR. The commit also renames task runner hook data as it was very easy to accidently set state on Requests instead of Responses using the old field names.	2019-03-01 14:00:23 -08:00
Michael Schurter	3f386e3951	Remove generated files for 0.9.0-beta3	2019-02-26 10:34:08 -08:00
Michael Schurter	d74755900e	Generate files for 0.9.0-beta3 release	2019-02-26 09:44:49 -08:00
Michael Schurter	812f1679e2	Merge pull request #5352 from hashicorp/b-leaked-logmon logmon fixes	2019-02-26 08:35:46 -08:00
Michael Schurter	e39a10a1f4	tests: move unix-specific test to its own file Other logmon tests should be portable.	2019-02-26 07:56:44 -08:00
Mahmood Ali	45b6392d4e	tests: port some fingerprint tests from 0.8 (#5359 ) Port some integration tests of driver fingerprinting. Some tests (e.g. `TestFingerprintManager_Run_DriversInBlacklist`) have been subsituted by more isolated tests in `client/pluginmanager/drivermanager/manager_test.go`	2019-02-26 10:54:16 -05:00
Michael Schurter	3b2a592e93	client: restart task on logmon failures This code chooses to be conservative as opposed to optimal: when failing to reattach to logmon simply return a recoverable error instead of immediately trying to restart logmon. The recoverable error will cause the task's restart policy to be applied and a new logmon will be launched upon restart. Trying to do the optimal approach of simply starting a new logmon requires error string comparison and should be tested against a task actively logging to assert the behavior (are writes blocked? dropped?).	2019-02-25 15:42:45 -08:00
Michael Schurter	8830b00866	client: test logmon_hook	2019-02-23 15:36:48 -08:00
Preetha Appan	43679f4ce1	More alloc runner tests ported from 0.8.7	2019-02-22 17:58:06 -06:00
Mahmood Ali	32551fb0e5	emit TaskRestartSignal event on vault restart When Vault token expires and task is restarted, emit `TaskRestartSignal` similar to v0.8.7	2019-02-22 15:56:14 -05:00
Mahmood Ali	8cb4bbcc08	address review comments	2019-02-22 15:56:14 -05:00
Mahmood Ali	216eaa4843	tests: port TestTaskRunner_VaultManager_Signal From https://github.com/hashicorp/nomad/blob/v0.8.7/client/task_runner_test.go#L1427	2019-02-22 15:53:04 -05:00
Mahmood Ali	8e9e732319	tests: port TestTaskRunner_VaultManager_Restart From https://github.com/hashicorp/nomad/blob/v0.8.7/client/task_runner_test.go#L1352	2019-02-22 15:53:04 -05:00
Mahmood Ali	33122ca7c0	tests: port TestTaskRunner_UnregisterConsul_Retries From https://github.com/hashicorp/nomad/blob/v0.8.7/client/task_runner_test.go#L620	2019-02-22 15:53:04 -05:00
Mahmood Ali	0128b0ce7a	tests: port TestTaskRunner_Template_NewVaultToken From https://github.com/hashicorp/nomad/blob/v0.8.7/client/task_runner_test.go#L1275	2019-02-22 15:53:04 -05:00
Mahmood Ali	cfb80583af	tests: port TestTaskRunner_Template_Artifact From https://github.com/hashicorp/nomad/blob/v0.8.7/client/task_runner_test.go#L1195	2019-02-22 15:52:59 -05:00
Mahmood Ali	1b14214a88	tests: port TestAllocRunner_RetryArtifact Port TestAllocRunner_RetryArtifact from https://github.com/hashicorp/nomad/blob/v0.8.7/client/alloc_runner_test.go#L610-L672 I changed the test name because it doesn't actually test that artifact hooks is retried	2019-02-22 15:50:39 -05:00
Mahmood Ali	c827e6e05a	tests: port TestAllocRunner_MoveAllocDir test	2019-02-22 15:50:39 -05:00
Michael Schurter	a2e3ea6dc9	logmon: fix reattach configuration There were multiple bugs here: 1. Reattach unmarshalling always returned an error because you can't unmarshal into a nil pointer. 2. The hook data wasn't being saved because it was put on the request struct, not the response struct. 3. The plugin configuration should only have reattach or a command set. Not both. 4. Setting Done=true meant the hook was never re-run on agent restart so reattaching was never attempted.	2019-02-21 15:32:18 -08:00
Michael Schurter	f5e0dba9d1	fingerprint: improve initial fingerpint message The initial fingerprint message is actually fairly useful, so I bumped it to Debug and fixed the output formatting.	2019-02-21 15:32:18 -08:00
Michael Schurter	01cabdff88	client: restart on recoverable StartTask errors Fixes restarting on recoverable errors from StartTask. Ports TestTaskRunner_Run_RecoverableStartError from 0.8 which discovered the bug.	2019-02-21 15:30:49 -08:00
Michael Schurter	e3f321cd27	test: port TestTaskRunner_RestartSignalTask_NotRunning from 0.8	2019-02-21 15:30:49 -08:00
Michael Schurter	f3aa945a00	test: port TestTaskRunner_DriverNetwork from 0.8	2019-02-21 15:30:49 -08:00
Michael Schurter	518405ac33	Merge pull request #5322 from hashicorp/b-artifact-retries Fix regression by restarting on artifact download errors	2019-02-21 15:28:51 -08:00
Mahmood Ali	6d30284ec9	Merge pull request #5341 from hashicorp/ci-windows-docker Run Docker tests in Windows AppVeyor CI	2019-02-21 13:17:33 -05:00
Michael Schurter	2553800eb8	tests: port TestAllocRunner_Destroy from 0.8 Also add destroy(ar) helper to fix a bunch of shutdown races in AR tests.	2019-02-20 12:35:09 -08:00
Michael Schurter	6580ed668e	client: don't redownload completed artifacts on retries Track the download status of each artifact independently so that if only one of many artifacts fails to download, completed artifacts aren't downloaded again.	2019-02-20 08:45:12 -08:00
Michael Schurter	908bfab4c2	client: artifact errors are retry-able 0.9.0beta2 contains a regression where artifact download errors would not cause a task restart and instead immediately fail the task. This restores the pre-0.9 behavior of retrying all artifact errors and adds missing tests.	2019-02-20 07:21:27 -08:00
Michael Schurter	79ccf00b72	tests: add new task runner test helper Adds a new helper and removes a duplicated test.	2019-02-20 07:21:27 -08:00
Mahmood Ali	33ff8c3e8d	tests: expect Docker on AppVeyor Prepare to run docker on AppVeyor Windows environment	2019-02-20 07:41:47 -05:00
Michael Schurter	159042a1a3	client: fix setting alloc unhealthy at deadline During the 0.9 client refactor the code to fail a deployment when the deadline was reached was broken. This restores and tests that behavior.	2019-02-19 07:44:14 -08:00
Mahmood Ali	87be233aca	test: improve readability of duration Co-Authored-By: schmichael <michael.schurter@gmail.com>	2019-02-14 08:12:06 -08:00
Mahmood Ali	16d3414842	test: improve failure message Co-Authored-By: schmichael <michael.schurter@gmail.com>	2019-02-14 08:11:37 -08:00
Michael Schurter	4814f0fb0b	tests: port TestTaskRunner_Download_List from 0.8	2019-02-12 15:48:04 -08:00
Michael Schurter	a152e3ef17	consul: fix task deregistration hook Broke ShutdownDelay but the test was timing dependent so it just appeared flaky. Made the test slower so that it should never incorrectly pass.	2019-02-12 15:36:02 -08:00
Michael Schurter	4ad879e75e	tests: port TaskRunner_DeriveToken tests from 0.8	2019-02-12 15:36:02 -08:00
Michael Schurter	6743ed9fdc	tests: port TestTaskRunner_BlockForVault from 0.8 Also fix race conditions in the mock vault client.	2019-02-12 13:46:09 -08:00
Michael Schurter	6c0cc65b2e	simplify hcl2 parsing helper No need to pass in the entire eval context	2019-02-04 11:07:57 -08:00
Michael Schurter	fec2752fb2	client: log when allocs have been processed Will hopefully help us catch deadlocks/livelocks/slowdowns in the add/remove allocs pipeline which should be fast.	2019-02-04 11:07:57 -08:00
Michael Schurter	2db91425e3	Remove 0.9.0-beta2 generated files	2019-02-01 08:28:44 -08:00
Alex Dadgar	84d0afccae	Generate files for 0.9.0-beta2	2019-01-30 13:31:50 -08:00
Alex Dadgar	449e582ffc	Merge pull request #5281 from hashicorp/f-affinity-weight-int Change types of weights on spread/affinity	2019-01-30 13:25:56 -08:00
Alex Dadgar	d2e5ede119	remove generated structs	2019-01-30 12:38:34 -08:00
Nick Ethier	e7ea26449e	client: fix bug during 0.8 state up grade that causes external drivers to fail	2019-01-30 14:22:29 -05:00
Alex Dadgar	bc804dda2e	Nomad 0.9.0-beta1 generated code	2019-01-30 10:49:44 -08:00
Alex Dadgar	5062c54874	Fix usage of fsi variable	2019-01-29 14:07:55 -08:00
Alex Dadgar	6f418ebaf0	Always populate task dir environment variables Fixes an issue where if a task was restarted after restating the client, the task dir environment variables would not be populated. This PR fixes this for both upgrades from 0.8.X and for normal 0.9 restarts.	2019-01-29 13:17:10 -08:00
Nick Ethier	bcbed3c532	Merge pull request #5248 from hashicorp/b-rawexec-leak Fix leaked executor in raw_exec	2019-01-28 21:18:31 -05:00
Alex Dadgar	5da21635fb	Fix env templates having interpolated destinations Fixes an issue where env templates that had interpolated destinations would not work. Fixes https://github.com/hashicorp/nomad/issues/5250	2019-01-28 10:28:53 -08:00
Nick Ethier	8d7a47340c	drivermanager: don't store nil reattach configs	2019-01-25 23:07:04 -05:00
Alex Dadgar	d6412fd8e7	Fix double restart counting for templates This PR fixes an issue where template restarts would count twice since it was emitting a restarting event.	2019-01-25 15:38:13 -08:00
Nick Ethier	be976d9c9a	Merge branch 'master' into f-driver-upgradepath-test * master: (23 commits) tests: avoid assertion in goroutine spell check ci: run checkscripts tests: deflake TestRktDriver_StartWaitRecoverWaitStop drivers/rkt: Remove unused github.com/rkt/rkt drivers/rkt: allow development on non-linux cli: Hide `nomad docker_logger` from help output api: test api and structs are in sync goimports until make check is happy nil check node resources to prevent panic tr: use context in as select statement move pluginutils -> helper/pluginutils vet goimports gofmt Split hclspec move hclutils Driver tests do not use hcl2/hcl, hclspec, or hclutils move reattach config loader and singleton ...	2019-01-23 21:01:24 -05:00

1 2 3 4 5 ...

3724 commits