open-nomad

Author	SHA1	Message	Date
Michael Schurter	91b5bb58d9	add HasHealth helper for nil checks We performed the DeploymentStatus nil checks a couple different ways, so hopefully this helper will consoldiate them and make it more clear what the code is doing.	2018-03-29 09:29:19 -07:00
Michael Schurter	5eb0cb7176	only service allocs should have health watched	2018-03-28 16:20:11 -07:00
Michael Schurter	8b346c6176	test: try to prevent flakiness on travis	2018-03-21 16:51:45 -07:00
Josh Soref	58b794875f	spelling: artifact	2018-03-11 17:41:02 +00:00
Alex Dadgar	b24b05e025	Remove testing	2018-02-15 13:59:01 -08:00
Michael Schurter	f86f0bd9ea	Handle leader task being dead in RestoreState Fixes the panic mentioned in https://github.com/hashicorp/nomad/issues/3420#issuecomment-341666932 While a leader task dying serially stops all follower tasks, the synchronizing of state is asynchrnous. Nomad can shutdown before all follower tasks have updated their state to dead thus saving the state necessary to hit this panic: have a non-terminal alloc with a dead leader. The actual fix is a simple nil check to not assume non-terminal allocs leader's have a TaskRunner.	2017-11-15 10:36:13 -08:00
Preetha Appan	0eaef09675	Remove event GenericSource, and address other code review comments. Also added deprecation info in comments.	2017-11-03 10:10:06 -05:00
Chelsea Holland Komlo	410adaf726	Add functionality for authenticated volumes	2017-10-11 17:09:20 -07:00
Michael Schurter	a66c53d45a	Remove `structs` import from `api` Goes a step further and removes structs import from api's tests as well by moving GenerateUUID to its own package.	2017-09-29 10:36:08 -07:00
Alex Dadgar	4173834231	Enable more linters	2017-09-26 15:26:33 -07:00
Alex Dadgar	d6187cd3e8	Fix tests	2017-08-16 16:26:52 -07:00
Alex Dadgar	7dd86b5dfe	Merge pull request #3025 from hashicorp/f-health-events Emit task events explaining alloc health	2017-08-15 12:23:46 -07:00
Alex Dadgar	fdc0115427	test	2017-08-12 14:42:53 -07:00
Michael Schurter	b7915bdac7	Update tests for new blocking/migrating code	2017-08-11 16:21:57 -07:00
Alex Dadgar	d86b3977b9	Fix alloc health with checks using interpolation Fixes an issue in which the allocation health watcher was checking for allocations health based on un-interpolated services and checks. Change the interface for retrieving check information from Consul to retrieving all registered services and checks by allocation. In the future this will allow us to output nicer messages. Fixes https://github.com/hashicorp/nomad/issues/2969	2017-08-07 16:27:08 -07:00
Alex Dadgar	553bc91725	Parallel client tests (#2890 ) * alloc_runner * Random tests * parallel task_runner and no exec compatible check * Parallel client * Fail fast and use random ports * Fix docker port mapping * Make concurrent pull less timing dependant * up parallel * Fixes * don't build chroots in parallel on travis * Reduce parallelism on travis with lxc/rkt * make java test app not run forever * drop parallelism a little * use docker ports that are out of the os's ephemeral port range * Limit even more on travis * rkt deadline	2017-07-22 19:04:36 -07:00
Michael Schurter	a22cfa8387	Minor test race fix	2017-07-21 16:17:23 -07:00
Michael Schurter	996ce9286e	Fix test race by locking around ar.tasks access	2017-07-21 14:25:51 -07:00
Michael Schurter	5f40901422	Fix more test races	2017-07-21 14:00:21 -07:00
Michael Schurter	b9ba447399	Fixup a few more even rarer test races	2017-07-21 13:43:32 -07:00
Michael Schurter	6e80a8ee39	Fix TestAllocRunner_TaskLeader_StopTG Also make alloc runner tests less racy. Basically every alloc runner test used to have races with `upd.{Count,Allocs}`	2017-07-21 13:37:16 -07:00
Alex Dadgar	067ed86a47	Client watches for allocation health using task state and Consul checks This PR adds watching of allocation health at the client. The client can watch for health based on the tasks running on time and also based on the consul checks passing.	2017-07-07 12:10:04 -07:00
Michael Schurter	5ec52ec24a	Destroy task group leader first Before this commit all tasks in a task group were destroyed concurrently. This meant logging sidecars might be stopped before the leader task whose logs still need to be shipped. This commit blocks on the leader shutting down before signalling to followers to shutdown.	2017-07-03 13:56:56 -07:00
Michael Schurter	cb568a5cf6	Cleanup lots of leaked alloc runners in tests	2017-05-31 11:39:50 -07:00
Michael Schurter	5f9cb4c514	Switch tests to mock_driver	2017-05-25 09:28:10 -07:00
Michael Schurter	d793dde4e9	Shrink chroot to avoid timing test failure	2017-05-23 16:11:24 -07:00
Michael Schurter	15ef740ab6	Add env.Builder.UpdateTask for alloc updates	2017-05-23 16:00:57 -07:00
Michael Schurter	e7db2c9b0e	Handle Driver.Prestart returning nil, nil	2017-05-23 13:53:34 -07:00
Alex Dadgar	3cd7e06fba	Fix test	2017-05-09 11:35:48 -07:00
Alex Dadgar	ba70cc4f01	Merge branch 'master' into f-bolt-db	2017-05-09 11:11:55 -07:00
Michael Schurter	5b8415df2c	Merge pull request #2585 from hashicorp/b-2554-container-exec Execute exec/java script checks in containers	2017-05-05 10:31:18 -07:00
Alex Dadgar	2d54ee2925	Fix tests	2017-05-03 15:14:19 -07:00
Alex Dadgar	1d8444bc1e	Fix tests	2017-05-03 11:15:30 -07:00
Michael Schurter	20322a5e92	Test pre-0.6 script check upgrade path	2017-04-25 11:41:03 -07:00
Michael Schurter	e204a287ed	Refactor Consul Syncer into new ServiceClient Fixes #2478 #2474 #1995 #2294 The new client only handles agent and task service advertisement. Server discovery is mostly unchanged. The Nomad client agent now handles all Consul operations instead of the executor handling task related operations. When upgrading from an earlier version of Nomad existing executors will be told to deregister from Consul so that the Nomad agent can re-register the task's services and checks. Drivers - other than qemu - now support an Exec method for executing abritrary commands in a task's environment. This is used to implement script checks. Interfaces are used extensively to avoid interacting with Consul in tests that don't assert any Consul related behavior.	2017-04-19 12:42:47 -07:00
Alex Dadgar	c52000f792	FinishedAt only records when the task has actually started	2017-03-31 17:06:05 -07:00
Alex Dadgar	81b78f77e1	Track task start/finish time & improve logs errors This PR adds tracking to when a task starts and finishes and the logs API takes advantage of this and returns better errors when asking for logs that do not exist.	2017-03-31 16:14:11 -07:00
Alex Dadgar	3fb285f7d3	Fix TestAllocRunner_SaveRestoreState	2017-03-02 20:45:46 -08:00
Alex Dadgar	5be806a3df	Fix vet script and fix vet problems This PR fixes our vet script and fixes all the missed vet changes. It also fixes pointers being printed in `nomad stop <job>` and `nomad node-status <node>`.	2017-02-27 16:00:19 -08:00
Alex Dadgar	238b4bcafd	Add Leader support to client	2017-02-10 17:55:19 -08:00
Alex Dadgar	6b02229eb0	fix flaky test	2017-01-23 14:12:38 -08:00
Michael Schurter	ea87091e58	Prevent race between alloc runners Block ar1's periodic syncing which could recreate the state file ar2 was destroying.	2017-01-17 13:10:20 -08:00
Michael Schurter	5a6bd19eb7	Fix upgrade path for #2132 AllocRunner's state dropped the Context struct which needs to be converted to the new AllocDir+TaskDir structs in RestoreState. TaskRunner added a TaskDirBuilt flag, but it's safe to just let that default to `false` and rebuild all task dirs once on upgrade.	2017-01-05 16:31:55 -08:00
Michael Schurter	3ea09ba16a	Move chroot building into TaskRunner * Refactor AllocDir to have a TaskDir struct per task. * Drivers expose filesystem isolation preference * Fix lxc mounting of `secrets/`	2017-01-05 16:31:49 -08:00
Michael Schurter	e1d63f6c0f	Bump timeout on test	2016-11-29 16:19:40 -08:00
Diptanu Choudhury	1098dc4aa3	Fixed alloc dir move tests	2016-10-26 15:17:57 -07:00
Alex Dadgar	e85d0ebace	Merge pull request #1840 from hashicorp/f-kill-fail Change how we mark tasks as failed and allow consul-template to fail tasks	2016-10-24 13:40:52 -07:00
Michael Schurter	285e80ac0f	Remove disk usage enforcement Many thanks to @iverberk for the original PR (#1609), but we ended up not wanting to ship this implementation with 0.5. We'll come back to it after 0.5 and hopefully find a way to leverage filesystem accounting and quotas, so we can skip the expensive polling.	2016-10-21 13:55:51 -07:00
Alex Dadgar	46a7d1a0d7	Change how we mark tasks as failed and allow consul-template to fail tasks	2016-10-20 17:27:16 -07:00
Alex Dadgar	36cfe6e89e	Large refactor of task runner and Vault token rehandling	2016-10-18 11:24:20 -07:00

1 2 3

103 commits