open-nomad

Author	SHA1	Message	Date
Lang Martin	34230577df	describe a pending deployment with auto_promote accurately	2019-05-22 12:32:08 -04:00
Lang Martin	d462639cc9	sched reconcile copy AutoPromote to DeploymentState	2019-05-22 12:32:08 -04:00
Preetha Appan	1574e898af	Fix bug in reconciler where terminal allocs on a job already stopped were unnecessarily updated	2018-10-08 21:03:49 -05:00
Alex Dadgar	3c19d01d7a	server	2018-09-15 16:23:13 -07:00
Alex Dadgar	3ba62efd5e	Failed/paused deployments do not block migrations. This PR changes the behavior of the scheduler such that a task group with a deployment that is failed or paused will not cause the scheduler to skip migrations. The reason for this change is that the prior behavior caused a bad UX when draining nodes with allocations that are part of a failed/paused deployment. These operations should not be coupled in any way and this remedies that. Prior behavior was still correct, but required either jobs to transition to a healthy state or the node to hit its drain deadline.	2018-09-10 15:28:45 -07:00
Preetha Appan	3e264dcb79	Fix reconciler bug with deployment not being created if job create index is different. This fixes an issue where, if a job is purged and resubmitted, Nomad does not create a new deployment. Adds a unit test that failed before this fix.	2018-06-05 13:58:53 -05:00
Preetha Appan	cf44670d56	Make sure that task group has a deployment state before using it	2018-05-07 14:55:01 -05:00
Alex Dadgar	768fec8505	Allow healthy canary deployment to skip progress deadline	2018-05-07 14:55:01 -05:00
Alex Dadgar	8626c1b94a	Reschedule when we have canaries properly	2018-05-07 14:55:01 -05:00
Alex Dadgar	550f5e31f8	Allow canary count greater than desired	2018-05-07 14:50:01 -05:00
Preetha Appan	5329900f6d	Only use DesiredTransition.Reschedule in reconciler when it's an active deployment	2018-05-07 14:50:01 -05:00
Alex Dadgar	57969b4ee0	fix reconcile tests	2018-05-07 14:50:01 -05:00
Alex Dadgar	fcf4f582d0	small review feedback fixes	2018-05-07 14:50:01 -05:00
Alex Dadgar	1336002255	Progress deadline in deployment state	2018-05-07 14:50:01 -05:00
Alex Dadgar	ee50789c22	Initial implementation	2018-05-07 14:50:01 -05:00
Preetha Appan	a569d34f25	Add custom status description for rescheduling follow up evals, and make unit test robust	2018-04-10 15:30:15 -05:00
Preetha Appan	7e17bc231f	remove unnecessary check and other fixes from code review	2018-04-04 07:35:20 -05:00
Preetha Appan	00537c739b	Fixes edge cases around timing and task finish time being set more than once	2018-04-03 16:34:59 -05:00
Alex Dadgar	e106da84de	name and test	2018-03-26 11:06:21 -07:00
Alex Dadgar	e2a6e64fca	Don't create unnecessary deployments	2018-03-23 16:55:21 -07:00
Alex Dadgar	3b72dd94ba	Do not mark an allocation as an inplace update if specification hasn't changed	2018-03-23 14:36:05 -07:00
Michael Schurter	cb61a4bdc7	Fix linting errors	2018-03-21 16:51:45 -07:00
Alex Dadgar	92b636dd32	Fix deadline handling	2018-03-21 16:51:44 -07:00
Alex Dadgar	db4a634072	RPC, FSM, State Store for marking DesiredTransition; fix build tag	2018-03-21 16:49:48 -07:00
Preetha Appan	56e60e5840	Fix linting warning	2018-03-14 16:12:22 -05:00
Preetha Appan	9fed0d2103	Get reschedule policy from the alloc directly	2018-03-14 16:10:32 -05:00
Preetha Appan	e2656ef546	Cleaner handling of batched evals	2018-03-14 16:10:32 -05:00
Preetha Appan	47e0280d96	More small review feedback	2018-03-14 16:10:32 -05:00
Preetha Appan	5373ade731	Scheduler and Reconciler changes to support delayed rescheduling	2018-03-14 16:10:32 -05:00
Josh Soref	a89e1b8395	spelling: strategy	2018-03-11 18:58:19 +00:00
Josh Soref	f8eb766fb5	spelling: reschedulable	2018-03-11 18:48:12 +00:00
Preetha Appan	7c57303dd2	Clarify comment	2018-02-05 16:37:07 -06:00
Preetha Appan	d48c411692	Reconciler should consider failed allocs when marking deployment as failed.	2018-02-02 19:40:25 -06:00
Preetha Appan	ea4a889e28	Address more code review feedback	2018-01-31 09:56:53 -06:00
Preetha Appan	bd89d2b39e	Make sure that reschedule trackers are not added for node drain replacements	2018-01-31 09:56:53 -06:00
Preetha Appan	21b7b79d5d	Add helper methods, use require and other code review feedback	2018-01-31 09:56:53 -06:00
Preetha Appan	fbb1936dee	Fix some comments and lint warnings, remove unused method	2018-01-31 09:56:53 -06:00
Preetha Appan	031c566ada	Reschedule previous allocs and track their reschedule attempts	2018-01-31 09:56:53 -06:00
Alex Dadgar	746cd7403f	Allow batch jobs to be rerun if purged. This PR allows batch jobs to be rerun if they have been purged.	2017-10-13 12:40:37 -07:00
Alex Dadgar	3904bde9a3	Fix batch handling of complete allocs/node drains. This PR fixes: (1) an issue in which a node-drain that contains a complete batch alloc would cause a replacement; (2) an issue in which allocations with the same name during a scale down/stop event wouldn't be properly stopped; (3) an issue in which batch allocations from previous job versions may not have been stopped properly. Fixes https://github.com/hashicorp/nomad/issues/3210	2017-09-14 15:08:57 -07:00
Alex Dadgar	27256ebcc6	Placing allocs counts towards placement limit. This PR makes placing new allocations count towards the limit. We do not restrict how many new placements are made by the limit but we still count them towards the limit. This has the nice effect that if you have a group with count = 5 and max_parallel = 1 but only 3 allocs exist for it and a change is made, you will create 2 more at the new version but not destroy one, taking you down to two running as you would have previously. Fixes https://github.com/hashicorp/nomad/issues/3053	2017-08-21 12:41:19 -07:00
Luke Farnell	f0ced87b95	fixed all spelling mistakes for goreport	2017-08-07 17:13:05 -04:00
Alex Dadgar	7b13c0d702	Lost allocs replaced even if deployment failed. This PR allows the scheduler to replace lost allocations even if the job has a failed or paused deployment. The prior behavior was confusing to users. Fixes https://github.com/hashicorp/nomad/issues/2958	2017-08-03 17:42:14 -07:00
Alex Dadgar	492239d3ee	Improve multiple group handling in a deployment. This PR resolves a bug in which a job with multiple task groups would create a new deployment object for each, thus clearing out all other task groups' deployment state.	2017-07-25 11:27:47 -07:00
Alex Dadgar	a9ec1d6ca7	Fix update limit calculation to avoid panic. This PR fixes the rolling update limit calculation to avoid a panic when there are more allocations for a deployment that haven't determined their health than the max_parallel count of the task group. Fixes https://github.com/hashicorp/nomad/issues/2820	2017-07-19 11:11:47 -07:00
Alex Dadgar	66a90326e1	Treat destructive updates atomically	2017-07-16 10:35:38 -07:00
Alex Dadgar	f86760db3c	Basic logs	2017-07-07 16:49:08 -07:00
Alex Dadgar	20005f925a	Rolling node drains using max_parallel and stagger. This PR adds rolling node drains done at the max_parallel and stagger of the update spec. It brings it in line with old behavior.	2017-07-07 12:12:48 -07:00
Alex Dadgar	3a29b38108	Status description shows requiring promotion	2017-07-07 12:12:48 -07:00
Alex Dadgar	9f016606aa	Fix some tests, eval monitor shows deployment id and deployment cancels based on version	2017-07-07 12:12:48 -07:00

74 commits