open-nomad

Author	SHA1	Message	Date
James Rasell	751c8217d1	core: allow setting and propagation of eval priority on job de/registration (#11532 ) This change modifies the Nomad job register and deregister RPCs to accept an updated option set which includes eval priority. This param is optional and override the use of the job priority to set the eval priority. In order to ensure all evaluations as a result of the request use the same eval priority, the priority is shared to the allocReconciler and deploymentWatcher. This creates a new distinction between eval priority and job priority. The Nomad agent HTTP API has been modified to allow setting the eval priority on job update and delete. To keep consistency with the current v1 API, job update accepts this as a payload param; job delete accepts this as a query param. Any user supplied value is validated within the agent HTTP handler removing the need to pass invalid requests to the server. The register and deregister opts functions now all for setting the eval priority on requests. The change includes a small change to the DeregisterOpts function which handles nil opts. This brings the function inline with the RegisterOpts.	2021-11-23 09:23:31 +01:00
Mahmood Ali	33dfe98770	deployment watcher: Reuse allocsCh if allocIndex remains the same (#10756 ) Fix deployment watchers to avoid creating unnecessary deployment watcher goroutines and blocking queries. `deploymentWatcher.getAllocsCh` creates a new goroutine that makes a blocking query to fetch updates of deployment allocs. ## Background When operators submit a new or updated service job, Nomad create a new deployment by default. The deployment object controls how fast to place the allocations through [`max_parallel`](https://www.nomadproject.io/docs/job-specification/update#max_parallel) and health checks configurations. The `scheduler` and `deploymentwatcher` package collaborate to achieve deployment logic: The scheduler only places the canaries and `max_parallel` allocations for a new deployment; the `deploymentwatcher` monitors for alloc progress and then enqueues a new evaluation whenever the scheduler should reprocess a job and places the next `max_parallel` round of allocations. The `deploymentwatcher` package makes blocking queries against the state store, to fetch all deployments and the relevant allocs for each running deployments. If `deploymentwatcher` fails or is hindered from fetching the state, the deployments fail to make progress. `Deploymentwatcher` logic only runs on the leader. ## Why unnecessary deployment watchers can halt cluster progress Previously, `getAllocsCh` is called on every for loop iteration in `deploymentWatcher.watch()` function. However, the for-loop may iterate many times before the allocs get updated. In fact, whenever a new deployment is created/updated/deleted, all `deploymentWatcher`s get notified through `w.deploymentUpdateCh`. The `getAllocsCh` goroutines and blocking queries spike significantly and grow quadratically with respect to the number of running deployments. The growth leads to two adverse outcomes: 1. it spikes the CPU/Memory usage resulting potentially leading to OOM or very slow processing 2. it activates the [query rate limiter](`abaa9c5c5b/nomad/deploymentwatcher/deployment_watcher.go (L896-L898)`), so later the watcher fails to get updates and consequently fails to make progress towards placing new allocations for the deployment! So the cluster fails to catch up and fails to make progress in almost all deployments. The cluster recovers after a leader transition: the deposed leader stops all watchers and free up goroutines and blocking queries; the new leader recreates the watchers without the quadratic growth and remaining under the rate limiter. Well, until a spike of deployments are created triggering the condition again. ### Relevant Code References Path for deployment monitoring: * [`Watcher.watchDeployments`](`abaa9c5c5b/nomad/deploymentwatcher/deployments_watcher.go (L164-L192)`) loops waiting for deployment updates. * On every deployment update, [`w.getDeploys`](`abaa9c5c5b/nomad/deploymentwatcher/deployments_watcher.go (L194-L229)`) returns all deployments in the system * `watchDeployments` calls `w.add(d)` on every active deployment * which in turns, [updates existing watcher if one is found](`abaa9c5c5b/nomad/deploymentwatcher/deployments_watcher.go (L251-L255)`). * The deployment watcher [updates local local deployment field and trigger `deploymentUpdateCh` channel]( `abaa9c5c5b/nomad/deploymentwatcher/deployment_watcher.go (L136-L147)`) * The [deployment watcher `deploymentUpdateCh` selector is activated](`abaa9c5c5b/nomad/deploymentwatcher/deployment_watcher.go (L455-L489)`). Most of the time the selector clause is a no-op, because the flow was triggered due to another deployment update * The `watch` for-loop iterates again and in the previous code we create yet another goroutine and blocking call that risks being rate limited. Co-authored-by: Tim Gross <tgross@hashicorp.com>	2021-06-14 16:01:01 -04:00
Tim Gross	b764f52ab9	deploymentwatcher: reset progress deadline on promotion (#10042 ) In a deployment with two groups (ex. A and B), if group A's canary becomes healthy before group B's, the deadline for the overall deployment will be set to that of group A. When the deployment is promoted, if group A is done it will not contribute to the next deadline cutoff. Group B's old deadline will be used instead, which will be in the past and immediately trigger a deployment progress failure. Reset the progress deadline when the job is promotion to avoid this bug, and to better conform with implicit user expectations around how the progress deadline should interact with promotions.	2021-02-22 16:44:03 -05:00
Kris Hicks	0a3a748053	Add gosimple linter (#9590 )	2020-12-09 11:05:18 -08:00
Michael Schurter	8ccbd92cb6	api: add field filters to /v1/{allocations,nodes} Fixes #9017 The ?resources=true query parameter includes resources in the object stub listings. Specifically: - For `/v1/nodes?resources=true` both the `NodeResources` and `ReservedResources` field are included. - For `/v1/allocations?resources=true` the `AllocatedResources` field is included. The ?task_states=false query parameter removes TaskStates from /v1/allocations responses. (By default TaskStates are included.)	2020-10-14 10:35:22 -07:00
Tim Gross	d3341a2019	refactor: make it clear where we're accessing dstate The field name `Deployment.TaskGroups` contains a map of `DeploymentState`, which makes it a little harder to follow state updates when combined with inconsistent naming conventions, particularly when we also have the state store or actual `TaskGroup`s in scope. This changeset changes all uses to `dstate` so as not to be confused with actual TaskGroups.	2020-07-20 11:25:53 -04:00
Tim Gross	fd50b12ee2	multiregion: integrate with deploymentwatcher * `nextRegion` should take status parameter * thread Deployment/Job RPCs thru `nextRegion` * add `nextRegion` calls to `deploymentwatcher` * use a better description for paused for peer	2020-06-17 11:06:00 -04:00
Tim Gross	48e9f75c1e	multiregion: deploymentwatcher hooks This changeset establishes hooks in deploymentwatcher for multiregion deployments (for the enterprise version of Nomad).	2020-06-17 11:05:18 -04:00
Jasmine Dahilig	8d980edd2e	add create and modify timestamps to evaluations (#5881 )	2019-08-07 09:50:35 -07:00
Lang Martin	0f6f543a5f	deployment_watcher auto promote iff every task group is auto promotable	2019-05-22 12:34:57 -04:00
Lang Martin	0c668ecc7a	log error on autoPromoteDeployment failure	2019-05-22 12:32:08 -04:00
Lang Martin	b5fd735960	add update AutoPromote bool	2019-05-22 12:32:08 -04:00
Lang Martin	0bebf5d7f8	deployment_watcher when it's ok to autopromote, do so	2019-05-22 12:32:08 -04:00
Alex Dadgar	be54e56570	review fixes	2018-11-08 09:48:36 -08:00
Alex Dadgar	1c31970464	Fix multiple tgs with progress deadline handling Fix an issue in which the deployment watcher would fail the deployment based on the earliest progress deadline of the deployment regardless of if the task group has finished. Further fix an issue where the blocked eval optimization would make it so no evals were created to progress the deployment. To reproduce this issue, prior to this commit, you can create a job with two task groups. The first group has count 1 and resources such that it can not be placed. The second group has count 3, max_parallel=1, and can be placed. Run this first and then update the second group to do a deployment. It will place the first of three, but never progress since there exists a blocked eval. However, that doesn't capture the fact that there are two groups being deployed.	2018-11-05 16:06:17 -08:00
Alex Dadgar	de442226ae	Fix other instances of blocking queries	2018-09-24 13:52:39 -07:00
Alex Dadgar	7f0d241ef4	always handle failed allocation	2018-09-21 15:13:54 -07:00
Alex Dadgar	b2449ae1ce	Fix deployment watcher index usage Fixes three issues: 1. Retrieving the latest evaluation index was not properly selecting the greatest index. This would undermine checks we had to reduce the number of evaluations created when the latest eval index was greater than any alloc change 2. Fix an issue where the blocking query code was using the incorrect index such that the index was higher than necassary. 3. Special case handling of blocked evaluation since the create/snapshot index is no particularly useful since they can be reblocked.	2018-09-21 13:59:11 -07:00
Alex Dadgar	3c19d01d7a	server	2018-09-15 16:23:13 -07:00
Alex Dadgar	c6576ddac1	Fix make check errors	2018-09-04 16:03:52 -07:00
Preetha Appan	4e75456beb	Fix deadlock in deadline timer logic when progress deadline is passed and the deployment is updated.	2018-05-07 14:55:01 -05:00
Preetha Appan	4c377b112e	Fix panic in deployment watcher when deployment is not in the state store due to a gc	2018-05-07 14:55:01 -05:00
Alex Dadgar	768fec8505	Allow healthy canary deployment to skip progress deadline	2018-05-07 14:55:01 -05:00
Preetha Appan	b2b773e696	better comments and remove commented code	2018-05-07 14:50:01 -05:00
Preetha Appan	90a2311cef	Fix deadlock in deployment watcher when deployment starts with no allocations and eventually has failed allocations	2018-05-07 14:50:01 -05:00
Alex Dadgar	8d50955054	Fix typos	2018-05-07 14:50:01 -05:00
Alex Dadgar	8a81038cdb	Set Reschedule from deployment watcher	2018-05-07 14:50:01 -05:00
Alex Dadgar	a510774451	Use UpdateAllocDesiredTransistion instead of UpsertEval but no transistions yet	2018-05-07 14:50:01 -05:00
Alex Dadgar	fcf4f582d0	small review feedback fixes	2018-05-07 14:50:01 -05:00
Alex Dadgar	9bff9024b3	add latest eval back	2018-05-07 14:50:01 -05:00
Alex Dadgar	99e00fb774	Pass through timestamp	2018-05-07 14:50:01 -05:00
Alex Dadgar	c49b5f9949	Handle progressed deployments and tests	2018-05-07 14:50:01 -05:00
Alex Dadgar	9e75ea0a11	Deployment watcher based on deployment having progress deadline	2018-05-07 14:50:01 -05:00
Alex Dadgar	1336002255	Progress deadline in deployment state	2018-05-07 14:50:01 -05:00
Alex Dadgar	55b483709f	Fix tests	2018-05-07 14:50:01 -05:00
Alex Dadgar	ee50789c22	Initial implementation	2018-05-07 14:50:01 -05:00
Alex Dadgar	4844317cc2	Merge pull request #3890 from hashicorp/b-heartbeat Heartbeat improvements and handling failures during establishing leadership	2018-03-12 14:41:59 -07:00
Josh Soref	173ce63fe9	spelling: transition	2018-03-11 19:06:05 +00:00
Alex Dadgar	040599dae9	Fix leaking time.After function	2018-02-20 12:47:43 -08:00
Alex Dadgar	601177c250	Add escape hatches when non-leader	2018-02-20 10:22:15 -08:00
Preetha Appan	6468883cd1	Adds comment to handleRollbackValidity method and other small test readability fixes.	2017-11-03 17:05:15 -05:00
Preetha Appan	317fbf04b1	Adds SpecChanged check to alloc health and fail deployment end points, and other code review comments.	2017-11-03 15:33:34 -05:00
Preetha Appan	97474a1521	Clarify comment about infinite revert cycles	2017-11-03 14:25:14 -05:00
Preetha Appan	5505391663	Fixes auto revert to check if the job's spec has changed before reverting. This prevents infinite reverting when reverting to a job version that was previously stable, but not so after attempting a revert.	2017-11-02 19:53:27 -05:00
Michael Schurter	a66c53d45a	Remove `structs` import from `api` Goes a step further and removes structs import from api's tests as well by moving GenerateUUID to its own package.	2017-09-29 10:36:08 -07:00
Alex Dadgar	84d06f6abe	Sync namespace changes	2017-09-07 17:04:21 -07:00
Alex Dadgar	590ff91bf3	Deployment watcher takes state store	2017-08-30 18:51:59 -07:00
Alex Dadgar	f64b05a001	Deployment desc when no stable job and autorevert This PR adds a specialized description when the job has autorevert set and there is no job to revert to.	2017-08-12 15:50:51 -07:00
Luke Farnell	f0ced87b95	fixed all spelling mistakes for goreport	2017-08-07 17:13:05 -04:00
Alex Dadgar	30efd5a27a	Skip error log on shutdown This PR fixes the detection of a shutdown scenario and squelches the error log.	2017-07-19 11:15:53 -07:00

1 2

63 commits