open-nomad

Commit Graph

Author	SHA1	Message	Date
Mahmood Ali	37e0598344	api: alloc exec recovers from bad client connection If alloc exec fails to connect to the nomad client associated with the alloc, fail over to using a server. The code attempted to special case `net.Error` for failover to rule out other permanent non-networking errors, by reusing a pattern in the logging handling. But this pattern does not apply here. `net/http.Http` wraps all errors as `*url.Error` that is net.Error. The websocket doesn't, and instead returns the raw error. If the raw error isn't a `net.Error`, like in the case of TLS handshake errors, the api package would fail immediately rather than failover.	2020-03-04 17:43:00 -05:00
Mahmood Ali	b77fd8654b	cli: recover from client ACL lookup failures This fixes a bug in the CLI handling of node lookup failures when querying allocation and FS endpoints. Allocation and FS endpoint are handled by the client; one can query the relevant client directly, or query a server to have it forwarded transparently to relevant client. Querying the client directly is benefecial to avoid loading servers with IO. As an optimization, the CLI attempts to query the client directly, but then falls back to using server forwarding path if it encounters network or connection errors (e.g. clients are locked down or in a separate inaccessible network). Here, we fix a bug where if the CLI fails to find to lookup the client details because it lacks ACL capability or other unexpected reasons, the CLI will not go through fallback path.	2019-10-04 11:23:59 -04:00
Michael Schurter	d220e630c0	api: add missing Networks field to alloc resources	2019-07-31 01:04:06 -04:00
Chris Baker	83ee50d5ab	api: removed unused AllocID from AllocSignalRequest	2019-06-21 21:44:38 +00:00
Mahmood Ali	09931bcdce	add api support for nomad exec Adds nomad exec support in our API, by hitting the websocket endpoint. We introduce API structs that correspond to the drivers streaming exec structs. For creating the websocket connection, we reuse the transport setting from api http client.	2019-05-09 16:49:08 -04:00
Mahmood Ali	f920efb962	divest /api from nomad/structs The API package needs to be independent from rest of nomad packages, to avoid leaking internal packages and dependencies (e.g. raft, ugorji, etc)	2019-04-28 13:32:26 -04:00
Danielle Lancashire	3409e0be89	allocs: Add nomad alloc signal command This command will be used to send a signal to either a single task within an allocation, or all of the tasks if <task-name> is omitted. If the sent signal terminates the allocation, it will be treated as if the allocation has crashed, rather than as if it was operator-terminated. Signal validation is currently handled by the driver itself and nomad does not attempt to restrict or validate them.	2019-04-25 12:43:32 +02:00
Danielle	198a838b61	Merge pull request #5512 from hashicorp/dani/f-alloc-stop alloc-lifecycle: nomad alloc stop	2019-04-23 13:05:08 +02:00
Danielle Lancashire	832f607433	allocs: Add nomad alloc stop This adds a `nomad alloc stop` command that can be used to stop and force migrate an allocation to a different node. This is built on top of the AllocUpdateDesiredTransitionRequest and explicitly limits the scope of access to that transition to expose it under the alloc-lifecycle ACL. The API returns the follow up eval that can be used as part of monitoring in the CLI or parsed and used in an external tool.	2019-04-23 12:50:23 +02:00
Preetha Appan	22109d1e20	Add preemption related fields to AllocationListStub	2019-04-18 10:36:44 -05:00
Danielle Lancashire	e135876493	allocs: Add nomad alloc restart This adds a `nomad alloc restart` command and api that allows a job operator with the alloc-lifecycle acl to perform an in-place restart of a Nomad allocation, or a given subtask.	2019-04-11 14:25:49 +02:00
James Rasell	9470507cf4	Add NodeName to the alloc/job status outputs. Currently when operators need to log onto a machine where an alloc is running they will need to perform both an alloc/job status call and then a call to discover the node name from the node list. This updates both the job status and alloc status output to include the node name within the information to make operator use easier. Closes #2359 Cloess #1180	2019-04-10 10:34:10 -05:00
Mahmood Ali	7bdd43f3e0	api: avoid codegen for syncing Given that the values will rarely change, specially considering that any changes would be backward incompatible change. As such, it's simpler to keep syncing manually in the rare occasion and avoid the syncing code overhead.	2019-01-18 18:52:31 -05:00
Preetha Appan	5f0a9d2cfd	Show preemption output in plan CLI	2018-11-08 09:48:43 -06:00
Preetha Appan	5b3bfb63eb	structs and API changes to plan and alloc structs needed for preemption	2018-10-30 11:06:32 -05:00
Alex Dadgar	a78cefec18	use int64	2018-10-16 15:34:32 -07:00
Preetha Appan	7c0d8c646c	Change CPU/Disk/MemoryMB to int everywhere in new resource structs	2018-10-16 16:21:42 -05:00
Alex Dadgar	bac5cb1e8b	Scheduler uses allocated resources	2018-10-02 17:08:25 -07:00
Preetha Appan	751c0eb5a5	code review feedback	2018-09-04 16:10:11 -05:00
Preetha Appan	9bc0962527	Track top k nodes by norm score rather than top k nodes per scorer	2018-09-04 16:10:11 -05:00
Preetha Appan	6ed527c636	Use heap to store top K scoring nodes. Scoring metadata is now aggregated by scorer type to make it easier to parse when reading it in the CLI.	2018-09-04 16:10:11 -05:00
Alex Dadgar	f95ab4ade8	Mark canaries on creation, and unmark on promotion	2018-05-07 14:50:01 -05:00
Alex Dadgar	8a81038cdb	Set Reschedule from deployment watcher	2018-05-07 14:50:01 -05:00
Preetha Appan	274bed1892	Add RescheduleTracker to allocs list stub struct	2018-05-01 14:53:47 -05:00
Michael Schurter	2832853bfa	Add DesiredTransition.ShouldMigrate to api pkg	2018-03-21 16:51:45 -07:00
Michael Schurter	d1ec65d765	switch to new raft DesiredTransition message	2018-03-21 16:49:48 -07:00
Alex Dadgar	db4a634072	RPC, FSM, State Store for marking DesiredTransistion fix build tag	2018-03-21 16:49:48 -07:00
Preetha Appan	342c3fb961	Added FollowupEvalID field and helper methods to calculate reschedule eligibility based on delay	2018-03-14 16:10:32 -05:00
Alex Dadgar	aa98f8ba7b	Enhance API pkg to utilize Server's Client Tunnel This PR enhances the API package by having client only RPCs route through the server when they are low cost and for filesystem access to first attempt a direct connection to the node and then falling back to a server routed request.	2018-02-15 13:59:03 -08:00
Preetha Appan	9d15e0c05b	Code review feedback	2018-01-31 09:58:05 -06:00
Preetha Appan	5714a6b8bf	Add method on API alloc to calculate attempted and remaining reschedule events	2018-01-31 09:58:05 -06:00
Preetha Appan	e09ea8c0b0	Address code review comments	2018-01-31 09:58:05 -06:00
Preetha Appan	0c56a12a77	Add RescheduleTracker to allocations API struct	2018-01-31 09:56:53 -06:00
Preetha Appan	fd2fbefa4c	Add a field to track the next allocation during a replacement	2018-01-24 17:55:05 -06:00
Preetha Appan	39d70be009	Add ModifyTime to Allocation and update it both on plan applies and client initiated updates	2017-11-01 15:13:48 -05:00
Alex Dadgar	c1cc51dbee	sync	2017-10-13 14:36:02 -07:00
Alex Dadgar	84d06f6abe	Sync namespace changes	2017-09-07 17:04:21 -07:00
Michael Schurter	b145e04d5d	Refactor GetNodeClient weirdness - No need to for a pointer to a pointer - Properly set and use QueryOptions.Region	2017-08-28 14:41:21 -07:00
Michael Schurter	7363b50666	Fix TLS support in api pkg / cli Fixes #3013 It's a little weird that Client now has a method for returning a NewClient, but it's a convenient way to dedupe the logic to connect-directly-to-a-node which is nontrivial and had sutble differences between locations.	2017-08-28 11:46:28 -07:00
Alex Dadgar	40b04a5ea9	alloc-list shows version	2017-07-07 12:12:48 -07:00
Alex Dadgar	454083ba1b	Remove canary	2017-07-07 12:10:04 -07:00
Alex Dadgar	be34d9487d	Add deployment id to alloc	2017-07-07 12:07:08 -07:00
Alex Dadgar	d04877d23c	initial impl	2017-07-07 12:03:11 -07:00
Diptanu Choudhury	bb664835c2	Added the API for GC of allocations and nodes	2017-01-12 16:18:29 -08:00
Alex Dadgar	2c838a80f6	Detect newly created allocation's properly	2017-01-08 13:55:03 -08:00
Alex Dadgar	fde7a24865	Consul-template fixes + PreviousAlloc in api	2016-10-28 15:50:35 -07:00
Diptanu Choudhury	067fcda3fe	Making the cli use TLS if the client has enabled TLS	2016-10-26 11:13:53 -07:00
Alex Dadgar	b70baffcc0	address feedback	2016-10-25 11:31:09 -07:00
Alex Dadgar	7368b468d5	Don't query for node-status if the node is down and handle the errors	2016-10-20 18:05:58 -07:00
Alex Dadgar	e952540f6f	Allocation resources returned in a struct	2016-06-11 21:04:10 -07:00

1 2

68 Commits