open-nomad

Author	SHA1	Message	Date
Michael Lange	6e3e8d01e2	Stub out all the markup for the new drain strategy view	2020-01-23 16:34:22 -08:00
Michael Lange	47924d05d3	Polish for the drain popover	2020-01-23 16:34:21 -08:00
Michael Lange	deb1dc7ffe	Refactor parseDuration to support multi-character units	2020-01-23 16:34:20 -08:00
Michael Lange	c5e662a751	Test coverage for the parse duration util	2020-01-23 16:34:20 -08:00
Michael Lange	395ba979a7	Extract the duration parsing into a util	2020-01-23 16:34:19 -08:00
Michael Lange	fd16612be3	Factor out the drain popover and implement its behaviors	2020-01-23 16:34:18 -08:00
Michael Lange	d13af1bcb5	Add triggerClass support to the popover menu	2020-01-23 16:34:18 -08:00
Michael Lange	ada745a59f	Let dropdowns assume their full width	2020-01-23 16:34:17 -08:00
Michael Lange	a3f32e61c1	Initial layout of the drain options popover	2020-01-23 16:34:16 -08:00
Michael Lange	d1cf0e1054	More form styles as needed for the drain form	2020-01-23 16:34:16 -08:00
Michael Lange	01d0efb317	Multiline modifier for tooltips	2020-01-23 16:34:15 -08:00
Michael Lange	75c9e80251	Update the dropdown styles to be more similar to button styles	2020-01-23 16:34:14 -08:00
Michael Lange	20f83da76a	PopoverMenu component	2020-01-23 16:34:14 -08:00
Michael Lange	e685179c99	Eligibility toggling behavior	2020-01-23 16:34:13 -08:00
Michael Lange	bfa34d5273	Update the underlying node model when toggling eligibility in mirage	2020-01-23 16:34:12 -08:00
Michael Lange	a8af3d67be	Add tooltip to the eligibility control	2020-01-23 16:34:11 -08:00
Michael Lange	47daa980a5	Reformat the client detail page to use the two-row header design	2020-01-23 16:34:11 -08:00
Michael Lange	c8e29c7060	cancelDrain adapter method	2020-01-23 16:34:10 -08:00
Michael Lange	b162fa8a28	Update drain methods to properly wrap DrainSpec params	2020-01-23 16:34:09 -08:00
Michael Lange	dae33efa1d	drain and forceDrain adapter methods	2020-01-23 16:34:09 -08:00
Michael Lange	85066eee07	Mock the drain endpoint	2020-01-23 16:34:08 -08:00
Michael Lange	321b5fb6e6	Add isMigrating property to the allocation model	2020-01-23 16:34:07 -08:00
Michael Lange	aad8536949	Implement eligibility toggling in the data layer	2020-01-23 16:34:07 -08:00
Michael Lange	94955c8b08	Mock the eligibility endpoint in mirage	2020-01-23 16:34:06 -08:00
Michael Schurter	17d402b680	Merge pull request #6953 from TimHiggison/patch-2 Update configuring-tasks.html.md	2020-01-23 12:15:54 -08:00
Michael Schurter	ae0baf389b	Merge pull request #6952 from TimHiggison/patch-1 Update ecs.html.md	2020-01-23 12:15:07 -08:00
Charlie Voiselle	b0d7b4e1d7	Merge pull request #6780 from hashicorp/km.intro-video website: add ‘intro to nomad’ video to /intro	2020-01-23 11:32:13 -05:00
Tim Gross	457e3ad5c6	e2e: document e2e provisioning process (#6976)	2020-01-22 16:55:17 -05:00
Mahmood Ali	e436d2701a	Handle Nomad leadership flapping Fixes a deadlock in leadership handling if leadership flapped. Raft propagates leadership transition to Nomad through a NotifyCh channel. Raft blocks when writing to this channel, so the channel must be buffered or aggressively consumed[1]. Otherwise, Raft blocks indefinitely in `raft.runLeader` until the channel is consumed[2] and does not move on to executing follower related logic (in `raft.runFollower`). While the Raft `runLeader` defer function blocks, raft cannot process any other raft operations. For example, `run{Leader|Follower}` methods consume `raft.applyCh`, and while the runLeader defer is blocked, all raft log applications or config lookups will block indefinitely. Sadly, `leaderLoop` and `establishLeader` make a few Raft calls! `establishLeader` attempts to auto-create autopilot/scheduler config [3]; and `leaderLoop` attempts to check raft configuration [4]. All of these calls occur without a timeout. Thus, if leadership flapped quickly while `leaderLoop/establishLeadership` is invoked and hits any of these Raft calls, the Raft handler _deadlocks_ forever. Depending on how many times it flapped and where exactly we get stuck, I suspect it's possible to get in the following case: * Agent metrics/stats http and RPC calls hang as they check raft.Configurations * raft.State remains in Leader state, and server attempts to handle RPC calls (e.g. node/alloc updates) and these hang as well As we create goroutines per RPC call, the number of goroutines grows over time and may trigger out-of-memory errors in addition to missed updates. [1] `d90d6d6bda/config.go (L190-L193)` [2] `d90d6d6bda/raft.go (L425-L436)` [3] `2a89e47746/nomad/leader.go (L198-L202)` [4] `2a89e47746/nomad/leader.go (L877)`	2020-01-22 13:08:34 -05:00
Drew Bailey	936b96af5e	Merge pull request #6968 from hashicorp/b-system-sched-plan-ineligible Return FailedTGAlloc metric instead of no node err	2020-01-22 11:53:07 -05:00
Mahmood Ali	129c884105	extract leader step function	2020-01-22 10:55:48 -05:00
Drew Bailey	94ff586256	update changelog	2020-01-22 10:10:15 -05:00
Drew Bailey	63ddda71e1	Return FailedTGAlloc metric instead of no node err If an existing system allocation is running and the node it's running on is marked as ineligible, subsequent plan/applys return an RPC error instead of a more helpful plan result. This change logs the error, and appends a failedTGAlloc for the placement.	2020-01-22 10:07:15 -05:00
Tim Gross	29e1ed6b05	e2e: ensure group script check tests interpolation (#6972) Fixes a bug introduced in 0aa58b9 where we're writing a test file to a taskdir-interpolated location, which works when we `alloc exec` but not in the jobspec for a group script check. This changeset also makes the test safe to run multiple times by namespacing the file with the alloc ID, which has the added bonus of exercising our alloc interpolation code for group script checks.	2020-01-22 09:54:54 -05:00
Tim Gross	2edbdfc8be	e2e: update framework to allow deploying Nomad (#6969) The e2e framework instantiates clients for Nomad/Consul but the provisioning of the actual Nomad cluster is left to Terraform. The Terraform provisioning process uses `remote-exec` to deploy specific versions of Nomad so that we don't have to bake an AMI every time we want to test a new version. But Terraform treats the resulting instances as immutable, so we can't use the same tooling to update the version of Nomad in-place. This is a prerequisite for upgrade testing. This changeset extends the e2e framework to provide the option of deploying Nomad (and, in the future, Consul/Vault) with specific versions to running infrastructure. This initial implementation is focused on deploying to a single cluster via `ssh` (because that's our current need), but provides interfaces to hook the test run at the start of the run, the start of each suite, or the start of a given test case. Terraform work includes: * provides Terraform output that is written to JSON and used by the framework to configure provisioning via `terraform output provisioning`. * provides Terraform output that can be used by test operators to configure their shell via `$(terraform output environment)` * drops `remote-exec` provisioning steps from Terraform * makes changes to the deployment scripts to ensure they can be run multiple times w/ different versions against the same host.	2020-01-22 08:48:52 -05:00
Buck Doyle	576bcf554f	UI: Migrate to Storybook (#6507) I originally planned to add component documentation, but as this dragged on and I found that JSDoc-to-Markdown sometimes needed hand-tuning, I decided to skip it and focus on replicating what was already present in Freestyle. Adding documentation is a finite task that can be revisited in the future. My goal was to migrate everything from Freestyle with as few changes as possible. Some adaptations that I found necessary: • the DelayedArray and DelayedTruth utilities that delay component rendering until slightly after initial render because without them: ◦ charts were rendering with zero width ◦ the JSON viewer was rendering with empty content • Storybook in Ember renders components in a routerless/controllerless context by default, so some component stories needed changes: ◦ table pagination/sorting stories need access to query params, which necessitates some reaching into Ember internals to start routing and dynamically generate a Storybook route/controller to render components into ◦ some stories have a faux controller as part of their Storybook context that hosts setInterval-linked dynamic computed properties • some jiggery-pokery with anchor tags ◦ inert href='#' had to become href='javascript:;' ◦ links that are actually meant to navigate need target='_parent' so they don’t navigate inside the Storybook iframe Maybe some of these could be addressed by fixes in ember-cli-storybook but I’m wary of digging around in there any more than I already have, as I’ve lost a lot of time to Storybook confusion and frustrations already 😞 The STORYBOOK=true environment variable tweaks some environment settings to get things working as expected in the Storybook context. I chose to: • use angle bracket invocation within stories rather than have to migrate them soon after having moved to Storybook • keep Freestyle around for now for its palette and typeface components	2020-01-21 15:46:32 -06:00
Tim Gross	d6aac915a7	e2e: use valid jobspec for group check test (#6967) Group service checks cannot interpolate task fields, because the task fields are not available at the time the script check hook is created for the group service. When f31482a was merged this e2e test began failing because we are now correctly matching the script check ID to the service ID, which revealed this jobspec was invalid.	2020-01-21 15:54:46 -05:00
Buck Doyle	946709d47a	Update changelog	2020-01-21 09:10:54 -06:00
Buck Doyle	66ab14144a	ui: Change Run Job availability based on ACLs (#5944) This builds on API changes in #6017 and #6021 to conditionally turn off the “Run Job” button based on the current token’s capabilities, or the capabilities of the anonymous policy if no token is present. If you try to visit the job-run route directly, it redirects to the job list.	2020-01-20 14:57:01 -06:00
Tim Higgison	7a9a33f7a7	Update configuring-tasks.html.md	2020-01-17 11:41:47 +10:00
Tim Higgison	6cdd659f5d	Update ecs.html.md	2020-01-17 08:51:09 +10:00
Mahmood Ali	6294fddb9c	Update golang to 1.12.15	2020-01-16 14:49:44 -05:00
Mahmood Ali	ee76fa85ee	add a script to update golang version	2020-01-16 14:49:43 -05:00
Mahmood Ali	72a54a88b7	Avoid unnecessary golang version reference	2020-01-16 14:01:35 -05:00
Tim Gross	1e600d573d	e2e: improve reusability of provisioning scripts (#6942) This changeset is part of the work to improve our E2E provisioning process to allow our upgrade tests: * Move more of the setup into the AMI image creation so it's a little more obvious to provisioning config authors which bits are essential to deploying a specific version of Nomad. * Make the service file update do a systemd daemon-reload so that we can update an already-running cluster with the same script we use to deploy it initially.	2020-01-16 09:29:36 -05:00
Mahmood Ali	f36cc54efd	actually always canonicalize alloc.Job alloc.Job may be stale as well and need to migrate it. It does cost extra cycles but should be negligible.	2020-01-15 09:02:48 -05:00
Mahmood Ali	b1b714691c	address review comments	2020-01-15 08:57:05 -05:00
Michael Schurter	ffbfb60f40	test: restore e2e-test target and use -integration	2020-01-14 13:47:51 -08:00
Michael Schurter	da4645e9a4	test: download Vault binaries for e2e test Modernize Vault integration/e2e test a bit: - Download from releases.hashicorp.com instead of using a hardcoded list - Remove old unused make target e2e-test - Use NOMAD_E2E env var instead of -integration flag - Add a README On my machine with ~250 Mbps internet it takes ~400s to download all Vault binaries.	2020-01-14 11:02:02 -08:00
Charlie Voiselle	a4aeea0058	Upgrade -> Update	2020-01-13 17:36:49 -05:00

16927 commits
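
Note on e436d2701a ("Handle Nomad leadership flapping"): the message says a leadership notification channel must be "buffered or aggressively consumed" so the sender never blocks. The Go sketch below illustrates that idea only. It is not the code from the commit; `drainLatest`, the buffer size, and the sample values are hypothetical names and numbers chosen for illustration.

```go
package main

import "fmt"

// drainLatest (hypothetical helper) consumes every value currently queued on
// notifyCh without blocking and returns the most recent one. ok is false when
// nothing was queued. Intermediate transitions are dropped, so a burst of
// leadership flaps collapses into the final state.
func drainLatest(notifyCh <-chan bool) (latest bool, ok bool) {
	for {
		select {
		case v, open := <-notifyCh:
			if !open {
				// Channel closed: report whatever was seen so far.
				return latest, ok
			}
			latest, ok = v, true
		default:
			// Nothing left to read right now.
			return latest, ok
		}
	}
}

func main() {
	// Buffered so the producer (standing in for Raft here) does not block
	// on send while the consumer is busy. The size 8 is an arbitrary choice.
	notifyCh := make(chan bool, 8)

	// Simulate leadership flapping: gained, lost, gained, lost.
	for _, isLeader := range []bool{true, false, true, false} {
		notifyCh <- isLeader
	}

	latest, ok := drainLatest(notifyCh)
	fmt.Println(latest, ok)
}
```

A buffer alone only delays blocking until it fills, which is why the sketch pairs it with a consumer that empties the channel in one pass and acts only on the last value.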