open-nomad

Commit Graph

Author	SHA1	Message	Date
Tim Gross	12d5eab2d1	docs: split out unsupported versions in changelog (#17704 ) Our changelog has become large enough that GitHub's rendering is very slow, resulting in error pages ("angry unicorns"). Split out the older unsupported versions of Nomad into their own file so that we only need to render the most recent versions, while keeping the older versions relatively searchable by having them in a single file.	2023-06-23 15:17:57 -04:00
grembo	7936c1e33f	Add `disable_file` parameter to job's `vault` stanza (#13343 ) This complements the `env` parameter, so that the operator can author tasks that don't share their Vault token with the workload when using `image` filesystem isolation. As a result, more powerful tokens can be used in a job definition, allowing it to use template stanzas to issue all kinds of secrets (database secrets, Vault tokens with very specific policies, etc.), without sharing that issuing power with the task itself. This is accomplished by creating a directory called `private` within the task's working directory, which shares many properties of the `secrets` directory (tmpfs where possible, not accessible by `nomad alloc fs` or Nomad's web UI), but isn't mounted into/bound to the container. If the `disable_file` parameter is set to `false` (its default), the Vault token is also written to the NOMAD_SECRETS_DIR, so the default behavior is backwards compatible. Even if the operator never changes the default, they will still benefit from the improved behavior of Nomad never reading the token back in from that - potentially altered - location.	2023-06-23 15:15:04 -04:00
Michael Lange	faa3377a56	Merge pull request #17691 from hashicorp/f/missing-chart-stories [UI] Missing chart stories	2023-06-23 08:17:34 -07:00
James Rasell	b9440965db	client: remove unused nsd check allocation result diff func (#17695 )	2023-06-23 15:26:06 +01:00
Seth Hoenig	2c7877658c	e2e: create a v3/ set of packages for creating Nomad e2e tests (#17620 ) * e2e: create a v3/ set of packages for creating Nomad e2e tests This PR creates an experimental set of packages under `e2e/v3/` for crafting Nomad e2e tests. Unlike previous generations, this is an attempt at providing a way to create tests in a declarative (ish) pattern, with a focus on being easy to use, easy to cleanup, and easy to debug. @shoenig is just trying this out to see how it goes. Lots of features need to be implemented. Many more docs need to be written. Breaking changes are to be expected. There are known and unknown bugs. No warranty. Quick run of `example` with verbose logging. ```shell ➜ NOMAD_E2E_VERBOSE=1 go test -v === RUN TestExample === RUN TestExample/testSleep util3.go:25: register (service) job: "sleep-809" util3.go:25: checking eval: 9f0ae04d-7259-9333-3763-44d0592d03a1, status: pending util3.go:25: checking eval: 9f0ae04d-7259-9333-3763-44d0592d03a1, status: complete util3.go:25: checking deployment: a85ad2f8-269c-6620-d390-8eac7a9c397d, status: running util3.go:25: checking deployment: a85ad2f8-269c-6620-d390-8eac7a9c397d, status: running util3.go:25: checking deployment: a85ad2f8-269c-6620-d390-8eac7a9c397d, status: running util3.go:25: checking deployment: a85ad2f8-269c-6620-d390-8eac7a9c397d, status: running util3.go:25: checking deployment: a85ad2f8-269c-6620-d390-8eac7a9c397d, status: successful util3.go:25: deployment a85ad2f8-269c-6620-d390-8eac7a9c397d was a success util3.go:25: deregister job "sleep-809" util3.go:25: system gc === RUN TestExample/testNamespace util3.go:25: apply namespace "example-291" util3.go:25: register (service) job: "sleep-967" util3.go:25: checking eval: a2a2303a-adf1-2621-042e-a9654292e569, status: pending util3.go:25: checking eval: a2a2303a-adf1-2621-042e-a9654292e569, status: complete util3.go:25: checking deployment: 3395e9a8-3ffc-8990-d5b8-cc0ce311f302, status: running util3.go:25: checking deployment: 3395e9a8-3ffc-8990-d5b8-cc0ce311f302, status: running util3.go:25: checking deployment: 3395e9a8-3ffc-8990-d5b8-cc0ce311f302, status: running util3.go:25: checking deployment: 3395e9a8-3ffc-8990-d5b8-cc0ce311f302, status: successful util3.go:25: deployment 3395e9a8-3ffc-8990-d5b8-cc0ce311f302 was a success util3.go:25: deregister job "sleep-967" util3.go:25: system gc util3.go:25: cleanup namespace "example-291" === RUN TestExample/testEnv util3.go:25: register (batch) job: "env-582" util3.go:25: checking eval: 600f3bce-ea17-6d13-9d20-9d9eb2a784f7, status: pending util3.go:25: checking eval: 600f3bce-ea17-6d13-9d20-9d9eb2a784f7, status: complete util3.go:25: deregister job "env-582" util3.go:25: system gc --- PASS: TestExample (10.08s) --- PASS: TestExample/testSleep (5.02s) --- PASS: TestExample/testNamespace (4.02s) --- PASS: TestExample/testEnv (1.03s) PASS ok github.com/hashicorp/nomad/e2e/example 10.079s ``` * cluster3: use filter for kernel.name instead of filtering manually	2023-06-23 09:10:49 -05:00
James Rasell	78cdf0d0d8	server: remove unused endpoints struct. (#17665 )	2023-06-23 08:20:33 +01:00
Luiz Aoqui	f785da4748	ci: fix flaky UI test (#17676 )	2023-06-22 23:07:36 -04:00
Michael Lange	41f6f7e04f	TopoViz story that is sourced from Mirage Unfortunately due to the split build nature of the ember app and storybook it isn't possible to import mirage in the storybook context to control scenarios via a knob :(	2023-06-22 16:55:36 -07:00
Michael Lange	85371941c4	Full TopoViz story	2023-06-22 16:55:25 -07:00
Michael Lange	cb30ef1a0f	TopoViz child component stories	2023-06-22 15:03:32 -07:00
Michael Lange	859374ecad	Standard usage story	2023-06-22 13:53:21 -07:00
Michael Lange	de09e3f51a	Basic recommendation-chart story with knobs	2023-06-22 13:53:21 -07:00
Phil Renaud	7373261b58	[ui] Versions added to deploying status panel (#17629 ) * Versions added to deploying status panel * Wrap the running and healthy title in a span * Versions in the deployment UI next to titles * Version count and label styles updated	2023-06-22 16:19:41 -04:00
Tim Gross	12ca68ec26	release pipeline: fix ref arguments in invoking workflow (#17684 ) Although #17669 fixed the permissions of the release pipeline to push new commits, there was still an error when invoking the `build` workflow. The format of the reference was changed in #17103 such that we're sending the git ref (a SHA) and not the "--ref" argument required by the GH actions workflow API, which in this case is apparently specially defined as "The branch or tag name which contains the version of the workflow file you'd like to run" and not what git calls a "ref". This changeset: * Removes the third-party action entirely so that we're using GitHub's own tooling. This removes one more thing from the supply chain to pin and ensures a 1:1 mapping of args to what's documented by GitHub. * Removes the `--ref` argument entirely, which causes it to default to the current branch that the release workflow is running on (which is always what we want).	2023-06-22 15:33:19 -04:00
Phil Renaud	cf21a246c0	[ui, deployments] job status panel legend: counts of 0 don't get links (#17644 )	2023-06-22 14:40:11 -04:00
Luiz Aoqui	e66a7bbefe	core: remove unnecessary call to SetNodes and adds DC downgrade test (#17655 )	2023-06-22 13:26:14 -04:00
Jai	7103ce1957	ui: create node pool model (#17301 ) Co-authored-by: Phil Renaud <phil@riotindustries.com> Co-authored-by: Luiz Aoqui <luiz@hashicorp.com>	2023-06-22 13:11:44 -04:00
Luiz Aoqui	8f05eaaa68	np: check for license on RPC endpoints (#17656 )	2023-06-22 12:52:20 -04:00
Luiz Aoqui	53dd8835b8	ci: set `continue-on-error: true` on `test-ui` (#17646 ) Since the matrix exercises different test cases, it's better to allow all partitions to completely run, even if one of them fails, so it's easier to catch multiple test failures.	2023-06-22 11:31:49 -04:00
Tim Gross	11216d09af	client: send node secret with every client-to-server RPC (#16799 ) In Nomad 1.5.3 we fixed a security bug that allowed bypass of ACL checks if the request came thru a client node first. But this fix broke (knowingly) the identification of many client-to-server RPCs. These will be now measured as if they were anonymous. The reason for this is that many client-to-server RPCs do not send the node secret and instead rely on the protection of mTLS. This changeset ensures that the node secret is being sent with every client-to-server RPC request. In a future version of Nomad we can add enforcement on the server side, but this was left out of this changeset to reduce risks to the safe upgrade path. Sending the node secret as an auth token introduces a new problem during initial introduction of a client. Clients send many RPCs concurrently with `Node.Register`, but until the node is registered the node secret is unknown to the server and will be rejected as invalid. This causes permission denied errors. To fix that, this changeset introduces a gate on having successfully made a `Node.Register` RPC before any other RPCs can be sent (except for `Status.Ping`, which we need earlier but which also ignores the error because that handler doesn't do an authorization check). This ensures that we only send requests with a node secret already known to the server. This also makes client startup a little easier to reason about because we know `Node.Register` must succeed first, and it should make for a good place to hook in future plans for secure introduction of nodes. The tradeoff is that an existing client that has running allocs will take slightly longer (a second or two) to transition to ready after a restart, because the transition in `Node.UpdateStatus` is gated at the server by first submitting `Node.UpdateAlloc` with client alloc updates.	2023-06-22 11:06:49 -04:00
Luiz Aoqui	ca3c004130	ci: fix some flaky UI tests (#17648 ) These tests would fail depending on the value of the seed used.	2023-06-22 10:51:07 -04:00
Tim Gross	70a359048e	release pipeline: release workflow needs write permissions (#17669 ) In #17103 we set read-only permissions on all the workflows. Unfortunately we missed that the `release` workflow makes git commits and pushes them to the repository, so it needs to have write permissions.	2023-06-22 10:40:45 -04:00
Luiz Aoqui	0549b880ef	ui: display mirage scenario in header label (#17649 ) This information is useful when switching between different scenarios for testing.	2023-06-22 10:38:17 -04:00
Seth Hoenig	5138c5b99e	client: do not disable memory swappiness if kernel does not support it (#17625 ) * client: do not disable memory swappiness if kernel does not support it This PR adds a workaround for very old Linux kernels which do not support the memory swappiness interface file. Normally we write a "0" to the file to explicitly disable swap. In the case the kernel does not support it, give libcontainer a nil value so it does not write anything. Fixes #17448 * client: detect swappiness by writing to the file * fixup changelog Co-authored-by: James Rasell <jrasell@users.noreply.github.com> --------- Co-authored-by: James Rasell <jrasell@users.noreply.github.com>	2023-06-22 09:36:31 -05:00
Luiz Aoqui	9f5c02d947	ui: add tooltips to the Topology labels (#17647 ) Add tooltips to labels in nodes and datacenters for the Topology view page to clarify what each value represents.	2023-06-22 10:33:42 -04:00
Luiz Aoqui	3d761e712b	ui: remove redundant columns from child job table (#17645 ) Namespace, job type, and priority are already available from the parent job header, so displaying them in the table caused it to be too crowded.	2023-06-22 10:22:41 -04:00
James Rasell	4e2d019639	variables: remove unused state store functions. (#17660 )	2023-06-22 13:54:58 +01:00
James Rasell	71fdd7e891	core: use faster concatenation for alloc name generation. (#17591 )	2023-06-22 07:46:28 +01:00
Luiz Aoqui	ac08fc751b	node pools: apply node pool scheduler configuration (#17598 )	2023-06-21 20:31:50 -04:00
Phil Renaud	16886bf6bf	Moves to the current LTS release of Node for our build and release workflows (#17639 )	2023-06-21 15:17:24 -04:00
Phil Renaud	94507cc7b7	[ui] General status for steady-state jobs (#17599 ) * Degraded vs Healthy etc. status * Standardize the look of a deploying status panel * badge styles * remove job.status from title component in favour of in-panel status * Remove a redundant check * re-attrd fail-deployment button considered	2023-06-21 11:57:28 -04:00
VishnuJin	67efb19e94	fingerprint: added windows os.build attribute to host fingerprint (#17576 )	2023-06-21 10:53:50 -04:00
Michael Lange	0f5fd19950	Merge pull request #17626 from hashicorp/f/ui-test-splitting [UI, CI] Test splitting	2023-06-20 20:21:37 -07:00
Michael Lange	753a7dfea3	Tag the GHA run for percy to use Percy uses this to stitch parallel test runs back together into a single report.	2023-06-20 15:38:05 -07:00
Michael Lange	644b67b5c5	Simplify workflows After renovating everything, it's evident that the ember-exam sub-workflow can be inlined without any pesky duplication.	2023-06-20 15:05:17 -07:00
Michael Lange	495ada2072	Pipe secrets through to exam job	2023-06-20 14:49:57 -07:00
Michael Lange	918f503e51	Rip out the xUnit test reporter This was used to integrate with Circle CI's deeper test reporting (failures, flakes, reporting). It's strictly vestigial now that we're on GHA.	2023-06-20 14:49:56 -07:00
Michael Lange	125d2f3e7f	Use a matrix strategy to run exam partitions This will run partitions and parallel only after linting passes.	2023-06-20 13:51:45 -07:00
Michael Lange	370327fd13	Move the ember exam workflow into its own reusable job This will be called N times by the parent test-ui script.	2023-06-20 13:51:45 -07:00
Michael Lange	0e0d524f84	New generic exam:parallel yarn script This is intended to be used like `yarn exam:parallel -- more --options` This way a split and partition can be provided by CI without CI also needing to deal with percy details.	2023-06-20 13:51:45 -07:00
Michael Lange	f684882220	Merge pull request #17624 from hashicorp/b/ui-audit-workflow [UI, CI] Bump the ember-test-audit workflow to node 18	2023-06-20 13:48:31 -07:00
Phil Renaud	d5a2671969	[ui] Keyboard shortcuts for Promote Canary and Fail Deployment (#17568 )	2023-06-20 15:43:32 -04:00
Phil Renaud	7bac272095	Specific health_unknown getter that only looks at running allocs (#17566 )	2023-06-20 15:03:46 -04:00
Michael Lange	53643f05a7	Bump the ember-test-audit workflow to node 18	2023-06-20 10:31:24 -07:00
Tim Gross	ff9ba8ff73	scheduler: tolerate having only one dynamic port available (#17619 ) If the dynamic port range for a node is set so that the min is equal to the max, there's only one port available and this passes config validation. But the scheduler panics when it tries to pick a random port. Only add the randomness when there's more than one to pick from. Adds a test for the behavior but also adjusts the commentary on a couple of the existing tests that made it seem like this case was already covered if you didn't look too closely. Fixes: #17585	2023-06-20 13:29:25 -04:00
Daniel Bennett	e58ba84a9e	e2e: fix windows client docker (#17572 ) the windows docker install script stopped working. after trying various things to fix the script, I opted instead for a base image that comes with docker already installed. error output during build was: Installing Docker. WARNING: Cannot find path 'C:\Users\Administrator\AppData\Local\Temp\DockerMsftProvider\DockerDefault_DockerSearchIndex.json' because it does not exist. WARNING: Cannot bind argument to parameter 'downloadURL' because it is an empty string. WARNING: The property 'AbsoluteUri' cannot be found on this object. Verify that the property exists. WARNING: The property 'RequestMessage' cannot be found on this object. Verify that the property exists. Failed to install Docker. Install-Package : No match was found for the specified search criteria and package name 'docker'.	2023-06-20 10:17:16 -05:00
Luiz Aoqui	2f5df1d8a4	test: add MultiregionMinJob mock (#17614 )	2023-06-20 10:57:02 -04:00
James Rasell	86e4c6cb9d	state: move variables tests to use must library. (#17609 )	2023-06-20 15:46:16 +01:00
James Rasell	68df578c73	state: remove vague scaling event schema todo item. (#17610 )	2023-06-20 15:22:11 +01:00
Luiz Aoqui	a56b10e857	chore: fix typo and copyright header (#17605 )	2023-06-20 10:09:47 -04:00

24798 Commits

All Branches