open-nomad

Author	SHA1	Message	Date
hc-github-team-nomad-core	1a1e1d5d4d	Generate files for 1.6.0-rc.1 release	2023-07-11 15:19:54 +00:00
Tim Gross	6a9deaf7be	Prepare release 1.6.0-rc.1	2023-07-11 11:09:18 -04:00
hc-github-team-nomad-core	0951fe1c50	backport of commit 0a5e90120b18ff450457463d6bcee68ec6804bb0 (#17900 ) This pull request was automerged via backport-assistant	2023-07-11 10:00:05 -05:00
Kévin Dunglas	9f0f897077	docs: fix typo in regex_replace.mdx (#17891 )	2023-07-11 14:03:40 +01:00
Lance Haig	0455389534	Add the ability to customise the details of the CA (#17309 ) Co-authored-by: James Rasell <jrasell@users.noreply.github.com>	2023-07-11 08:53:09 +01:00
hashicorp-copywrite[bot]	2b85290d55	[COMPLIANCE] Add Copyright and License Headers (#17877 ) Co-authored-by: hashicorp-copywrite[bot] <110428419+hashicorp-copywrite[bot]@users.noreply.github.com>	2023-07-11 07:48:11 +01:00
Michael Schurter	c82f439a6d	remove empty file (#17853 )	2023-07-10 16:34:10 -07:00
Michael Schurter	278fd44a8b	docs: v1.6.0 requires ipc_lock cap for mlock (#17881 ) Fixes #17780	2023-07-10 11:53:07 -07:00
Tim Gross	ad7355e58b	CSI: persist previous mounts on client to restore during restart (#17840 ) When claiming a CSI volume, we need to ensure the CSI node plugin is running before we send any CSI RPCs. This extends even to the controller publish RPC because it requires the storage provider's "external node ID" for the client. This primarily impacts client restarts but also is a problem if the node plugin exits (and fingerprints) while the allocation that needs a CSI volume claim is being placed. Unfortunately there's no mapping of volume to plugin ID available in the jobspec, so we don't have enough information to wait on plugins until we either get the volume from the server or retrieve the plugin ID from data we've persisted on the client. If we always require getting the volume from the server before making the claim, a client restart for disconnected clients will cause all the allocations that need CSI volumes to fail. Even while connected, checking in with the server to verify the volume's plugin before trying to make a claim RPC is inherently racy, so we'll leave that case as-is and it will fail the claim if the node plugin needed to support a newly-placed allocation is flapping such that the node fingerprint is changing. This changeset persists a minimum subset of data about the volume and its plugin in the client state DB, and retrieves that data during the CSI hook's prerun to avoid re-claiming and remounting the volume unnecessarily. This changeset also updates the RPC handler to use the external node ID from the claim whenever it is available. Fixes: #13028	2023-07-10 13:20:15 -04:00
Devashish Taneja	0d9dee3cbe	Include parent job ID as a Docker container label (#17843 ) Fixes: #17751	2023-07-10 11:27:45 -04:00
Daniel Bennett	30b1b88332	ci: more self-hosted iops for checks workflow (#17852 )	2023-07-10 10:21:04 -05:00
James Rasell	3bfec68556	docs: detail Consul ACL token env var config option. (#17859 )	2023-07-10 14:26:18 +01:00
dependabot[bot]	771a96ee55	build(deps): bump github.com/hashicorp/cronexpr in /api (#17787 )	2023-07-10 11:23:00 +01:00
James Rasell	5571890974	e2e: respect timeout value when waiting for allocs in v3. (#17800 )	2023-07-10 09:47:10 +01:00
Tim Gross	5025731ebe	consul: handle "not found" errors from Consul when deleting tokens (#17847 ) In Consul 1.15.0, the Delete Token API was changed so as to return an error when deleting a non-existent ACL token. This means that if Nomad successfully deletes the token but fails to persist that fact, it will get stuck trying to delete a non-existent token forever. Update the token deletion function to ignore "not found" errors and treat them as successful deletions. Fixes: #17833	2023-07-07 16:22:13 -04:00
Daniel Bennett	30a99926dc	ci: pull secrets from Vault in nomad-enterprise (#17841 )	2023-07-07 14:27:12 -05:00
Seth Hoenig	4452f0623b	env/aws: updates from ec2info (#17835 )	2023-07-07 10:12:05 -05:00
Daniel Bennett	a3924db96c	ci: windows tests on public runners (#17829 ) currently our self-hosted windows runners lack `docker`, so for now just revert to public runners.	2023-07-06 17:06:55 -05:00
Yorick Gersie	3e66291b0e	cni: ensure to setup CNI addresses in deterministic order (#17766 ) * cni: ensure to setup CNI addresses in deterministic order Currently as commented in the code the go-cni library returns an unordered map of interfaces. In cases where there are multiple CNI interfaces being created this creates a problem with service registration and healthchecking because the first address in the map is being used. The use case we have where this is an issue is that we run CNI with the macvlan plugin to isolate workloads, but they still need to be able to access the host on a static address to be able to perform local resolving and hit host services like the Consul agent API. To make this work there are 2 options, you either add a macvlan interface on the host with an assigned address for each VLAN you have or you create an additional veth bridged interface in the container namespace. We chose the latter option through a custom CNI plugin but the ordering issue leaves us with incorrect service registration. * Updates after feedback * First check for the CNIResult interfaces length, if it's zero we don't need to proceed at all. * Use sorted interfaces list for the address fallback scenario as well. * Remove "found" log message logic, when an address isn't found an error is returned stating the allocation could not be configured as an address was missing from the CNIResult. If we still need a Warn message then we can add it to the condition that returns the error if no address could be found instead of using the "found" bool logic.	2023-07-06 13:25:29 -07:00
Seth Hoenig	edd0a405d7	website: use full registry name so it works with podman again (#17809 )	2023-07-06 13:22:12 -05:00
Daniel Bennett	c272cb0d5a	ci: clean GOCACHE before build (#17808 ) this is basically to avoid Fear/Uncertainty/Doubt the github action actions/setup-go (and, with a different chache key, hashicorp/setup-golang) caches both GOMODCACHE (go source files), which is good, and GOCACHE (build outputs), which might be bad, if the cache was built on an OS with an older glibc than we want to support. from `go help cache`: > [...] the build cache does not detect changes to > C libraries imported with cgo.	2023-07-06 12:47:43 -05:00
Daniel Bennett	4c2cb7b701	ci: dynamic runs-on values for oss/ent (#17775 ) so in enterprise we can use Vault for secrets, without merge conflicts from oss->ent. also: * use hashicorp/setup-golang * setup-js for self-hosted runners they don't come with yarn, nor chrome, and might not always match node version.	2023-07-06 12:41:17 -05:00
am-ak	3ca370dd03	docs: fix broken link in security model docs (#17812 ) correcting a broken link under "similar to consul" and correcting list formatting under "general mechanisms"	2023-07-06 10:01:36 -04:00
Patric Stout	ebb363d43e	metrics: add "total_ticks_count" for CPU metrics (#17579 ) This counter tells you the total amount of ticks for that CPU entry since the start of Nomad.	2023-07-05 10:28:55 -04:00
deverton-godaddy	f44793d377	[api] Add NetworkStatus to allocation response (#17280 ) Service discovery or mesh network systems consuming the Nomad event stream or API need to know the CNI assigned IP for the allocation. This data is returned by the underlying Nomad API but isn't mapped in the response struct.	2023-07-04 19:35:38 -04:00
James Rasell	4289de5986	docs: fix up constraint jobspec HCL format. (#17795 )	2023-07-04 13:33:46 +01:00
Phil Renaud	5cc0b39683	Report shows a 3rd party browser extension puts a banner at the top of page and awkwardly shifts nav; this fixes that (#17783 )	2023-06-30 17:09:42 -04:00
Phil Renaud	d559072e48	[ui] Text wrap long lines of code and logs (#17754 ) * Text and code wrapping as a localStorage var * task-log uses wrapping and kb shortcut * Word wrap keyboard labels * Wrapper as a toggle not a button * Changelog and fixed an extra space trailing log lines * Moves toggle to inside * Acceptance tests for ww and toggle click	2023-06-30 17:07:57 -04:00
Tim Gross	e7cc7f2123	docs: clarify network topology requirements for clients (#17779 ) The requirements for client-to-server and client-to-client topologies are not well-documented in the production install requirements docs. Document that clients make connections to servers (and not the other way around), and that clients don't need to communicate with each other (with some exceptions). Fixes: #17631	2023-06-30 10:46:29 -04:00
James Rasell	45073e8a05	job: ensure node pool is canonicalized for state restores. (#17765 )	2023-06-30 07:37:22 +01:00
Sarah Thompson	8c6cc5b1d8	Update the revision used by the docker build action. (#17755 ) Update the revision used by the docker action. This should always reflect the commit that's being built as this may differ from the default <github.sha> that the workflow was invoked at. Goes with https://github.com/hashicorp/actions-docker-build/pull/59 - and should not be merged until this PR is merged and a new version of the action is cut.	2023-06-29 09:19:54 -04:00
Phil Renaud	01d6a94eac	[ui] HCL-in-UI: Re-arrange buttons, add save-as-file (#17752 ) * Move buttons over as expected * Let a user download file locally * test mock fns for jobeditor * Changelog	2023-06-28 21:57:03 -04:00
Daniel Bennett	77a8d79bb5	e2e: use DNS instead of HTTP to get my_public_ipv4 (#17759 )	2023-06-28 13:11:57 -05:00
Tim Gross	70eca4e7f9	Merge pull request #17758 from hashicorp/post-release-1.6.0-beta.1 Post release 1.6.0 beta.1	2023-06-28 12:26:00 -04:00
hc-github-team-nomad-core	6f5401ae3a	Prepare for next release	2023-06-28 11:06:28 -04:00
hc-github-team-nomad-core	5c703a49b1	Generate files for 1.6.0-beta.1 release	2023-06-28 11:06:20 -04:00
Tim Gross	6a24ffac1c	release: submit build workflow from the file on the release's own branch	2023-06-28 11:06:13 -04:00
Tim Gross	06c7974120	Prepare release 1.6.0-beta.1	2023-06-28 11:06:05 -04:00
Phil Renaud	5f8c4b3d48	Link to allocations.allocation by ID reference, not by model (#17753 )	2023-06-28 10:00:59 -04:00
Phil Renaud	bd1ec095d3	[ui] Move Placement Failures notification above job status panel (#17750 ) * Moves the Placement Failures box above job status, should it exist * Move it for non-service job-types as well	2023-06-27 19:32:51 -04:00
Phil Renaud	7a60c69b73	[ui] links to allocations explicitly go through their route model hook (#17737 ) * links to allocations explicitly go through their route model hook * Acceptance test to make sure alloc clicking loads alloc endpoint obj	2023-06-27 10:01:50 -04:00
Seth Hoenig	cfb7efc478	fix changelog entry typo (#17743 )	2023-06-27 08:02:06 -05:00
Seth Hoenig	4771690582	deps: update cronexpr to capture license file in SBOM tools (#17733 )	2023-06-27 07:58:20 -05:00
Juana De La Cuesta	28b66d2400	Update checklist-rpc-endpoint.md (#17698 ) * Update checklist-rpc-endpoint.md * Update checklist-rpc-endpoint.md * Update contributing/checklist-rpc-endpoint.md Co-authored-by: Tim Gross <tgross@hashicorp.com> --------- Co-authored-by: Tim Gross <tgross@hashicorp.com>	2023-06-27 10:52:38 +02:00
Phil Renaud	32af971bcb	Node Pools moved to after Type in jobs index columns (#17738 )	2023-06-26 17:00:01 -04:00
Seth Hoenig	d590123637	drivers/docker: refactor use of clients in docker driver (#17731 ) * drivers/docker: refactor use of clients in docker driver This PR refactors how we manage the two underlying clients used by the docker driver for communicating with the docker daemon. We keep two clients - one with a hard-coded timeout that applies to all operations no matter what, intended for use with short lived / async calls to docker. The other has no timeout and is the responsibility of the caller to set a context that will ensure the call eventually terminates. The use of these two clients has been confusing and mistakes were made in a number of places where calls were making use of the wrong client. This PR makes it so that a user must explicitly call a function to get the client that makes sense for that use case. Fixes #17023 * cr: followup items	2023-06-26 15:21:42 -05:00
sejalapeno	4c6906d873	Update allocations.go (#17726 ) * Update allocations.go updated missing client status "unknown" #17688 * changelog * Update .changelog/17726.txt adding relevant desc. Co-authored-by: Seth Hoenig <shoenig@duck.com> --------- Co-authored-by: Seth Hoenig <shoenig@duck.com>	2023-06-26 13:33:29 -05:00
nicoche	649831c1d3	deploymentwatcher: fail early whenever possible (#17341 ) Given a deployment that has a `progress_deadline`, if a task group runs out of reschedule attempts, allow it to fail at this time instead of waiting until the `progress_deadline` is reached. Fixes: #17260	2023-06-26 14:01:03 -04:00
Phil Renaud	81edceb2de	[ui] alignment and spacing for job status panel (#17708 ) * CSS alignment and spacing for job status panel * Only fade the count, not the legend icon, when count is 0 * Unrounded version corners * changelog * css has to only remove border radius when count is present * Seed stabilization for services test * Try consolidating the testfixes from before * Total test isolation and bonus logs * Drop the isolation but keep the logs * Remove bonus logging	2023-06-26 12:18:12 -04:00
hashicorp-copywrite[bot]	e901340c3f	[COMPLIANCE] Add Copyright and License Headers (#17732 ) Co-authored-by: hashicorp-copywrite[bot] <110428419+hashicorp-copywrite[bot]@users.noreply.github.com>	2023-06-26 11:11:17 -05:00

... 3 4 5 6 7 ...

25061 commits