open-nomad

Commit Graph

Author	SHA1	Message	Date
Ivo	703a7954f4	[terraform/aws] Fix NVidia GPG key error (#12985 ) * Fix NVidia GPG key error - NVidia rotated their repo keys, see https://forums.developer.nvidia.com/t/notice-cuda-linux-repository-key-rotation/212771	2022-05-16 06:49:01 -04:00
Karan Sharma	e0be868b79	docs: Fix typo in sidecar_service (#13021 )	2022-05-16 09:35:42 +02:00
Luiz Aoqui	525c0fadf4	add missing changelog entry for 1.2.7	2022-05-13 17:42:14 -04:00
Luiz Aoqui	d46acb7147	Merge release 1.3.0 files	2022-05-13 17:33:09 -04:00
hc-github-team-nomad-core	b28fcac665	Prepare for next release	2022-05-13 17:32:36 -04:00
hc-github-team-nomad-core	8c5dbe1a44	Generate files for 1.3.0 release	2022-05-13 17:32:20 -04:00
hc-github-team-nomad-core	214a4841b8	Prepare for next release	2022-05-13 17:32:11 -04:00
hc-github-team-nomad-core	b0ec54c885	Generate files for 1.3.0-rc.1 release	2022-05-13 17:31:57 -04:00
Phil Renaud	45dc1cfd58	12986 UI fails to load job when there is an "@" in job name in nomad 130 (#13012 ) * LastIndexOf and always append a namespace on job links * Confirmed the volume equivalent and simplified idWIthNamespace logic * Changelog added * PR comments addressed * Drop the redirect for the time being * Tests updated to reflect namespace on links * Task detail test default namespace link for test	2022-05-13 17:01:27 -04:00
Tim Gross	faeb3fcd44	scheduler: volume updates should always be destructive (#13008 )	2022-05-13 11:34:04 -04:00
dependabot[bot]	4ae15399bd	build(deps): bump cross-fetch from 3.1.4 to 3.1.5 in /website (#12818 ) Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2022-05-13 09:31:49 -05:00
James Rasell	636b647a30	agent: fix panic when logging about protocol version config use. (#12962 ) The log line comes before the agent logger has been setup, therefore we need to use the UI logging to avoid panic.	2022-05-13 09:28:43 +02:00
Michael Schurter	7f8cf9e2dc	docs: link s/port-plan-failure to more helpful doc (#12968 ) The shortlink /s/port-plan-failure is logged when a plan for a node is rejected to help users debug and mitigate repeated `plan for node rejected` failures. The current link to #9506 is... less than useful. It is not clear to users what steps they should take to either fix their cluster or contribute to the issue. While .../monitoring-nomad#progess isn't as comprehensive as it could be, it's a much more gentle introduction to the class of bug than the original issue.	2022-05-12 13:59:17 -07:00
Tim Gross	6e5d6eb3b5	docs: note that already-dispatched jobs cannot be updated (#12973 )	2022-05-12 16:18:42 -04:00
Phil Renaud	916dbdcd2f	Visual diff tests seed-stabilized by default (#12965 ) * Seed-stabilization by default * Hide right-column of topology viz route * Remove seedless run from thee test:* suite * Related evals paths render too late * Vis:Hidden another topo viz unstable item	2022-05-12 16:09:19 -04:00
Tim Gross	ae2d7d6727	docs: remove beta tag for CSI from sidebar (#12970 )	2022-05-12 14:12:40 -04:00
Eng Zer Jun	97d1bc735c	test: use `T.TempDir` to create temporary test directory (#12853 ) * test: use `T.TempDir` to create temporary test directory This commit replaces `ioutil.TempDir` with `t.TempDir` in tests. The directory created by `t.TempDir` is automatically removed when the test and all its subtests complete. Prior to this commit, temporary directory created using `ioutil.TempDir` needs to be removed manually by calling `os.RemoveAll`, which is omitted in some tests. The error handling boilerplate e.g. defer func() { if err := os.RemoveAll(dir); err != nil { t.Fatal(err) } } is also tedious, but `t.TempDir` handles this for us nicely. Reference: https://pkg.go.dev/testing#T.TempDir Signed-off-by: Eng Zer Jun <engzerjun@gmail.com> * test: fix TestLogmon_Start_restart on Windows Signed-off-by: Eng Zer Jun <engzerjun@gmail.com> * test: fix failing TestConsul_Integration t.TempDir fails to perform the cleanup properly because the folder is still in use testing.go:967: TempDir RemoveAll cleanup: unlinkat /tmp/TestConsul_Integration2837567823/002/191a6f1a-5371-cf7c-da38-220fe85d10e5/web/secrets: device or resource busy Signed-off-by: Eng Zer Jun <engzerjun@gmail.com>	2022-05-12 11:42:40 -04:00
Michael Schurter	5a43d3c675	docs: add `sysbatch` to scheduling internals (#12954 )	2022-05-11 17:06:17 -07:00
Luiz Aoqui	ed7798fbdf	prepare for next release (#12956 )	2022-05-11 17:42:53 -04:00
Seth Hoenig	466f0c3fd2	build: use new version of hc-install (#12937 ) https://github.com/shoenig/hc-install/pull/2 Uses new version of hc-install which supports the new json content type reported by api.releases.hashicorp.com	2022-05-10 15:28:29 -04:00
Georges-Etienne Legendre	864be37c73	Fix Exec not working with reverse proxy X-Nomad-Token (#12925 ) * Capture token secret on fetch * Fix tests * Fix lint errors	2022-05-10 13:42:12 -04:00
modrake	cad8c00f9a	Merge pull request #12913 from hashicorp/mdrake/svc-acct-codeowner add service acct to codeowners for backport merging	2022-05-06 10:44:31 -07:00
Morgan Drake	52b09953ab	add service acct to codeowners for backport merging	2022-05-06 10:06:20 -07:00
Chetan Sarva	14752cd2c0	docs: add version note to nomad services template (#12910 )	2022-05-06 17:39:27 +02:00
Phil Renaud	dd824ac3f8	Changelog for visual diff tests (#12909 )	2022-05-06 11:29:10 -04:00
Luiz Aoqui	eb50273b32	ci: update backport assitant workflow (#12899 ) Remove the step to automatically backport `backport/website` PRs to the latest release. This will be done manually by adding the proper tags. Also use squash backports to match the pattern we use for `main`.	2022-05-06 10:15:59 -04:00
James Rasell	9ea1a6faf6	fsm: add service registration snapshot persistence. (#12896 )	2022-05-06 15:53:27 +02:00
Luiz Aoqui	c502a249b9	ci: revert file changes and add some checks (#12873 ) During the release there are several files that need to be modified: - .release/ci.hcl: the notification channel needs to be updated to a channel with greater team visibility during the release. - version/version.go: the Version and VersionPrerelease variables need to be set so they match the release version. After the release these files need to be reverted. For GA releases the following additional changes also need to happen: - version/version.go: the Version variable needs to be bumped to the next version number. - GNUMakefile: the LAST_RELEASE variable needs to be set to the version that was just released. Since the release process will commit file changes to the branch being used for the release, it should _never_ run on main, so the first step is now to protect against that. It also adds a validation to make the user input version is correct. After looking at the different release options and steps I noticed that automatic CHANGELOG generation is actually the exception, so it would be better to have the default to be false.	2022-05-05 18:07:51 -04:00
Phil Renaud	6a8f98723e	Chronological most-recent evals by default (#12847 ) * Chronological most-recent evals by default * Adding reverse: true to the list of expected queryparams in test * changelog	2022-05-05 16:11:27 -04:00
Phil Renaud	b67bd4c377	Percy snapshot tests (#12872 ) * Sample percy test added * Node engine up to 14.x for UI prep * Force ui test rerun * Updated config.yml * Node v upgraded to 14 for docker image * Expect length in test * Running ember tests under percy exec * Percy exec format * Percy cli added * Noop to rerun tests with updated percy_token * Evals full list and details open snapshots * Pretty legit use of assert so disable the warning * Jobs list tests * Snapshots for top-level clients, servers, ACL, topology, and storage lists * Expect caveat for Topology test * Stabilizing tests with faker seeded to 1 * Seed-stabilizing any tests with percySnapshots * Faker import * Drop unused param * Assets and test audit using an older node version * New strategy: avoid seeding, just use percyCSS to hide certain things	2022-05-05 16:05:13 -04:00
Seth Hoenig	90ff784dcf	Merge pull request #12875 from hashicorp/b-cgroupsv2-task-restarts cgroups: make sure cgroup still exists after task restart	2022-05-05 10:54:29 -05:00
Tim Gross	26b9f88ef3	docs: add missing `set_contains_any` constraint docs (#12886 ) This constraint and affinity was added in 0.9.x but was only documented for affinities. Close that documentation gap.	2022-05-05 11:11:05 -04:00
Bryce Kalow	e9319abc78	website: remove source code and related CI jobs (#12596 ) * remove website source code and related circle jobs * remove data files * updates platform-cli * update local instructions * updates package-lock	2022-05-05 09:53:22 -05:00
Seth Hoenig	96ec19788d	cgroups: make sure cgroup still exists after task restart This PR modifies raw_exec and exec to ensure the cgroup for a task they are driving still exists during a task restart. These drivers have the same bug but with different root cause. For raw_exec, we were removing the cgroup in 2 places - the cpuset manager, and in the unix containment implementation (the thing that uses freezer cgroup to clean house). During a task restart, the containment would remove the cgroup, and when the task runner hooks went to start again would block on waiting for the cgroup to exist, which will never happen, because it gets created by the cpuset manager which only runs as an alloc pre-start hook. The fix here is to simply not delete the cgroup in the containment implementation; killing the PIDs is enough. The removal happens in the cpuset manager later anyway. For exec, it's the same idea, except DestroyTask is called on task failure, which in turn calls into libcontainer, which in turn deletes the cgroup. In this case we do not have control over the deletion of the cgroup, so instead we hack the cgroup back into life after the call to DestroyTask. All of this only applies to cgroups v2.	2022-05-05 09:51:03 -05:00
James Rasell	a05114fdac	core: add namespace to plan for node rejected log line. (#12868 )	2022-05-05 10:56:40 +02:00
James Rasell	e1bf9138a1	release: fix hcl linting error within CI file. (#12867 )	2022-05-04 10:48:42 +02:00
Michele Degges	9c85ddcb7f	Add config key to the promote-staging event (#12857 )	2022-05-03 20:33:14 -07:00
Michele Degges	417d3ca232	Add config key to the promote-staging event	2022-05-03 08:51:19 -07:00
Tim Gross	45b238ec82	CSI: node drain should end once only plugins remain (#12846 ) In #12324 we made it so that plugins wait until the node drain is complete, as we do for system jobs. But we neglected to mark the node drain as complete once only plugins (or system jobs) remaining, which means that the node drain is left in a draining state until the `deadline` time expires. This was incorrectly documented as expected behavior in #12324.	2022-05-03 10:20:22 -04:00
Alex Carpenter	d59b517ab2	[WIP] feat: homepage and use case pages redesign (#11873 ) * feat: connect homepage and use case pages * fix: internalLink usage * fix: query name * chore: add homepage patterns * chore: remove offerings * chore: add intro features * chore: bump subnav * chore: updating patterns * chore: add use case to the subnav * chore: cleanup unused import * chore: remove subnav border	2022-05-03 09:06:00 -04:00
Luiz Aoqui	6cd9881d2d	Update CHANGELOG for 1.3.0-rc.1 (#12849 )	2022-05-02 16:52:00 -04:00
Seth Hoenig	35728cbc58	Merge pull request #12740 from hashicorp/cleanup-makefile-help build: add missing help descriptions to makefile	2022-05-02 10:33:22 -05:00
Seth Hoenig	b8d807c320	Merge pull request #12840 from hashicorp/docs-nvidia-updates docs: update nvidia driver documentation	2022-05-02 10:07:02 -05:00
Luiz Aoqui	758e85bc84	ui: fix an error when navigating to a task group (#12832 ) Clicking in a task group row in the job details page would throw the error: Uncaught Error: You didn't provide enough string/numeric parameters to satisfy all of the dynamic segments for route jobs.job.task-group. Missing params: name createParamHandlerInfo http://localhost:4646/ui/assets/vendor-194b1e0d68d11ef7a4bf334eb30ba74d.js:4814 applyToHandlers http://localhost:4646/ui/assets/vendor-194b1e0d68d11ef7a4bf334eb30ba74d.js:4804 applyToState http://localhost:4646/ui/assets/vendor-194b1e0d68d11ef7a4bf334eb30ba74d.js:4801 getTransitionByIntent http://localhost:4646/ui/assets/vendor-194b1e0d68d11ef7a4bf334eb30ba74d.js:4843 transitionByIntent http://localhost:4646/ui/assets/vendor-194b1e0d68d11ef7a4bf334eb30ba74d.js:4836 refresh http://localhost:4646/ui/assets/vendor-194b1e0d68d11ef7a4bf334eb30ba74d.js:4885 refresh http://localhost:4646/ui/assets/vendor-194b1e0d68d11ef7a4bf334eb30ba74d.js:2254 queryParamsDidChange http://localhost:4646/ui/assets/vendor-194b1e0d68d11ef7a4bf334eb30ba74d.js:2326 k http://localhost:4646/ui/assets/vendor-194b1e0d68d11ef7a4bf334eb30ba74d.js:2423 triggerEvent http://localhost:4646/ui/assets/vendor-194b1e0d68d11ef7a4bf334eb30ba74d.js:2349 fireQueryParamDidChange http://localhost:4646/ui/assets/vendor-194b1e0d68d11ef7a4bf334eb30ba74d.js:4863 getTransitionByIntent http://localhost:4646/ui/assets/vendor-194b1e0d68d11ef7a4bf334eb30ba74d.js:4848 transitionByIntent http://localhost:4646/ui/assets/vendor-194b1e0d68d11ef7a4bf334eb30ba74d.js:4836 doTransition http://localhost:4646/ui/assets/vendor-194b1e0d68d11ef7a4bf334eb30ba74d.js:4853 transitionTo http://localhost:4646/ui/assets/vendor-194b1e0d68d11ef7a4bf334eb30ba74d.js:4882 _doTransition http://localhost:4646/ui/assets/vendor-194b1e0d68d11ef7a4bf334eb30ba74d.js:2392 transitionTo http://localhost:4646/ui/assets/vendor-194b1e0d68d11ef7a4bf334eb30ba74d.js:2177 gotoTaskGroup http://localhost:4646/ui/assets/nomad-ui-4a2c1941e03e60e1feef715f23cf268c.js:623 ... This was caused because the attribute being passed to the transitionTo function was not the task group name, but the whole model.	2022-05-02 11:01:19 -04:00
Seth Hoenig	684abb9e28	docs: update nvidia driver documentation notably: - name of the compiled binary is 'nomad-device-nvidia', not 'nvidia-gpu' - link to Nvidia docs for installing the container runtime toolkit - list docker v19.03 as minimum version, to track with nvidia's new container runtime toolkit	2022-05-02 09:11:05 -05:00
Matus Goljer	a741cc76b5	nomad can also install autocomplete for fish shell (#12834 )	2022-05-02 09:26:55 -04:00
Luiz Aoqui	59e2bcd809	ci: remove unused CircleCI Makefile (#12828 ) This Makefile was used to generate the full config.yml from smaller sub-files, but this is not done anymore.	2022-04-29 15:25:23 -04:00
Tim Gross	d06ad50538	docs: clarify `capacity_min/max` for volumes (#12825 ) The capacity fields for `create volume` set bounds on the resulting size of the volume, but the ultimate size of the volume will be determined by the storage provider (between the min and max). Clarify this in the documentation and provide a suggestion for how to set a exact size.	2022-04-29 13:38:30 -04:00
Thomas Wunderlich	245d2a463b	Fix formatting	2022-04-29 10:02:20 -04:00
Thomas Wunderlich	c86e287de9	Remove debug log lines	2022-04-28 19:14:31 -04:00

1 2 3 4 5 ...

23129 Commits All Branches Search

23129 Commits

All Branches