open-nomad

Author	SHA1	Message	Date
Kris Hicks	93155ba3da	Add gocritic to golangci-lint config (#9556)	2020-12-08 12:47:04 -08:00
Tim Gross	d286d941dc	docker: kill signal API should include timeout context When the Docker driver kills a task, we send a request via the Docker API for dockerd to fire the signal. We send that signal and then block for the `kill_timeout` waiting for the container to exit. But if the Docker API blocks, we will block indefinitely because we haven't configured the API call with the same timeout. This changeset is a minimal intervention to add the timeout to the Docker API call _only_ when we have the `kill_timeout` set. Future work should examine whether we should be threading contexts through other `go-dockerclient` API calls.	2020-12-02 16:51:57 -05:00
Mahmood Ali	98c02851c8	use comment ignores (#9448) Use targeted ignore comments for the cases where we are bound by backward compatibility. I've left some file-based linters, especially when the file is riddled with linter violations (e.g. enum names), or if it's a property of the file (e.g. package and file names). I encountered an odd behavior related to RPC_REQUEST_RESPONSE_UNIQUE and RPC_REQUEST_STANDARD_NAME. Apparently, if they target a `stream` type, we must separate them into separate lines so that the ignore comment targets the type specifically.	2020-11-25 16:03:01 -05:00
Mahmood Ali	b2a8752c5f	honor task user when execing into raw_exec task (#9439) Fix #9210. This updates the executor so it honors the User when using nomad alloc exec. The bug was that the exec task didn't honor the init command when execing.	2020-11-25 09:34:10 -05:00
Nick Ethier	c9bd7e89ca	command: use correct port mapping syntax in examples	2020-11-23 10:25:30 -06:00
Mahmood Ali	d92d413ffd	Merge pull request #8291 from shishir-a412ed/cpusets Add cpuset_cpus to docker driver.	2020-11-11 17:13:27 -05:00
Mahmood Ali	a89da9982d	raw_exec: don't use cgroups when no_cgroup is set (#9328) When raw_exec is configured with [`no_cgroups`](https://www.nomadproject.io/docs/drivers/raw_exec#no_cgroups), raw_exec shouldn't attempt to create a cgroup. Prior to this change, we accidentally always required freezer cgroup to do stats PID tracking. We already have the proper fallback in place for metrics, so only need to ensure that we don't create a cgroup for the task. Fixes https://github.com/hashicorp/nomad/issues/8565	2020-11-11 16:20:34 -05:00
Shishir Mahajan	572c398187	Fix review comments.	2020-11-11 12:30:00 -08:00
Shishir Mahajan	9192100d4e	Fix circleci.	2020-11-11 12:30:00 -08:00
Shishir Mahajan	c30fea5cd3	Add cpuset_cpus to docker driver.	2020-11-11 12:30:00 -08:00
Tim Gross	0ef0b17b82	docker: disallow volume mounts from host by default (#9321 ) The default behavior for `docker.volumes.enabled` is intended to be `false`, but the HCL schema defaults to `true` if the value is unset. Set the default literal value to `true`. Additionally, Docker driver mounts of type "volume" (but not "bind") are not being properly sandboxed with that setting. Disable Docker mounts with type "volume" entirely whenever the `docker.volumes.enabled` flag is set to false. Note this is unrelated to the `volume_mount` feature, which is constrained to preconfigured host volumes or whatever is mounted by a CSI plugin. This changeset includes updates to unit tests that should have been failing under the documented behavior but were not.	2020-11-11 10:03:46 -05:00
Mahmood Ali	2d4634bcc3	Merge pull request #9304 from hashicorp/b-legacy-executors-are-executors Legacy executors are executors after all	2020-11-10 12:54:03 -05:00
Kris Hicks	9d03cf4c5f	protos: Update .proto files not to use Go package name (#9301 ) Previously, it was required that you `go get github.com/hashicorp/nomad` to be able to build protos, as the protoc invocation added an include directive that pointed to `$GOPATH/src`, which is how dependent protos were discovered. As Nomad now uses Go modules, it won't necessarily be cloned to `$GOPATH`. (Additionally, if you _had_ go-gotten Nomad at some point, protoc compilation would have possibly used the _wrong_ protos, as those wouldn't necessarily be the most up-to-date ones.) This change modifies the proto files and the `protoc` invocation to handle discovering dependent protos via protoc plugin modifier statements that are specific to the protoc plugin being used. In this change, `make proto` was run to recompile the protos, which results in changes only to the gzipped `FileDescriptorProto`.	2020-11-10 08:42:35 -08:00
Mahmood Ali	ac185b41e2	Legacy executors are executors after all This fixes a bug where pre-0.9 executors fail to recover after an upgrade. The bug is that legacyExecutorWrappers didn't get updated with the ExecStreaming function, and thus failed to implement the Executor interface. Sadly, this meant that all recovery attempts fail, as the runtime check in `b312aacbc9/drivers/shared/executor/utils.go (L103-L110)` .	2020-11-10 10:20:07 -05:00
Russell Rollins	538aa90d92	Use Dockerhub Mirror. (#9220) Dockerhub is going to rate limit unauthenticated pulls. Use our HashiCorp internal mirror for builds run through CircleCI. Co-authored-by: Mahmood Ali <mahmood@hashicorp.com>	2020-11-02 09:28:02 -05:00
Charlie Voiselle	16b6098df8	Fix for Java fingerprinter on macOS (#9225) Use alternative test for macOS JVM with /usr/libexec/java_home	2020-11-01 13:20:31 -05:00
Tim Gross	f9e659164f	docker: image_delay default missing without gc stanza (#9101) In the Docker driver plugin config for garbage collection, the `image_delay` field was missing from the default we set if the entire `gc` stanza is missing. This results in a default of 0s and immediate GC of Docker images. Expanded docker gc config test fields.	2020-10-15 12:36:01 -04:00
Michael Schurter	9c3972937b	s/0.13/1.0/g 1.0 here we come!	2020-10-14 15:17:47 -07:00
Yoan Blanc	891accb89a	use allow/deny instead of the colored alternatives (#9019) Signed-off-by: Yoan Blanc <yoan@dosimple.ch>	2020-10-12 08:47:05 -04:00
Seth Hoenig	a8869bd304	docs: document docker signal fix, add tests This PR adds a version specific upgrade note about the docker stop signal behavior. Also adds test for the signal logic in docker driver. Closes #8932 which was fixed in #8933	2020-10-02 10:06:43 -05:00
Mahmood Ali	f4450db775	tests: use system path On hosts with systemd-resolved, we actually copy /run/systemd/resolve/resolv.conf.	2020-10-01 10:23:19 -04:00
Mahmood Ali	f4b0aa0c1c	tests: copy permissions when copying files On the failover path, copy the permission bits (a.k.a. file mode), especially the execution bit.	2020-10-01 10:23:14 -04:00
Mahmood Ali	cd060db42a	tests: ignore empty cgroup My latest Vagrant box contains an empty cgroup name that isn't used for isolation: ``` $ cat /proc/self/cgroup | grep :: 0::/user.slice/user-1000.slice/session-17.scope ```	2020-10-01 10:23:13 -04:00
Mahmood Ali	91376cccf2	tests: failover to copying when symlinking fails Symlinking busybox may fail when the test code and the test temporary directory live on different volumes/partitions, so we should copy instead. This situation arises in the Vagrant setup, where the code repository lives on a special file-sharing volume. Somewhat unrelated, remove the `f.Sync()` invocation from a test copyFile helper function. Sync is useful only for crash recovery, and isn't necessary in our test setup. The sync invocation is a significant overhead as it requires the OS to flush any cached writes to disk.	2020-09-30 09:58:22 -04:00
Seth Hoenig	6d9a6786e5	Merge pull request #8933 from jf/fix_docker_stopsignal drivers/docker/driver.go: change default signal for docker driver to SIGTERM?	2020-09-29 10:51:04 -05:00
Seth Hoenig	fd2a31a331	drivers/docker: detect arch for default infra_image The 'docker.config.infra_image' would default to an amd64 container. It is possible to reference the correct image for a platform using the `runtime.GOARCH` variable, eliminating the need to explicitly set the `infra_image` on non-amd64 platforms. Also upgrade to Google's pause container version 3.1 from 3.0, which includes some enhancements around process management. Fixes #8926	2020-09-23 13:54:30 -05:00
Jeffrey 'jf' Lim	b84d63c4ba	drivers/docker/driver.go: change default signal for docker driver to SIGTERM?	2020-09-20 03:09:07 +08:00
Mahmood Ali	d4f385d6e1	Upgrade to golang 1.15 (#8858) Upgrade to golang 1.15 Starting with golang 1.15, setting the Ctty value results in a `Setctty set but Ctty not valid in child` error, as part of https://github.com/golang/go/issues/29458. This commit lifts the fix in https://github.com/creack/pty/pull/97.	2020-09-09 15:59:29 -04:00
Shengjing Zhu	7a4f48795d	Adjust cgroup change in libcontainer	2020-08-20 00:31:07 +08:00
Nick Ethier	1849a20b66	docker: use Nomad managed resolv.conf when DNS options are set (#8600)	2020-08-17 10:22:08 -04:00
James Rasell	dab8282be5	Merge pull request #8589 from hashicorp/f-gh-5718 driver/docker: allow configurable pull context timeout setting.	2020-08-14 16:07:59 +02:00
James Rasell	bc42cd2e5e	driver/docker: allow configurable pull context timeout setting. Pulling large docker containers can take longer than the default context timeout. Without a way to change this it is very hard for users to utilise Nomad properly without hacky workarounds. This change adds an optional pull_timeout config parameter which gives operators the possibility to account for increased pull times where needed. The infra docker image also has the option to set a custom timeout to keep consistency.	2020-08-12 08:58:07 +01:00
Nick Ethier	e39574be59	docker: support group allocated ports and host_networks (#8623) * docker: support group allocated ports * docker: add new ports driver config to specify which group ports are mapped * docker: update port mapping docs	2020-08-11 18:30:22 -04:00
Drew Bailey	27b8cadcc4	removes nvidia import from docker test (#8312)	2020-06-30 09:34:59 -04:00
Shishir Mahajan	182e68ca7a	Add notes.	2020-06-25 13:46:45 -07:00
Shishir Mahajan	0bc2c835fe	Remove dead tests.	2020-06-25 13:22:46 -07:00
Mahmood Ali	998f80d4cb	add an allowlist for qemu image paths	2020-06-24 08:03:19 -04:00
Mahmood Ali	5796719124	docker: disable host volume binding by default	2020-06-23 13:43:37 -04:00
Nick Ethier	1e4ea699ad	fix test failures from rebase	2020-06-18 11:05:32 -07:00
Nick Ethier	0bc0403cc3	Task DNS Options (#7661) Co-Authored-By: Tim Gross <tgross@hashicorp.com> Co-Authored-By: Seth Hoenig <shoenig@hashicorp.com>	2020-06-18 11:01:31 -07:00
Niam Jen Wei	d2de515f0c	Fix docker driver MemorySwap value Fixes an incorrect value being assigned to MemorySwap when `memory_hard_limit` flag is being used. Issue raised in https://github.com/hashicorp/nomad/issues/8153	2020-06-12 20:11:28 +01:00
Seth Hoenig	4bfa0548d9	Merge pull request #8087 from hashicorp/f-docker-mem-config driver/docker: enable setting hard/soft memory limits	2020-06-01 12:16:55 -05:00
Seth Hoenig	a792c64f57	driver/docker: add integration test around setting memory_hard_limit	2020-06-01 12:00:47 -05:00
Seth Hoenig	675f50b502	driver/docker: use pointer parameter on driver because locks	2020-06-01 09:35:17 -05:00
Seth Hoenig	ad91ba865c	driver/docker: enable setting hard/soft memory limits Fixes #2093 Enable configuring `memory_hard_limit` in the docker config stanza for tasks. If set, this field will be passed to the container runtime as `--memory`, and the `memory` configuration from the task resource configuration will be passed as `--memory-reservation`, creating hard and soft memory limits for tasks using the docker task driver.	2020-06-01 09:22:45 -05:00
Mahmood Ali	1fcc7970e4	tests: ensure that test is long enough to configure cgroups	2020-05-31 10:42:06 -04:00
Mahmood Ali	8ef1b85ce9	don't GC images in tests by default	2020-05-26 21:24:55 -04:00
Mahmood Ali	d9543a1a80	tests: don't delete images after tests complete Fix some docker test flakiness where image cleanup process may contaminate other tests. A clean up process may attempt to delete an image while it's used by another test.	2020-05-26 18:53:24 -04:00
Mahmood Ali	2588b3bc98	cleanup driver eventor goroutines This fixes a few cases where driver eventor goroutines are leaked during normal operations, but especially so in tests. This change makes a few modifications: First, it switches drivers to use `Context`s to manage shutdown events. Previously, it relied on callers invoking a `.Shutdown()` function that is specific to internal drivers only and requires casting. Using `Context`s provides a consistent, idiomatic way to manage lifecycle for both internal and external drivers. Also, I discovered a few places where we don't clean up a temporary driver instance in the plugin catalog code, where we dispense a driver to inspect and validate the schema config without properly cleaning it up.	2020-05-26 11:04:04 -04:00
Tim Gross	aa8927abb4	volumes: return better error messages for unsupported task drivers (#8030) When an allocation runs for a task driver that can't support volume mounts, the mounting will fail in a way that can be hard to understand. With host volumes this usually means failing silently, whereas with CSI the operator gets inscrutable internals exposed in the `nomad alloc status`. This changeset adds a MountConfig field to the task driver Capabilities response. We validate this when the `csi_hook` or `volume_hook` fires and return a user-friendly error. Note that we don't currently have a way to get driver capabilities up to the server, except through attributes. Validating this when the user initially submits the jobspec would be even better than what we're doing here (and could be useful for all our other capabilities), but that's out of scope for this changeset. Also note that the MountConfig enum starts with "supports all" in order to support community plugins in a backwards compatible way, rather than cutting them off from volume mounting unexpectedly.	2020-05-21 09:18:02 -04:00

597 commits