open-nomad

Commit Graph

Author	SHA1	Message	Date
Mahmood Ali	0d692f0931	Add a knob to restrict docker runtimes	2020-05-12 10:14:43 -04:00
Ben Buzbee	769a3cd8b3	Rename OCIRuntime to Runtime; allow gpu conflicts if they are the same runtime; add conflict test	2020-04-03 12:15:11 -07:00
Ben Buzbee	d4f26d1eee	Support custom docker runtimes This enables customers who want to use gvisor and have it configured on their clients.	2020-04-03 11:07:37 -07:00
Mahmood Ali	db4c263180	Merge pull request #7554 from benbuzbee/benbuz/fix-seccomp-file Parse security_opts before sending them to docker daemon	2020-03-31 11:54:17 -04:00
Ben Buzbee	4f6ea87ec4	Parse security_opts before sending them to docker daemon Fixes #6720 Copy the parsing function from the docker CLI. Docker daemon expects to see JSON for seccomp file not a path.	2020-03-31 08:34:41 -07:00
Mahmood Ali	452a057a8c	driver/docker: fix memory swapping MemorySwappiness can only be set in non-Windows options: https://ci.appveyor.com/project/hashicorp/nomad/builds/31832149 Also fixes https://github.com/hashicorp/nomad/issues/6085	2020-03-30 16:51:16 -04:00
Mahmood Ali	8f57f78087	vendors: update fsouza/go-docker-client to v1.6.3	2020-03-30 15:10:53 -04:00
Mahmood Ali	fd5d033e32	Revert "vendor: fsouza/go-docker-client v1.6.3"	2020-03-23 10:48:47 -04:00
Yoan Blanc	ed8dcccb54	docker: disable swap in Windows only Signed-off-by: Yoan Blanc <yoan@dosimple.ch>	2020-03-23 08:35:09 +01:00
Yoan Blanc	d9ea68e807	fixup! fixup! vendor: fsouza/go-docker-client v1.6.3 Signed-off-by: Yoan Blanc <yoan@dosimple.ch>	2020-03-22 10:04:52 +01:00
Yoan Blanc	8e744d1877	vendor: fsouza/go-docker-client v1.6.3 Signed-off-by: Yoan Blanc <yoan@dosimple.ch>	2020-03-22 09:25:46 +01:00
John Schlederer	8b35c75206	Making pull activity timeout configurable in Docker * Making pull activity timeout configurable in Docker plugin config, first pass * Fixing broken function call * Fixing broken tests * Fixing linter suggestion * Adding documentation on new parameter in Docker plugin config * Adding unit test * Setting min value for pull_activity_timeout, making pull activity duration a private var	2019-12-18 12:58:53 +01:00
Mahmood Ali	0b7085ba3a	driver: allow disabling log collection Operators commonly have docker logs aggregated using various tools and don't need nomad to manage their docker logs. Worse, Nomad uses a somewhat heavy docker api call to collect them and it seems to cause problems when a client runs hundreds of log collections. Here we add a knob to disable log aggregation completely for nomad. When log collection is disabled, we avoid running logmon and docker_logger for the docker tasks in this implementation. The downside here is once disabled, `nomad logs ...` commands and API no longer return logs and operators must correlate alloc-ids with their aggregated log info. This is meant as a stop gap measure. Ideally, we'd follow up with at least two changes: First, we should optimize behavior when we can such that operators don't need to disable docker log collection. Potentially by reverting to using pre-0.9 syslog aggregation in linux environments, though with different trade-offs. Second, when/if logs are disabled, nomad logs endpoints should lookup docker logs api on demand. This ensures that the cost of log collection is paid sparingly.	2019-12-08 14:15:03 -05:00
Tim Gross	b1b20cd479	remove misleading networking log line (#6588) When a job has a task group network, this log line ends up being misleading if you're trying to debug networking issues. We really only care about this when there's no port map set, in which case we get the error returned anyways.	2019-10-30 13:23:33 -04:00
Mahmood Ali	414e01b6a6	only set a single label for now Other labels aren't strictly necessary here, and we may follow up with a better way to customize.	2019-10-18 15:31:13 -04:00
Mahmood Ali	3aec7b56ea	Only start reconciler once in main driver driver.SetConfig is not appropriate for starting up reconciler goroutine. Some ephemeral driver instances are created for validating config and we ought not to start side-effecting goroutines for those. We currently lack a lifecycle hook to inject these, so I picked the `Fingerprinter` function for now, and reconciler should only run after fingerprinter started. Use `sync.Once` to ensure that we only start reconciler loop once.	2019-10-18 14:43:23 -04:00
Mahmood Ali	e24c3fac56	add docker labels	2019-10-17 10:45:12 -04:00
Mahmood Ali	c01c6de481	address code review comments	2019-10-17 08:36:02 -04:00
Danielle Lancashire	4fbcc668d0	volumes: Add support for mount propagation This commit introduces support for configuring mount propagation when mounting volumes with the `volume_mount` stanza on Linux targets. Similar to Kubernetes, we expose 3 options for configuring mount propagation: - private, which is equivalent to `rprivate` on Linux, which does not allow the container to see any new nested mounts after the chroot was created. - host-to-task, which is equivalent to `rslave` on Linux, which allows new mounts that have been created _outside of the container_ to be visible inside the container after the chroot is created. - bidirectional, which is equivalent to `rshared` on Linux, which allows both the container to see new mounts created on the host, but importantly _allows the container to create mounts that are visible in other containers and on the host_ private and host-to-task are safe, but bidirectional mounts can be dangerous, as if the code inside a container creates a mount, and does not clean it up before tearing down the container, it can cause bad things to happen inside the kernel. To add a layer of safety here, we require that the user has ReadWrite permissions on the volume before allowing bidirectional mounts, as a defense in depth / validation case, although creating mounts should also require a privileged execution environment inside the container.	2019-10-14 14:09:58 +02:00
Tim Gross	d965a15490	driver/networking: don't recreate existing network namespaces	2019-09-25 14:58:17 -04:00
Mahmood Ali	1d945994d0	docker: remove containers on creation failures The docker creation API calls may fail with http errors (e.g. timeout) even if container was successfully created. Here, we force remove container if we got unexpected failure. We already do this in some error handlers, and this commit updates all paths. I stopped short of a more aggressive refactoring, as the code is ripe for refactoring and would rather do that in another PR.	2019-09-18 08:45:59 -04:00
Mahmood Ali	75ede5a685	add exponential backoff for docker api calls	2019-09-18 08:12:54 -04:00
Mahmood Ali	ac329a5e07	retry transient docker errors within function	2019-09-13 15:25:31 -04:00
Mahmood Ali	e8d73e3d72	docker: defensive against failed starts This handles a bug where we may start a container successfully, yet we fail due to retries and startContainer not being idempotent call. Here, we ensure that when starting a container fails with 500 error, the retry succeeds if container was started successfully.	2019-09-13 13:02:35 -04:00
Mahmood Ali	87f0457973	fix qemu and update docker with tests	2019-09-04 11:27:51 -04:00
Jasmine Dahilig	5b6e39b37c	fix portmap envvars in docker driver	2019-09-04 11:26:13 -04:00
Danielle Lancashire	fb63259921	docker: Fix issue where an exec may never timeout	2019-08-16 15:40:03 +02:00
Ilya Guterman	1e6ea0af8c	driver/docker: use default network mode fallback to docker default network mode instead of explicit bridge for linux or nat for windows	2019-07-31 21:07:46 +03:00
Nick Ethier	0e40063092	docker: add nil check on network isolation spec	2019-07-31 01:03:21 -04:00
Nick Ethier	d752734719	docker: add additional comments	2019-07-31 01:03:20 -04:00
Nick Ethier	1fc5f86a7c	docker: support shared network namespaces	2019-07-31 01:03:20 -04:00
Nick Ethier	2d60ef64d9	plugins/driver: make DriverNetworkManager interface optional	2019-07-31 01:03:19 -04:00
Nick Ethier	548f78ef15	ar: initial driver based network management	2019-07-31 01:03:17 -04:00
Jasmine Dahilig	cece83dd9c	default to json-file log rotation for docker driver	2019-07-03 09:04:45 -07:00
Chris Baker	3ca97d52db	docker/driver: downgraded log level for error in DestroyTask	2019-06-03 21:21:32 +00:00
Chris Baker	2af897c76f	drivers/docker: modify container/image cleanup to be robust to containers removed out of band	2019-06-03 19:52:28 +00:00
Chris Baker	9442c26cff	docker: DestroyTask was not cleaning up Docker images because it was erroring early due to an attempt to inspect an image that had already been removed	2019-06-03 19:04:27 +00:00
Mahmood Ali	13c83ee38e	drivers/docker: implement streaming exec	2019-05-09 16:49:08 -04:00
Mahmood Ali	0ee771b020	driver/docker: Support `volumes` field in Windows Support Docker `volumes` field in Windows. Previously, volumes parser assumed some Unix-ism (e.g. didn't expect `:` in mount paths). Here, we use the Docker parser to identify host and container paths. Docker parsers use different validation logic from our previous unix implementation: Docker parser accepts single path as a volume entry (parsing it as a container path with auto-created volume) and enforces additional checks (e.g. validity of mode). Therefore, I opted to use Docker parser only for Windows, and keep Nomad's linux parser to preserve current behavior.	2019-04-25 09:02:44 -04:00
Mahmood Ali	df2b579c6b	driver/docker: collect tty container logs Fixes https://github.com/hashicorp/nomad/issues/5475 When container is a tty container, we need to get raw terminal output without any additional processing.	2019-04-24 22:01:51 -04:00
Danielle Lancashire	a096a7f112	Switch to pre-0.9 behaviour for handling volumes In Nomad 0.9, we made volume driver handling the same for `""`, and `"local"` volumes. Prior to Nomad 0.9 however these had slightly different behaviour for relative paths and named volumes. Prior to 0.9 the empty string would expand relative paths within the task dir, and `"local"` volumes that are not absolute paths would be treated as docker named volumes. This commit reverts to the previous behaviour as follows: \| Nomad Version \| Driver \| Volume Spec \| Behaviour \| \|------------------------------------------------------------------------- \| all \| "" \| testing:/testing \| allocdir/testing \| \| 0.8.7 \| "local" \| testing:/testing \| "testing" as named volume \| \| 0.9.0 \| "local" \| testing:/testing \| allocdir/testing \| \| 0.9.1 \| "local" \| testing:/testing \| "testing" as named volume \|	2019-04-18 14:28:45 +02:00
Mahmood Ali	b4d84fd6a9	Allow compiling without nvidia integration nvidia library use of dynamic library seems to conflict with alpine and musl based OSes. This adds a `nonvidia` tag to allow compiling nomad for alpine images. The nomad releases currently only support glibc based OS environments, so we default to compiling with nvidia.	2019-04-10 09:19:12 -04:00
Nick Ethier	4bbdb80b73	drivers/docker: fix image name handling when prefixed with https://	2019-04-04 22:10:18 -04:00
Michael Schurter	294d405397	docker: restore pre-0.9 container names As far as I can tell Nomad itself does not use the container name after container creation, so this should be safe. OP: https://groups.google.com/d/topic/nomad-tool/kYkyERfVRXE/discussion v0.8.7 code: https://github.com/hashicorp/nomad/blob/v0.8.7/client/driver/docker.go#L1530-L1531	2019-03-29 13:55:43 -07:00
Mahmood Ali	4726cb2207	logging.Type over logging.Driver	2019-02-28 16:40:18 -05:00
Mahmood Ali	104869c0e1	drivers/docker: rename logging `type` to `driver` Docker uses the term logging `driver` in its public documentation: in `docker` daemon config[1], `docker run` arguments [2] and in docker compose file[3]. Interestingly, docker used `type` in its API [4] instead of everywhere else. It's unfortunate that Nomad used `type` modeling after the Docker API rather than the user facing documents. Nomad using `type` feels very non-user friendly as it's disconnected from how Docker markets the flag and shows internal representation instead. Here, we rectify the situation by introducing `driver` field and preferring it over `type` in logging. [1] https://docs.docker.com/config/containers/logging/configure/ [2] https://docs.docker.com/engine/reference/run/#logging-drivers---log-driver [3] https://docs.docker.com/compose/compose-file/#logging [4] https://docs.docker.com/engine/api/v1.39/#operation/ContainerCreate	2019-02-28 16:04:03 -05:00
Danielle Tomlinson	e250aad31b	Merge pull request #5355 from hashicorp/dani/windows-dockerstats docker: Support Stats on Windows	2019-02-26 16:39:48 +01:00
Danielle Tomlinson	e3dc80bea3	docker: Return undetected before first detection This commit causes the docker driver to return undetected before it first establishes a connection to the docker daemon. This fixes a bug where hosts without docker installed would return as unhealthy, rather than undetected.	2019-02-25 11:02:42 +01:00
Danielle Tomlinson	8aff115fca	docker: Support stats on Windows	2019-02-22 14:19:58 +01:00
Danielle Tomlinson	2610e2d9ef	docker: Avoid leaking containers during Reattach Currently if a docker_logger cannot be reattached to, we will leak the container that was being used. This is problematic if e.g using static ports as it means you can never recover your task, or if a service is expensive to run and will then be running without supervision.	2019-02-20 17:47:06 +01:00

96 Commits