open-nomad

Commit Graph

Author	SHA1	Message	Date
Danielle Lancashire	4fbcc668d0	volumes: Add support for mount propagation This commit introduces support for configuring mount propagation when mounting volumes with the `volume_mount` stanza on Linux targets. Similar to Kubernetes, we expose 3 options for configuring mount propagation: - private, which is equivalent to `rprivate` on Linux, which does not allow the container to see any new nested mounts after the chroot was created. - host-to-task, which is equivalent to `rslave` on Linux, which allows new mounts that have been created _outside of the container_ to be visible inside the container after the chroot is created. - bidirectional, which is equivalent to `rshared` on Linux, which allows both the container to see new mounts created on the host, but importantly _allows the container to create mounts that are visible in other containers and on the host_ private and host-to-task are safe, but bidirectional mounts can be dangerous, as if the code inside a container creates a mount, and does not clean it up before tearing down the container, it can cause bad things to happen inside the kernel. To add a layer of safety here, we require that the user has ReadWrite permissions on the volume before allowing bidirectional mounts, as a defense in depth / validation case, although creating mounts should also require a privileged execution environment inside the container.	2019-10-14 14:09:58 +02:00
Tim Gross	d965a15490	driver/networking: don't recreate existing network namespaces	2019-09-25 14:58:17 -04:00
Mahmood Ali	1d945994d0	docker: remove containers on creation failures The docker creation API calls may fail with http errors (e.g. timeout) even if container was successfully created. Here, we force remove container if we got unexpected failure. We already do this in some error handlers, and this commit updates all paths. I stopped short from a more aggressive refactoring, as the code is ripe for refactoring and would rather do that in another PR.	2019-09-18 08:45:59 -04:00
Mahmood Ali	75ede5a685	add exponential backoff for docker api calls	2019-09-18 08:12:54 -04:00
Mahmood Ali	ac329a5e07	retry transient docker errors within function	2019-09-13 15:25:31 -04:00
Mahmood Ali	e8d73e3d72	docker: defensive against failed starts This handles a bug where we may start a container successfully, yet we fail due to retries and startContainer not being idempotent call. Here, we ensure that when starting a container fails with 500 error, the retry succeeds if container was started successfully.	2019-09-13 13:02:35 -04:00
Mahmood Ali	87f0457973	fix qemu and update docker with tests	2019-09-04 11:27:51 -04:00
Jasmine Dahilig	5b6e39b37c	fix portmap envvars in docker driver	2019-09-04 11:26:13 -04:00
Danielle Lancashire	fb63259921	docker: Fix issue where an exec may never timeout	2019-08-16 15:40:03 +02:00
Ilya Guterman	1e6ea0af8c	driver/docker: use default network mode fallback to docker default network mode instead of explicit bridge for linux or nat for windows	2019-07-31 21:07:46 +03:00
Nick Ethier	0e40063092	docker: add nil check on network isolation spec	2019-07-31 01:03:21 -04:00
Nick Ethier	d752734719	docker: add additional comments	2019-07-31 01:03:20 -04:00
Nick Ethier	1fc5f86a7c	docker: support shared network namespaces	2019-07-31 01:03:20 -04:00
Nick Ethier	2d60ef64d9	plugins/driver: make DriverNetworkManager interface optional	2019-07-31 01:03:19 -04:00
Nick Ethier	548f78ef15	ar: initial driver based network management	2019-07-31 01:03:17 -04:00
Jasmine Dahilig	cece83dd9c	default to json-file log rotation for docker driver	2019-07-03 09:04:45 -07:00
Chris Baker	3ca97d52db	docker/driver: downgraded log level for error in DestroyTask	2019-06-03 21:21:32 +00:00
Chris Baker	2af897c76f	drivers/docker: modify container/image cleanup to be robust to containers removed out of band	2019-06-03 19:52:28 +00:00
Chris Baker	9442c26cff	docker: DestroyTask was not cleaning up Docker images because it was erroring early due to an attempt to inspect an image that had already been removed	2019-06-03 19:04:27 +00:00
Mahmood Ali	13c83ee38e	drivers/docker: implement streaming exec	2019-05-09 16:49:08 -04:00
Mahmood Ali	0ee771b020	driver/docker: Support `volumes` field in Windows Support Docker `volumes` field in Windows. Previously, volumes parser assumed some Unix-ism (e.g. didn't expect `:` in mount paths). Here, we use the Docker parser to identify host and container paths. Docker parsers use different validation logic from our previous unix implementation: Docker parser accepts single path as a volume entry (parsing it as a container path with auto-created volume) and enforces additional checks (e.g. validity of mode). Therefore, I opted to use Docker parser only for Windows, and keep Nomad's linux parser to preserve current behavior.	2019-04-25 09:02:44 -04:00
Mahmood Ali	df2b579c6b	driver/docker: collect tty container logs Fixes https://github.com/hashicorp/nomad/issues/5475 When container is a tty container, we need to get raw terminal output without any additional processing.	2019-04-24 22:01:51 -04:00
Danielle Lancashire	a096a7f112	Switch to pre-0.9 behaviour for handling volumes In Nomad 0.9, we made volume driver handling the same for `""`, and `"local"` volumes. Prior to Nomad 0.9 however these had slightly different behaviour for relative paths and named volumes. Prior to 0.9 the empty string would expand relative paths within the task dir, and `"local"` volumes that are not absolute paths would be treated as docker named volumes. This commit reverts to the previous behaviour as follows: | Nomad Version | Driver | Volume Spec | Behaviour | |------------------------------------------------------------------------- | all | "" | testing:/testing | allocdir/testing | | 0.8.7 | "local" | testing:/testing | "testing" as named volume | | 0.9.0 | "local" | testing:/testing | allocdir/testing | | 0.9.1 | "local" | testing:/testing | "testing" as named volume |	2019-04-18 14:28:45 +02:00
Mahmood Ali	b4d84fd6a9	Allow compiling without nvidia integration nvidia library use of dynamic library seems to conflict with alpine and musl based OSes. This adds a `nonvidia` tag to allow compiling nomad for alpine images. The nomad releases currently only support glibc based OS environments, so we default to compiling with nvidia.	2019-04-10 09:19:12 -04:00
Nick Ethier	4bbdb80b73	drivers/docker: fix image name handling when prefixed with https://	2019-04-04 22:10:18 -04:00
Michael Schurter	294d405397	docker: restore pre-0.9 container names As far as I can tell Nomad itself does not use the container name after container creation, so this should be safe. OP: https://groups.google.com/d/topic/nomad-tool/kYkyERfVRXE/discussion v0.8.7 code: https://github.com/hashicorp/nomad/blob/v0.8.7/client/driver/docker.go#L1530-L1531	2019-03-29 13:55:43 -07:00
Mahmood Ali	4726cb2207	logging.Type over logging.Driver	2019-02-28 16:40:18 -05:00
Mahmood Ali	104869c0e1	drivers/docker: rename logging `type` to `driver` Docker uses the term logging `driver` in its public documentation: in `docker` daemon config[1], `docker run` arguments [2] and in docker compose file[3]. Interestingly, docker used `type` in its API [4] instead of everywhere else. It's unfortunate that Nomad used `type` modeling after the Docker API rather than the user facing documents. Nomad using `type` feels very non-user friendly as it's disconnected from how Docker markets the flag and shows internal representation instead. Here, we rectify the situation by introducing `driver` field and preferring it over `type` in logging. [1] https://docs.docker.com/config/containers/logging/configure/ [2] https://docs.docker.com/engine/reference/run/#logging-drivers---log-driver [3] https://docs.docker.com/compose/compose-file/#logging [4] https://docs.docker.com/engine/api/v1.39/#operation/ContainerCreate	2019-02-28 16:04:03 -05:00
Danielle Tomlinson	e250aad31b	Merge pull request #5355 from hashicorp/dani/windows-dockerstats docker: Support Stats on Windows	2019-02-26 16:39:48 +01:00
Danielle Tomlinson	e3dc80bea3	docker: Return undetected before first detection This commit causes the docker driver to return undetected before it first establishes a connection to the docker daemon. This fixes a bug where hosts without docker installed would return as unhealthy, rather than undetected.	2019-02-25 11:02:42 +01:00
Danielle Tomlinson	8aff115fca	docker: Support stats on Windows	2019-02-22 14:19:58 +01:00
Danielle Tomlinson	2610e2d9ef	docker: Avoid leaking containers during Reattach Currently if a docker_logger cannot be reattached to, we will leak the container that was being used. This is problematic if e.g using static ports as it means you can never recover your task, or if a service is expensive to run and will then be running without supervision.	2019-02-20 17:47:06 +01:00
Danielle Tomlinson	2f18441a47	docker: Respawn docker logger during recovery Sometimes the nomad docker_logger may be killed by a service manager when restarting the client for upgrades or reliability reasons. Currently if this happens, we leak the user's container and try to reschedule over it. This commit adds a new step to the recovery process that will spawn a new docker logger process that will fetch logs from _the current timestamp_. This is to avoid restarting users' tasks because our logging sidecar has failed.	2019-02-20 17:12:56 +01:00
Danielle Tomlinson	a3a1491958	drivers/docker: SIGTERM to stop containers Windows Docker daemon does not support SIGINT, SIGTERM is the semantic equivalent that allows for graceful shutdown before being followed up by a SIGKILL.	2019-02-14 15:38:54 +00:00
Nick Ethier	be976d9c9a	Merge branch 'master' into f-driver-upgradepath-test * master: (23 commits) tests: avoid assertion in goroutine spell check ci: run checkscripts tests: deflake TestRktDriver_StartWaitRecoverWaitStop drivers/rkt: Remove unused github.com/rkt/rkt drivers/rkt: allow development on non-linux cli: Hide `nomad docker_logger` from help output api: test api and structs are in sync goimports until make check is happy nil check node resources to prevent panic tr: use context in as select statement move pluginutils -> helper/pluginutils vet goimports gofmt Split hclspec move hclutils Driver tests do not use hcl2/hcl, hclspec, or hclutils move reattach config loader and singleton ...	2019-01-23 21:01:24 -05:00
Nick Ethier	5b9013528e	drivers: add docker upgrade path and e2e test	2019-01-23 14:44:42 -05:00
Alex Dadgar	b2c7268843	move reattach config	2019-01-22 15:11:58 -08:00
Nick Ethier	e3c6f89b9a	drivers: use consts for task handle version	2019-01-18 18:31:01 -05:00
Nick Ethier	6804450c69	cleanup code comments and small fixes from refactor	2019-01-18 18:31:01 -05:00
Mahmood Ali	5df63fda7c	Merge pull request #5190 from hashicorp/f-memory-usage Track Basic Memory Usage as reported by cgroups	2019-01-18 16:46:02 -05:00
Preetha Appan	55319b05d1	clean up read access	2019-01-16 11:04:11 -06:00
Preetha Appan	469a286b1b	Refactor logging in drivers to use a tri-state boolean Changes logging warnings/errors only if the state changes from healthy to unhealthy	2019-01-16 10:19:31 -06:00
Preetha Appan	0c2c0a2d43	Make docker driver logging less redundant	2019-01-16 10:16:57 -06:00
oleksii.shyman	e41fbf7577	Add support for docker runtimes - docker fingerprint issues a docker api system info call to get the list of supported OCI runtimes. - OCI runtimes are reported as comma separated list of names - docker driver is aware of GPU runtime presence - docker driver throws an error when user tries to run container with GPU, when GPU runtime is not present - docker GPU runtime name is configurable	2019-01-15 11:34:47 -08:00
Danielle Tomlinson	272a8726d7	docker: Terminate dockerlogger Previously, we did not attempt to stop Docker Logger processes until DestroyTask, which means that under many circumstances, we will never successfully close the plugin client. This commit terminates the plugin process when `run` terminates, or when `DestroyTask` is called. Steps to repro: ``` $ nomad agent -dev $ nomad init $ nomad run example.nomad $ nomad stop example $ ps aux | grep nomad # See docker logger process running $ signal the dev agent $ ps aux | grep nomad # See docker logger process running ```	2019-01-15 14:58:05 +01:00
Mahmood Ali	9909d98bee	Track Basic Memory Usage as reported by cgroups Track current memory usage, `memory.usage_in_bytes`, in addition to `memory.max_memory_usage_in_bytes` and friends. This number is closer to what Docker reports. Related to https://github.com/hashicorp/nomad/issues/5165 .	2019-01-14 18:47:52 -05:00
Nick Ethier	9fea54e0dc	executor: implement streaming stats API plugins/driver: update driver interface to support streaming stats client/tr: use streaming stats api TODO: * how to handle errors and closed channel during stats streaming * prevent tight loop if Stats(ctx) returns an error drivers: update drivers TaskStats RPC to handle streaming results executor: better error handling in stats rpc docker: better control and error handling of stats rpc driver: allow stats to return a recoverable error	2019-01-12 12:18:22 -05:00
Mahmood Ali	64f80343fc	drivers: re-export ResourceUsage structs Re-export the ResourceUsage structs in drivers package to avoid drivers depending directly on the internal client/structs package. I attempted moving the structs to drivers, but that caused some import cycles that were a bit hard to disentangle. Alternatively, I added an alias here that's sufficient for our purposes of avoiding external drivers depending on internal packages, while allowing us to restructure packages in future without breaking source compatibility.	2019-01-08 09:11:47 -05:00
Mahmood Ali	916a40bb9e	move cstructs.DeviceNetwork to drivers pkg	2019-01-08 09:11:47 -05:00
Danielle Tomlinson	a9b9ad34dc	drivers: Implement InternalPluginDriver interface This implements the InternalPluginDriver interface in each driver, and calls the cancellation fn for their respective eventers. This fixes a per task goroutine leak during test suite execution.	2019-01-08 13:49:31 +01:00

78 Commits