open-nomad

Commit Graph

Author	SHA1	Message	Date
Nick Ethier	729dd9018c	docker: set default cpu cfs period (#6737 ) * docker: set default cpu cfs period Co-Authored-By: Michael Schurter <mschurter@hashicorp.com>	2019-11-19 19:05:15 -05:00
Mahmood Ali	bc893829bc	changelog and comment	2019-11-19 15:51:08 -05:00
Mahmood Ali	ea221cfe87	always destroy	2019-11-18 21:31:29 -05:00
Mahmood Ali	abd700bf8f	Add tests for orphaned processes	2019-11-18 21:31:29 -05:00
Tim Gross	b1b20cd479	remove misleading networking log line (#6588 ) When a job has a task group network, this log line ends up being misleading if you're trying to debug networking issues. We really only care about this when there's no port map set, in which case we get the error returned anyways.	2019-10-30 13:23:33 -04:00
Mahmood Ali	fe14993582	docs: Docker driver supports task user option Also, add a test case.	2019-10-24 14:00:37 -04:00
Mahmood Ali	977b86f924	driver/docker: ensure that defaults are populated Looks like we may need to pass default literal at each layer to be able, so defaults are set properly.	2019-10-18 18:27:28 -04:00
Mahmood Ali	1bdfcdcab7	add timeouts for docker reconciler docker calls	2019-10-18 15:31:13 -04:00
Mahmood Ali	414e01b6a6	only set a single label for now Other labels aren't strictly necessary here, and we may follow up with a better way to customize.	2019-10-18 15:31:13 -04:00
Mahmood Ali	3aec7b56ea	Only start reconciler once in main driver driver.SetConfig is not appropriate for starting up reconciler goroutine. Some ephemeral driver instances are created for validating config and we ought not to side-effecting goroutines for those. We currently lack a lifecycle hook to inject these, so I picked the `Fingerprinter` function for now, and reconciler should only run after fingerprinter started. Use `sync.Once` to ensure that we only start reconciler loop once.	2019-10-18 14:43:23 -04:00
Mahmood Ali	ac3b555cc8	docker label refactoring and additional tests	2019-10-17 10:45:13 -04:00
Mahmood Ali	e24c3fac56	add docker labels	2019-10-17 10:45:12 -04:00
Mahmood Ali	8739cc2a62	refactor reconciler code and address comments	2019-10-17 09:42:23 -04:00
Mahmood Ali	c01c6de481	address code review comments	2019-10-17 08:36:02 -04:00
Mahmood Ali	2a63caafba	docker: explicit grace period for initial container reconcilation Ensure we wait for some grace period before killing docker containers that may have launched in earlier nomad restore.	2019-10-17 08:36:02 -04:00
Mahmood Ali	aa59280edc	docker: periodically reconcile containers When running at scale, it's possible that Docker Engine starts containers successfully but gets wedged in a way where API call fails. The Docker Engine may remain unavailable for arbitrary long time. Here, we introduce a periodic reconcilation process that ensures that any container started by nomad is tracked, and killed if is running unexpectedly. Basically, the periodic job inspects any container that isn't tracked in its handlers. A creation grace period is used to prevent killing newly created containers that aren't registered yet. Also, we aim to avoid killing unrelated containters started by host or through raw_exec drivers. The logic is to pattern against containers environment variables and mounts to infer if they are an alloc docker container. Lastly, the periodic job can be disabled to avoid any interference if need be.	2019-10-17 08:36:01 -04:00
Danielle Lancashire	4fbcc668d0	volumes: Add support for mount propagation This commit introduces support for configuring mount propagation when mounting volumes with the `volume_mount` stanza on Linux targets. Similar to Kubernetes, we expose 3 options for configuring mount propagation: - private, which is equivalent to `rprivate` on Linux, which does not allow the container to see any new nested mounts after the chroot was created. - host-to-task, which is equivalent to `rslave` on Linux, which allows new mounts that have been created _outside of the container_ to be visible inside the container after the chroot is created. - bidirectional, which is equivalent to `rshared` on Linux, which allows both the container to see new mounts created on the host, but importantly _allows the container to create mounts that are visible in other containers an don the host_ private and host-to-task are safe, but bidirectional mounts can be dangerous, as if the code inside a container creates a mount, and does not clean it up before tearing down the container, it can cause bad things to happen inside the kernel. To add a layer of safety here, we require that the user has ReadWrite permissions on the volume before allowing bidirectional mounts, as a defense in depth / validation case, although creating mounts should also require a priviliged execution environment inside the container.	2019-10-14 14:09:58 +02:00
Nick Ethier	0c19bf6f04	executor: run exec commands in netns if set (#6405 ) executor: run exec commands in netns if set	2019-10-01 14:45:43 -04:00
Nick Ethier	8b881d83d5	executor: rename wrapNetns to withNetworkIsolation	2019-09-30 21:38:31 -04:00
Nick Ethier	5127caef11	comment wrapNetns	2019-09-30 12:06:52 -04:00
Nick Ethier	67ac161565	executor: removed unused field from exec_utils.go	2019-09-30 11:57:34 -04:00
Nick Ethier	6fd773eb88	executor: run exec commands in netns if set	2019-09-30 11:50:22 -04:00
Tim Gross	9efca131be	driver/java: pass task network isolation to executor Without passing the network isolation configuration to the executor, java tasks are not placed in the same network namespace as the other processes in their task group, which breaks Consul Connect.	2019-09-27 08:26:54 -04:00
Tim Gross	d965a15490	driver/networking: don't recreate existing network namespaces	2019-09-25 14:58:17 -04:00
Nick Ethier	53d3ea8ebd	driver: set correct network isolation caps for exec and java dr… (#6368 )	2019-09-25 11:48:14 -04:00
rpramodd	0d09b564fa	utils: add missing error info in case of cmd failure (#6355 )	2019-09-24 09:33:27 -04:00
Mahmood Ali	1d945994d0	docker: remove containers on creation failures The docker creation API calls may fail with http errors (e.g. timeout) even if container was successfully created. Here, we force remove container if we got unexpected failure. We already do this in some error handlers, and this commit updates all paths. I stopped short from a more aggressive refactoring, as the code is ripe for refactoring and would rather do that in another PR.	2019-09-18 08:45:59 -04:00
Mahmood Ali	75ede5a685	add exponential backoff for docker api calls	2019-09-18 08:12:54 -04:00
Mahmood Ali	ac329a5e07	retry transient docker errors within function	2019-09-13 15:25:31 -04:00
Mahmood Ali	e8d73e3d72	docker: defensive against failed starts This handles a bug where we may start a container successfully, yet we fail due to retries and startContainer not being idempotent call. Here, we ensure that when starting a container fails with 500 error, the retry succeeds if container was started successfully.	2019-09-13 13:02:35 -04:00
Mahmood Ali	87f0457973	fix qemu and update docker with tests	2019-09-04 11:27:51 -04:00
Jasmine Dahilig	5b6e39b37c	fix portmap envvars in docker driver	2019-09-04 11:26:13 -04:00
Michael Schurter	8fe42fccb0	Merge pull request #6000 from Iqoqo/docker-convert-host-paths-to-host-native driver/docker: convert host bind path to os native	2019-09-03 09:34:56 -07:00
Danielle Lancashire	724586ba1d	docker: Fix driver spec hclspec.NewLiteral does not quote its values, which caused `3m` to be parsed as a nonsensical literal which broke the plugin loader during initialization. By quoting the value here, it starts correctly.	2019-09-03 08:53:37 +02:00
Zhiguang Wang	832df1091b	Add default value "3m" to image_delay, making it consistent with docs.	2019-09-02 16:40:00 +08:00
Mahmood Ali	f98d4ee3f1	tests: enable raw_exec driver	2019-08-29 20:26:50 -04:00
Mahmood Ali	28e473aaff	raw_exec: be defensive when disabled Ensure that no raw_exec task can run on a client where it's disabled, even if a flaw lead to client being assigned a raw_exec task unexpectedly.	2019-08-29 09:09:40 -04:00
Danielle Lancashire	fb63259921	docker: Fix issue where an exec may never timeout	2019-08-16 15:40:03 +02:00
Michael Schurter	83dbac65b2	docker: reword FromSlash(hostPath) comment	2019-08-12 14:38:31 -07:00
ilya guterman	92ce8a0a49	Update utils.go	2019-08-12 19:31:34 +03:00
Ilya Guterman	c4b4d7fa43	add comment	2019-08-12 19:31:33 +03:00
Ilya Guterman	52aab40fb3	driver/docker: convert host bind path to os native relative mounting can be specified using backslashes or forward slashes. so no prior knowledge of host OS is needed for relative volumes mounting	2019-08-12 19:31:33 +03:00
Michael Schurter	aeeec126f5	Merge pull request #5999 from Iqoqo/use-default-network-for-docker driver/docker: use default network mode	2019-08-01 09:58:12 -07:00
Ilya Guterman	a4931ba25b	driver/docker: support unix destination mount path in windows This reverts commit a6c96eade56f0b8880edbec3c4392934492f09bf.	2019-08-01 19:54:08 +03:00
Ilya Guterman	1e6ea0af8c	driver/docker: use default network mode fallback to docker default network mode instead of explicit bridge for linux or nat for windows	2019-07-31 21:07:46 +03:00
Nick Ethier	1dae42ab81	docker: allow configuration of infra image	2019-07-31 01:04:07 -04:00
Nick Ethier	533b2850fc	executor: cleanup netns handling in executor	2019-07-31 01:04:05 -04:00
Nick Ethier	b8a1ebb3b7	executor: support network namespacing on universal executor	2019-07-31 01:03:58 -04:00
Nick Ethier	0e40063092	docker: add nil check on network isolation spec	2019-07-31 01:03:21 -04:00
Nick Ethier	f50fa7ef08	docker: fix driver test from changed func args	2019-07-31 01:03:20 -04:00

1 2 3 4 5 ...

497 Commits