open-nomad

Commit Graph

Author	SHA1	Message	Date
Nick Ethier	533b2850fc	executor: cleanup netns handling in executor	2019-07-31 01:04:05 -04:00
Nick Ethier	b8a1ebb3b7	executor: support network namespacing on universal executor	2019-07-31 01:03:58 -04:00
Nick Ethier	0e40063092	docker: add nil check on network isolation spec	2019-07-31 01:03:21 -04:00
Nick Ethier	f50fa7ef08	docker: fix driver test from changed func args	2019-07-31 01:03:20 -04:00
Nick Ethier	d752734719	docker: add additional commens	2019-07-31 01:03:20 -04:00
Nick Ethier	1fc5f86a7c	docker: support shared network namespaces	2019-07-31 01:03:20 -04:00
Nick Ethier	971c8c9c2b	Driver networking support Adds support for passing network isolation config into drivers and implements support in the rawexec driver as a proof of concept	2019-07-31 01:03:20 -04:00
Nick Ethier	2d60ef64d9	plugins/driver: make DriverNetworkManager interface optional	2019-07-31 01:03:19 -04:00
Nick Ethier	548f78ef15	ar: initial driver based network management	2019-07-31 01:03:17 -04:00
Michael Schurter	ea7fac7bcf	Revert "driver/docker: support unix destination mount path in windows"	2019-07-29 09:09:54 -07:00
Michael Schurter	5bd655e87d	Merge pull request #6001 from Iqoqo/allow-unix-container-path-in-lcow driver/docker: support unix destination mount path in windows	2019-07-29 09:09:41 -07:00
Lang Martin	1e33da5fd1	executor_universal_linux log a link to the docs on cgroup error	2019-07-24 12:37:33 -04:00
Ilya Guterman	cb2284fc3a	driver/docker: support unix destination mount path in windows	2019-07-24 11:57:28 +03:00
Lang Martin	a1d496c05c	executor_universal_linux raw_exec cgroup failure is not fatal	2019-07-22 15:16:36 -04:00
Lang Martin	a0fe1ffdd5	default e.getAllPids in executor_basic	2019-07-18 10:57:27 -04:00
Lang Martin	9d0c0c459d	executor_unix and _windows stub getAllPids ByScanning	2019-07-17 17:34:06 -04:00
Lang Martin	e071f6b022	executor_universal_linux getAllPids chooses cgroup when available	2019-07-17 17:33:55 -04:00
Lang Martin	e1bab541ad	executor use e.getAllPids()	2019-07-17 17:33:11 -04:00
Lang Martin	18597c4917	resource_container_linux new getAllPidsByCgroup	2019-07-17 17:31:36 -04:00
Lang Martin	2e981a812e	pid_collector getAllPids -> getAllPidsByScanning	2019-07-17 17:31:20 -04:00
Jasmine Dahilig	1c1e81b294	Merge pull request #5846 from hashicorp/f-docker-log-constraints add log rotation to docker driver log defaults	2019-07-03 10:17:19 -07:00
Jasmine Dahilig	cece83dd9c	default to json-file log rotation for docker driver	2019-07-03 09:04:45 -07:00
Mahmood Ali	6c245c9b6a	Merge pull request #5811 from cloudbuy/b-win32-volume-split lift code from docker/volume/mounts for splitting windows volumes	2019-06-18 21:19:15 -04:00
Mahmood Ali	ac64509c59	comment on use of init() for plugin handlers	2019-06-18 20:54:55 -04:00
Damien Churchill	9ee17f32c0	run new file through goimports	2019-06-18 08:35:25 +01:00
Damien Churchill	dba5bd96cd	run gofmt over the new file	2019-06-18 08:35:25 +01:00
Damien Churchill	4d7d352d7e	drivers/docker: move lifted code out to separate file and link the source & license	2019-06-18 08:35:25 +01:00
Damien Churchill	70daca3395	lift code from docker/volume/mounts for splitting windows volumes Using the API as provided from the `mounts` package imposes validation on the `src:dest` which shouldn't be performed at this time. To workaround that lift the internal code from that library required to only perform the split.	2019-06-18 08:35:25 +01:00
Mahmood Ali	962921f86c	Use init to handle plugin invocation Currently, nomad "plugin" processes (e.g. executor, logmon, docker_logger) are started as CLI commands to be handled by command CLI framework. Plugin launchers use `discover.NomadBinary()` to identify the binary and start it. This has few downsides: The trivial one is that when running tests, one must re-compile the nomad binary as the tests need to invoke the nomad executable to start plugin. This is frequently overlooked, resulting in puzzlement. The more significant issue with `executor` in particular is in relation to external driver: * Plugin must identify the path of invoking nomad binary, which is not trivial; `discvoer.NomadBinary()` now returns the path to the plugin rather than to nomad, preventing external drivers from launching executors. * The external driver may get a different version of executor than it expects (specially if we make a binary incompatible change in future). This commit addresses both downside by having the plugin invocation handling through an `init()` call, similar to how libcontainer init handler is done in [1] and recommened by libcontainer [2]. `init()` will be invoked and handled properly in tests and external drivers. For external drivers, this change will cause external drivers to launch the executor that's compiled against. There a are a couple of downsides to this approach: * These specific packages (i.e executor, logmon, and dockerlog) need to be careful in use of `init()`, package initializers. Must avoid having command execution rely on any other init in the package. I prefixed files with `z_` (golang processes files in lexical order), but ensured we don't depend on order. * The command handling is spread in multiple packages making it a bit less obvious how plugin starts are handled. [1] drivers/shared/executor/libcontainer_nsenter_linux.go [2] `eb4aeed24f/libcontainer (using-libcontainer)`	2019-06-13 16:48:01 -04:00
Mahmood Ali	5734c8a648	update comment	2019-06-11 13:00:26 -04:00
Mahmood Ali	f7608c4cef	exec: use an independent name=systemd cgroup path We aim for containers to be part of a new cgroups hierarchy independent from nomad agent. However, we've been setting a relative path as libcontainer `cfg.Cgroups.Path`, which makes libcontainer concatinate the executor process cgroup with passed cgroup, as set in [1]. By setting an absolute path, we ensure that all cgroups subsystem (including `name=systemd` get a dedicated one). This matches behavior in Nomad 0.8, and behavior of how Docker and OCI sets CgroupsPath[2] Fixes #5736 [1] `d7edf9b2e4/vendor/github.com/opencontainers/runc/libcontainer/cgroups/fs/apply_raw.go (L326-L340)` [2] `238f8eaa31/vendor/github.com/containerd/containerd/oci/spec.go (L229)`	2019-06-10 22:00:12 -04:00
Chris Baker	3ca97d52db	docker/driver: downgraded log level for error in DestroyTask	2019-06-03 21:21:32 +00:00
Chris Baker	2af897c76f	drivers/docker: modify container/image cleanup to be robust to containers removed out of band	2019-06-03 19:52:28 +00:00
Chris Baker	be6c6e8ce1	docker/tests: - modified tests to cleanup now that RemoveContainer isn't in StartTask - fix some broken tests by removing docker images/containers before test	2019-06-03 19:05:08 +00:00
Chris Baker	9442c26cff	docker: DestroyTask was not cleaning up Docker images because it was erroring early due to an attempt to inspect an image that had already been removed	2019-06-03 19:04:27 +00:00
Mahmood Ali	cb554a015f	Fix test comparisons	2019-05-24 21:38:22 -05:00
Mahmood Ali	99637c8bbc	Test for expected capabilities specifically	2019-05-24 16:07:05 -05:00
Mahmood Ali	7455c746aa	use /bin/bash	2019-05-24 14:50:23 -04:00
Mahmood Ali	68813def56	special case root capabilities	2019-05-24 14:10:10 -04:00
Mahmood Ali	01d5c90cbb	tests: Fix binary dir permissions	2019-05-24 11:31:12 -04:00
Mahmood Ali	00081b15d6	fix	2019-05-20 15:30:07 -04:00
Mahmood Ali	807e7b90e0	drivers/exec: Restore 0.8 capabilities Nomad 0.9 incidentally set effective capabilities that is higher than what's expected of a `nobody` process, and what's set in 0.8. This change restores the capabilities to ones used in Nomad 0.9.	2019-05-20 13:11:29 -04:00
Lang Martin	0256cf700d	Merge pull request #5649 from hashicorp/b-lookup-exe-chroot lookup executables inside chroot	2019-05-17 15:07:41 -04:00
Michael Schurter	8589233a0e	drivers/mock: implement InspectTask	2019-05-14 10:53:27 -07:00
Mahmood Ali	dd8762e348	typo: "atleast" -> "at least"	2019-05-13 10:01:19 -04:00
Mahmood Ali	d1526571a5	implement nomad exec for rkt Implement the streaming exec handler for the rkt driver	2019-05-12 18:59:00 -04:00
Mahmood Ali	b4df061fef	use pty/tty terminology similar to github.com/kr/pty	2019-05-10 19:17:14 -04:00
Mahmood Ali	a4640db7a6	drivers: implement streaming exec for executor based drivers These simply delegate call to backend executor.	2019-05-10 19:17:14 -04:00
Mahmood Ali	3055fd53df	executors: implement streaming exec Implements streamign exec handling in both executors (i.e. universal and libcontainer). For creation of TTY, some incidental complexity leaked in. The universal executor uses github.com/kr/pty for creation of TTYs. On the other hand, libcontainer expects a console socket and for libcontainer to create the underlying console object on process start. The caller can then use `libcontainer.utils.RecvFd()` to get tty master end. I chose github.com/kr/pty for managing TTYs here. I tried `github.com/containerd/console` package (which is already imported), but the package did not work as expected on macOS.	2019-05-10 19:17:14 -04:00
Mahmood Ali	085d2ef759	executor: scaffolding for executor grpc handling Prepare executor to handle streaming exec API calls that reuse drivers protobuf structs.	2019-05-10 19:17:14 -04:00
Lang Martin	99359d7fbe	executor_linux only do path resolution in the taskDir, not local split out lookPathIn to show it's similarity to exec.LookPath	2019-05-10 11:33:35 -04:00
Mahmood Ali	13c83ee38e	drivers/docker: implement streaming exec	2019-05-09 16:49:08 -04:00
Mahmood Ali	ce97689b79	drivers/mock: implement nomad exec interface	2019-05-09 16:49:08 -04:00
Lang Martin	3ae276cfd2	executor_linux_test call lookupTaskBin with an ExecCommand	2019-05-08 10:01:51 -04:00
Lang Martin	743a2a2875	executor_linux pass the command to lookupTaskBin to get path	2019-05-08 10:01:20 -04:00
Lang Martin	8db3fe047c	executor/* Launch log at top of Launch is more explicit, trace	2019-05-07 17:01:05 -04:00
Lang Martin	87585e950d	move lookupTaskBin to executor_linux, for os dependency clarity	2019-05-07 16:58:27 -04:00
Lang Martin	de807a410a	driver_test leave cat in the test, but add cat to the chroot	2019-05-07 16:14:01 -04:00
Lang Martin	1e5d851d23	executor_test cleanup old lookupBin tests	2019-05-04 10:21:59 -04:00
Lang Martin	c777271374	driver_test StartWait task calls a program inside it's chroot	2019-05-03 16:34:31 -04:00
Lang Martin	c0741e392d	executor lookupTaskBin also does PATH expansion, anchored in taskDIR	2019-05-03 16:22:09 -04:00
Lang Martin	1619d3e3cb	executor_linux_test test PATH lookup inside the container	2019-05-03 16:21:58 -04:00
Lang Martin	22e99e41c1	executor and executor_linux debug launch prep and process start	2019-05-03 14:42:57 -04:00
Lang Martin	47b9fc3d26	executor_linux_test new TestExecutor_EscapeContainer	2019-05-03 14:38:42 -04:00
Lang Martin	1cf936e90f	executor_test test for more edges of lookupBin behavior	2019-05-03 11:55:19 -04:00
Lang Martin	88ce590dac	executor_linux call new lookupTaskBin	2019-05-03 11:55:19 -04:00
Lang Martin	ed63d6743b	executor split up lookupBin	2019-05-03 11:55:19 -04:00
Mahmood Ali	7131ac3eef	drivers/mock: extract command related operations Extract command parsing and execution mocking into a separate struct. Also, allow mocking of different fs_isolation for testing.	2019-04-30 14:02:16 -04:00
Mahmood Ali	0ee771b020	driver/docker: Support `volumes` field in Windows Support Docker `volumes` field in Windows. Previously, volumes parser assumed some Unix-ism (e.g. didn't expect `:` in mount paths). Here, we use the Docker parser to identify host and container paths. Docker parsers use different validation logic from our previous unix implementation: Docker parser accepts single path as a volume entry (parsing it as a container path with auto-created volume) and enforces additional checks (e.g. validity of mode). Thereforce, I opted to use Docker parser only for Windows, and keep Nomad's linux parser to preserve current behavior.	2019-04-25 09:02:44 -04:00
Mahmood Ali	df2b579c6b	driver/docker: collect tty container logs Fixes https://github.com/hashicorp/nomad/issues/5475 When container is a tty container, we need to get raw terminal output without any additional processing.	2019-04-24 22:01:51 -04:00
Mahmood Ali	6014a884be	comment on using init() for libcontainer handling	2019-04-19 09:49:04 -04:00
Mahmood Ali	4322055301	comment what refer to	2019-04-19 09:49:04 -04:00
Mahmood Ali	18993421f2	Move libcontainer helper to executor package	2019-04-19 09:49:04 -04:00
Danielle Lancashire	a096a7f112	Switch to pre-0.9 behaviour for handling volumes In Nomad 0.9, we made volume driver handling the same for `""`, and `"local"` volumes. Prior to Nomad 0.9 however these had slightly different behaviour for relative paths and named volumes. Prior to 0.9 the empty string would expand relative paths within the task dir, and `"local"` volumes that are not absolute paths would be treated as docker named volumes. This commit reverts to the previous behaviour as follows: \| Nomad Version \| Driver \| Volume Spec \| Behaviour \| \|------------------------------------------------------------------------- \| all \| "" \| testing:/testing \| allocdir/testing \| \| 0.8.7 \| "local" \| testing:/testing \| "testing" as named volume \| \| 0.9.0 \| "local" \| testing:/testing \| allocdir/testing \| \| 0.9.1 \| "local" \| testing:/testing \| "testing" as named volume \|	2019-04-18 14:28:45 +02:00
Mahmood Ali	01a13a0947	locking and opening streams in goroutine comment	2019-04-16 11:02:19 -04:00
Mahmood Ali	357b86adc3	open fifo on background goroutine	2019-04-15 21:20:09 -04:00
Mahmood Ali	9e48fab225	Merge pull request #5537 from hashicorp/b-nonvidia-flag Allow compiling without nvidia integration	2019-04-10 13:40:55 -04:00
Mahmood Ali	b4d84fd6a9	Allow compiling without nvidia integration nvidia library use of dynamic library seems to conflict with alpine and musl based OSes. This adds a `nonvidia` tag to allow compiling nomad for alpine images. The nomad releases currently only support glibc based OS environments, so we default to compiling with nvidia.	2019-04-10 09:19:12 -04:00
Mahmood Ali	f7d39d6186	ci: move docker unix-y test Fix AppVeyor failing builds, by moving docker image url test to run on unix systems only. The used paused image is a linux image only, not available on Windows.	2019-04-09 19:59:58 -04:00
Nick Ethier	4bbdb80b73	drivers/docker: fix image name handleing when prefixed with https://	2019-04-04 22:10:18 -04:00
Mahmood Ali	77a5edd3ae	an alternative order	2019-04-02 20:00:54 -04:00
Mahmood Ali	334c6e9f5f	try not without checking stat first	2019-04-02 19:55:44 -04:00
Mahmood Ali	17df86acda	basic test for #4809	2019-04-02 19:50:35 -04:00
Michael Schurter	923cd91850	Merge pull request #5504 from hashicorp/b-exec-path executor/linux: make chroot binary paths absolute	2019-04-02 14:09:50 -07:00
Michael Schurter	47bed4316f	executor/linux: comment this bizarre code	2019-04-02 11:25:45 -07:00
Michael Schurter	1d569a27dc	Revert "executor/linux: add defensive checks to binary path" This reverts commit cb36f4537e63d53b198c2a87d1e03880895631bd.	2019-04-02 11:17:12 -07:00
Michael Schurter	fc5487dbbc	executor/linux: add defensive checks to binary path	2019-04-02 09:40:53 -07:00
Michael Schurter	ff639f9ccc	docker: improve stats names and comments	2019-04-02 09:18:38 -07:00
Mahmood Ali	e2723399f3	Update drivers/docker/stats.go comment Co-Authored-By: schmichael <michael.schurter@gmail.com>	2019-04-02 09:09:17 -07:00
Michael Schurter	df3467a6ce	docker: fix send after close panic in stats destCh was being written to by one goroutine and closed by another goroutine. This panic occurred in Travis: ``` === FAIL: drivers/docker TestDockerCoordinator_ConcurrentPulls (117.66s) === PAUSE TestDockerCoordinator_ConcurrentPulls === CONT TestDockerCoordinator_ConcurrentPulls panic: send on closed channel goroutine 5358 [running]: github.com/hashicorp/nomad/drivers/docker.dockerStatsCollector(0xc0003a4a20, 0xc0003a49c0, 0x3b9aca00) /home/travis/gopath/src/github.com/hashicorp/nomad/drivers/docker/stats.go:108 +0x167 created by github.com/hashicorp/nomad/drivers/docker.TestDriver_DockerStatsCollector /home/travis/gopath/src/github.com/hashicorp/nomad/drivers/docker/stats_test.go:33 +0x1ab ``` The 2 ways to fix this kind of error are to either (1) add extra coordination around multiple goroutines writing to a chan or (2) make it so only one goroutines writes to a chan. I implemented (2) first as it's simpler, but @notnoop pointed out since the same destCh in reused in the stats loop there's now a double close panic possible! So this implements (1) by adding a *usageSender struct for handling concurrent senders and closing.	2019-04-02 08:28:08 -07:00
Michael Schurter	7d49bc4c71	executor/linux: make chroot binary paths absolute Avoid libcontainer.Process trying to lookup the binary via $PATH as the executor has already found where the binary is located.	2019-04-01 15:45:31 -07:00
Mahmood Ali	cb16ad7e3f	comment configureTLogging	2019-04-01 16:52:58 -04:00
Mahmood Ali	81f4f07ed7	rename fifo methods for clarity	2019-04-01 16:52:58 -04:00
Mahmood Ali	88dc4a255a	avoid opening files just to close them	2019-04-01 13:24:18 -04:00
Mahmood Ali	dac2cd3df3	Add test cases for waiting on children Also, make the test use files just like in the non-test case.	2019-04-01 13:24:18 -04:00
Mahmood Ali	2648d4176c	Merge pull request #5468 from hashicorp/b-test-exec-kill-wait Adding tests for killing task without grace	2019-03-31 20:37:15 -04:00
Michael Schurter	b8d1dd95a0	Update drivers/shared/executor/executor_test.go Co-Authored-By: notnoop <mahmood@notnoop.com>	2019-03-31 20:34:24 -04:00
Michael Schurter	294d405397	docker: restore pre-0.9 container names As far as I can tell Nomad itself does not use the container name after container creation, so this should be safe. OP: https://groups.google.com/d/topic/nomad-tool/kYkyERfVRXE/discussion v0.8.7 code: https://github.com/hashicorp/nomad/blob/v0.8.7/client/driver/docker.go#L1530-L1531	2019-03-29 13:55:43 -07:00
Mahmood Ali	df5d7ba50d	fix test setup	2019-03-26 09:15:22 -04:00
Mahmood Ali	d737a9836a	test kill wait	2019-03-26 09:15:22 -04:00
Mahmood Ali	b2b7618a1c	clarify unknown signal log line (#5466 )	2019-03-25 17:19:43 -04:00
Mahmood Ali	2a7b18aec4	Revert "executor: synchronize exitState accesses" (#5449 ) Reverts hashicorp/nomad#5433 Apparently, channel communications can constitute Happens-Before even for proximate variables, so this syncing isn't necessary. > _The closing of a channel happens before a receive that returns a zero value because the channel is closed._ https://golang.org/ref/mem#tmp_7	2019-03-20 07:33:05 -04:00
Nick Ethier	505e36ff7a	Merge pull request #5429 from hashicorp/b-blocking-executor-shutdown executor: block shutdown on process exiting	2019-03-19 15:18:01 -04:00
Mahmood Ali	a1776dba34	executor: synchronize exitState accesses exitState is set in `wait()` goroutine but accessed in a different `Wait()` goroutine, so accesses must be synchronized by a lock.	2019-03-17 11:56:58 -04:00
Nick Ethier	7418d09cf0	executor: block shutdown on process exiting	2019-03-15 23:50:17 -04:00
Mahmood Ali	8ec49fc133	Handle when cannot fetch docker logs (#5420 ) Fix #5418 When using a docker logger that doesn't support log streaming through API, currently docker logger runs a tight loop of Docker API calls unexpectedly. This change ensures we stop fetching logs early. Also, this adds some basic backoff strategy when Docker API logging fails unexpectedly, to avoid accidentally DoSing the docker daemon.	2019-03-14 16:23:11 -04:00
Mahmood Ali	fb55717b0c	Regenerate Proto files (#5421 ) Noticed that the protobuf files are out of sync with ones generated by 1.2.0 protoc go plugin. The cause for these files seem to be related to release processes, e.g. [0.9.0-beta1 preperation](`ecec3d38de (diff-da4da188ee496377d456025c2eab4e87)`), and [0.9.0-beta3 preperation](`b849d84f2f`). This restores the changes to that of the pinned protoc version and fails build if protobuf files are out of sync. Sample failing Travis job is that of the first commit change: https://travis-ci.org/hashicorp/nomad/jobs/506285085	2019-03-14 10:56:27 -04:00
Preetha Appan	7f0d9e0c8e	minor review feedback	2019-03-13 13:27:28 -05:00
Preetha Appan	273f1e993d	Validate all auth fields being empty rather than just email This fixes a regression in 0.9 beta3 compared to 0.8.7 in validating docker auth config	2019-03-13 11:47:37 -05:00
Preetha Appan	549ae657f0	Dont require email address for docker auth	2019-03-13 11:08:56 -05:00
Preetha	7759166b0d	Merge pull request #5380 from quasilyte/patch-1 drivers/shared/executor: fix strings.Replace call	2019-03-06 11:47:01 -06:00
Mahmood Ali	bb32ba8784	Support driver config fields being set to nil (#5391 ) To pick up https://github.com/hashicorp/hcl2/pull/90	2019-03-05 21:47:06 -05:00
Iskander (Alex) Sharipov	e69909fbd3	drivers/shared/executor: fix strings.Replace call strings.Replace call with n=0 argument makes no sense as it will do nothing. Probably -1 is intended. Signed-off-by: Iskander Sharipov <quasilyte@gmail.com>	2019-03-02 00:33:17 +03:00
Mahmood Ali	4726cb2207	logging.Type over logging.Driver	2019-02-28 16:40:18 -05:00
Mahmood Ali	104869c0e1	drivers/docker: rename logging `type` to `driver` Docker uses the term logging `driver` in its public documentations: in `docker` daemon config[1], `docker run` arguments [2] and in docker compose file[3]. Interestingly, docker used `type` in its API [4] instead of everywhere else. It's unfortunate that Nomad used `type` modeling after the Docker API rather than the user facing documents. Nomad using `type` feels very non-user friendly as it's disconnected from how Docker markets the flag and shows internal representation instead. Here, we rectify the situation by introducing `driver` field and prefering it over `type` in logging. [1] https://docs.docker.com/config/containers/logging/configure/ [2] https://docs.docker.com/engine/reference/run/#logging-drivers---log-driver [3] https://docs.docker.com/compose/compose-file/#logging [4] https://docs.docker.com/engine/api/v1.39/#operation/ContainerCreate	2019-02-28 16:04:03 -05:00
Mahmood Ali	67e2a0ac05	docker: report unhealthy in unsupported Windows (#5356 ) On Windows, Nomad only supports Windows containers, so report as unhealthy otherwise.	2019-02-27 08:10:23 -05:00
Michael Schurter	812f1679e2	Merge pull request #5352 from hashicorp/b-leaked-logmon logmon fixes	2019-02-26 08:35:46 -08:00
Danielle Tomlinson	e250aad31b	Merge pull request #5355 from hashicorp/dani/windows-dockerstats docker: Support Stats on Windows	2019-02-26 16:39:48 +01:00
Danielle Tomlinson	e3dc80bea3	docker: Return undetected before first detection This commit causes the docker driver to return undetected before it first establishes a connection to the docker daemon. This fixes a bug where hosts without docker installed would return as unhealthy, rather than undetected.	2019-02-25 11:02:42 +01:00
Danielle Tomlinson	8aff115fca	docker: Support stats on Windows	2019-02-22 14:19:58 +01:00
Michael Schurter	38821954b7	plugins: squelch context Canceled error logs As far as I can tell this is the most straightforward and resilient way to skip error logging on context cancellation with grpc streams. You cannot compare the error against context.Canceled directly as it is of type `*status.statusError`. The next best solution I found was: ```go resp, err := stream.Recv() if code, ok := err.(interface{ Code() code.Code }); ok { if code.Code == code.Canceled { return } } ``` However I think checking ctx.Err() directly makes the code much easier to read and is resilient against grpc API changes.	2019-02-21 15:32:18 -08:00
Mahmood Ali	6d30284ec9	Merge pull request #5341 from hashicorp/ci-windows-docker Run Docker tests in Windows AppVeyor CI	2019-02-21 13:17:33 -05:00
Danielle Tomlinson	2610e2d9ef	docker: Avoid leaking containers during Reattach Currently if a docker_logger cannot be reattached to, we will leak the container that was being used. This is problematic if e.g using static ports as it means you can never recover your task, or if a service is expensive to run and will then be running without supervision.	2019-02-20 17:47:06 +01:00
Danielle Tomlinson	953755ce24	Merge pull request #5335 from hashicorp/dani/docker-logger-spawn Increase resiliency of docker driver logging	2019-02-20 17:16:05 +01:00
Michael Schurter	a1645edb0b	Update drivers/docker/docklog/docker_logger.go Co-Authored-By: dantoml <dani@tomlinson.io>	2019-02-20 17:12:56 +01:00
Danielle Tomlinson	2f18441a47	docker: Respawn docker logger during recovery Sometimes the nomad docker_logger may be killed by a service manager when restarting the client for upgrades or reliability reasons. Currently if this happens, we leak the users container and try to reschedule over it. This commit adds a new step to the recovery process that will spawn a new docker logger process that will fetch logs from _the current timestamp_. This is to avoid restarting users tasks because our logging sidecar has failed.	2019-02-20 17:12:56 +01:00
Mahmood Ali	8c82c19831	tests: IsTravis() -> IsCI() Replace IsTravis() references that is intended for more CI environments rather than for Travis environment specifically.	2019-02-20 08:21:03 -05:00
Mahmood Ali	fedab3d7b0	driver/docker: Skip failing Windows tests Skip currently Docker tests that fail on Windows for further investigation.	2019-02-20 07:48:02 -05:00
Mahmood Ali	dd8a5c862a	Merge pull request #5321 from hashicorp/b-portmap-regression drivers: restore port_map old json support	2019-02-19 20:58:37 -05:00
Mahmood Ali	4def8529db	driver/docker: use BlockAttrs for storage_opts storage_opts is a new field in 0.9 cycle and doesn't have backward compatibility constraints.	2019-02-19 20:35:28 -05:00
Mahmood Ali	a394cd63f4	CVE-2019-5736: Update libcontainer depedencies (#5334 ) * CVE-2019-5736: Update libcontainer depedencies Libcontainer is vulnerable to a runc container breakout, that was reported as CVE-2019-5736[1]. Upgrading vendored libcontainer with the fix. The runc changes are captured in `369b920277` . [1] https://seclists.org/oss-sec/2019/q1/119	2019-02-19 20:21:18 -05:00
Danielle Tomlinson	3cf3ac7eac	dlogger: Increase resilience to docker api failure This commit adds some extra resiliency to the docker logger in the case of API failure from the docker daemon, by restarting the stream from the current point in time if the stream returns and the container is still running.	2019-02-19 15:17:54 +01:00
Mahmood Ali	46cd3c3f55	drivers: restore port_map old json support This ensures that `port_map` along with other block like attribute declarations (e.g. ulimit, labels, etc) can handle various hcl and json syntax that was supported in 0.8. In 0.8.7, the following declarations are effectively equivalent: ``` // hcl block port_map { http = 80 https = 443 } // hcl assignment port_map = { http = 80 https = 443 } // json single element array of map (default in API response) {"port_map": [{"http": 80, "https": 443}]} // json array of individual maps (supported accidentally iiuc) {"port_map: [{"http": 80}, {"https": 443}]} ``` We achieve compatbility by using `NewAttr("...", "list(map(string))", false)` to be serialized to a `map[string]string` wrapper, instead of using `BlockAttrs` declaration. The wrapper merges the list of maps automatically, to ease driver development. This approach is closer to how v0.8.7 implemented the fields [1][2], and despite its verbosity, seems to perserve 0.8.7 behavior in hcl2. This is only required for built-in types that have backward compatibility constraints. External drivers should use `BlockAttrs` instead, as they see fit. [1] https://github.com/hashicorp/nomad/blob/v0.8.7/client/driver/docker.go#L216 [2] https://github.com/hashicorp/nomad/blob/v0.8.7/client/driver/docker.go#L698-L700	2019-02-16 11:37:33 -05:00
Danielle Tomlinson	be431cb83d	Merge pull request #5326 from hashicorp/dani/json-submission api: Fix compatibility with pre 0.9 API jobs	2019-02-14 18:56:13 +01:00
Mahmood Ali	1430f94b2a	Update drivers/docker/config_test.go Co-Authored-By: dantoml <dani@tomlinson.io>	2019-02-14 18:55:10 +01:00
Danielle Tomlinson	3f696be06b	Add regression test for parsing null mounts	2019-02-14 18:03:35 +01:00
Danielle Tomlinson	a3a1491958	drivers/docker: SIGTERM to stop containers Windows Docker daemon does not support SIGINT, SIGTERM is the semantic equivalent that allows for graceful shutdown before being followed up by a SIGKILL.	2019-02-14 15:38:54 +00:00
Mahmood Ali	f7102cd01d	tests: add hcl task driver config parsing tests (#5314 ) * drivers: add config parsing tests Add basic tests for parsing and encoding task config. * drivers/docker: fix some config declarations * refactor and document config parse helpers	2019-02-12 14:46:37 -05:00
Mahmood Ali	aec9120994	drivers/java: restore 0.8.7 java version detection (#5317 ) Restore 0.8.x behavior where java driver is marked as detected when `java -version` exits with 0 but returns unexpected output. Furthermore, we restore behavior when `java -version` where we parse the first three lines of `java -version` but ignore rest. If `java -version` returns less than 3 lines, Nomad 0.8.7 would panic. In this implementation, we'd still mark java as detected but returns empty version. The 0.8.7 logic for detecting java version is found in https://github.com/hashicorp/nomad/blob/v0.8.7/client/driver/java.go#L132-L172 . I punt on revamping how we can be more resilient to java -version syntax, and aimed for preserving existing behavior instead.	2019-02-12 13:41:26 -05:00
Michael Schurter	3b84e08fa4	Merge pull request #5297 from hashicorp/b-docker-logging Docker: Fix logging config parsing	2019-02-11 06:57:52 -08:00
Gertjan Roggemans	94ca78354b	docker: Fix volume driver_config options spec (#5309 ) Fixes #5308	2019-02-11 09:18:44 -05:00
Michael Schurter	e1e4b10884	docker: fix logging config parsing Fixes https://groups.google.com/d/topic/nomad-tool/B3Uo6Kns2BI/discussion	2019-02-04 11:07:57 -08:00
Nick Ethier	e7ea26449e	client: fix bug during 0.8 state up grade that causes external drivers to fail	2019-01-30 14:22:29 -05:00
Alex Dadgar	bc804dda2e	Nomad 0.9.0-beta1 generated code	2019-01-30 10:49:44 -08:00
Nick Ethier	bb9a8afe9b	executor: fix bug and add tests for incorrect stats timestamp reporting	2019-01-28 21:57:45 -05:00
Nick Ethier	bcbed3c532	Merge pull request #5248 from hashicorp/b-rawexec-leak Fix leaked executor in raw_exec	2019-01-28 21:18:31 -05:00
Alex Dadgar	991bcc3ef1	Don't fall through	2019-01-28 09:53:19 -08:00
Alex Dadgar	403faa0d7c	comment	2019-01-28 09:47:53 -08:00
Nick Ethier	1f4c26e19e	raw_exec: ensure executor is killed after task is stopped	2019-01-25 23:06:31 -05:00
Alex Dadgar	68ced492fb	Fix killing non-existant container with a kill timeout	2019-01-25 16:21:51 -08:00
Nick Ethier	448f93f1e9	Merge pull request #5237 from hashicorp/b-docker-09upgrade-shutdown drivers/docker: handle shutdown of upgraded tasks correctly	2019-01-25 12:13:03 -05:00
Nick Ethier	ac693fe6ac	docker: only log missing bridge_ip on initial fingerprint	2019-01-24 22:34:57 -05:00
Nick Ethier	c7cc81924d	drivers/docker: handle shutdown of upgraded tasks correctly	2019-01-24 14:21:59 -05:00
Nick Ethier	a5823a270c	Merge pull request #5209 from hashicorp/f-driver-upgradepath-test e2e tests for upgrade path to driver plugins	2019-01-23 21:03:29 -05:00
Nick Ethier	be976d9c9a	Merge branch 'master' into f-driver-upgradepath-test * master: (23 commits) tests: avoid assertion in goroutine spell check ci: run checkscripts tests: deflake TestRktDriver_StartWaitRecoverWaitStop drivers/rkt: Remove unused github.com/rkt/rkt drivers/rkt: allow development on non-linux cli: Hide `nomad docker_logger` from help output api: test api and structs are in sync goimports until make check is happy nil check node resources to prevent panic tr: use context in as select statement move pluginutils -> helper/pluginutils vet goimports gofmt Split hclspec move hclutils Driver tests do not use hcl2/hcl, hclspec, or hclutils move reattach config loader and singleton ...	2019-01-23 21:01:24 -05:00
Nick Ethier	5b9013528e	drivers: add docker upgrade path and e2e test	2019-01-23 14:44:42 -05:00
Alex Dadgar	48dfbbcad9	fix docker runtime attribute	2019-01-23 10:58:46 -08:00
Mahmood Ali	698f0a1575	Merge pull request #5229 from hashicorp/r-grabbag-201901019 Grab bag of small changes	2019-01-23 13:06:51 -05:00
Mahmood Ali	8d76bb0e82	tests: avoid assertion in goroutine	2019-01-23 12:34:16 -05:00
Mahmood Ali	464a70fda6	tests: deflake TestRktDriver_StartWaitRecoverWaitStop	2019-01-23 10:54:49 -05:00
Mahmood Ali	a64ad99b94	drivers/rkt: allow development on non-linux Avoid depending on OS-specific library for some simple structs.	2019-01-23 10:53:49 -05:00
Michael Schurter	32daa7b47b	goimports until make check is happy	2019-01-23 06:27:14 -08:00
Michael Schurter	be0bab7c3f	move pluginutils -> helper/pluginutils I wanted a different color bikeshed, so I get to paint it	2019-01-22 15:50:08 -08:00
Alex Dadgar	4bdccab550	goimports	2019-01-22 15:44:31 -08:00
Alex Dadgar	b7a65676fe	gofmt	2019-01-22 15:43:34 -08:00
Alex Dadgar	72a5691897	Driver tests do not use hcl2/hcl, hclspec, or hclutils	2019-01-22 15:43:34 -08:00
Alex Dadgar	b2c7268843	move reattach config	2019-01-22 15:11:58 -08:00
Alex Dadgar	cdcd3c929c	loader and singleton	2019-01-22 15:11:57 -08:00
Michael Schurter	9edff19625	test: port SignalFailure test from 0.8 Also fix signal error handling in mock_driver.	2019-01-22 08:08:08 -08:00
Nick Ethier	b840a2eb7b	drivers: fix func naming	2019-01-18 18:31:02 -05:00
Nick Ethier	e3c6f89b9a	drivers: use consts for task handle version	2019-01-18 18:31:01 -05:00
Nick Ethier	9dd4eb3581	drivers: add upgrade path for rawexec, java, rkt and qemu	2019-01-18 18:31:01 -05:00
Nick Ethier	6804450c69	cleanup code comments and small fixes from refactor	2019-01-18 18:31:01 -05:00
Nick Ethier	05bd369d1f	driver: add pre09 migration logic	2019-01-18 18:31:01 -05:00
Nick Ethier	e5a6fc9271	executor: add pre 0.9 client and wrapper	2019-01-18 18:30:58 -05:00
Mahmood Ali	5df63fda7c	Merge pull request #5190 from hashicorp/f-memory-usage Track Basic Memory Usage as reported by cgroups	2019-01-18 16:46:02 -05:00
Danielle Tomlinson	b65bf78513	docker: Fix missing import	2019-01-17 18:44:27 +01:00
Danielle Tomlinson	7fca934509	chore: General Cleanup	2019-01-17 18:43:14 +01:00
Danielle Tomlinson	e73962d8d6	docker: Only run Cleanup test on unix os'	2019-01-17 18:43:14 +01:00
Danielle Tomlinson	af202f347f	chore: goimports exec driver	2019-01-17 18:43:14 +01:00
Danielle Tomlinson	30a5e25d94	fixup: Typo in docker test	2019-01-17 18:43:14 +01:00
Danielle Tomlinson	82018cd030	chore: Fix docklog linting	2019-01-17 18:43:14 +01:00
Danielle Tomlinson	3b2ff2005b	chore: Fix docker test linting Due to https://github.com/tsenart/deadcode/issues/3 we can't specify these consts on their own. This moves them into the _platform_test.go files to avoid creating a package that only exposes a couple of values.	2019-01-17 18:43:14 +01:00
Danielle Tomlinson	15b1571882	drivers/exec: SIGINT unavailable on windows	2019-01-17 18:43:14 +01:00
Danielle Tomlinson	d78120f890	rawexec: Fix Exec test on windows	2019-01-17 18:43:14 +01:00
Danielle Tomlinson	65457dd2f2	rawexec: SIGINT is not available on Windows	2019-01-17 18:43:14 +01:00
Danielle Tomlinson	de86435cf8	docker: Test cleanup for windows * Docker for Windows does not support ulimits * Use filepath.ToSlash to test workdir * Convert expected mount paths to system style * Skip security-opt test on windows - Windows does not support seccomp, and it's unclear which options are available. * Skip StartN due to lack of sigint * docker: Use api to get image info on windows * No bridge on windows * Stop hardcoding /bin/	2019-01-17 18:43:14 +01:00
Danielle Tomlinson	3d2f0bb7da	docker: ExpandPath tests validate slashpath	2019-01-17 18:43:13 +01:00
Danielle Tomlinson	37190a5595	dockerlogger: Fix tests on windows Uses the home directory and windows path expansion, as c:\tmp doesn't necessarily exist, and mktemp would involve unnecessarily complicating the commands.	2019-01-17 18:43:13 +01:00
Danielle Tomlinson	e6c0738b65	Expand unix build definition	2019-01-17 18:43:13 +01:00
Mahmood Ali	677e11da86	fix imports	2019-01-16 19:53:48 -05:00
Michael Schurter	bea4d297b9	Merge pull request #5197 from hashicorp/b-rkt-cpu rkt: revert to pre-0.9 --cpu flag	2019-01-16 15:05:21 -08:00
Preetha Appan	55319b05d1	clean up read access	2019-01-16 11:04:11 -06:00
Preetha Appan	469a286b1b	Refactor logging in drivers to use a tri-state boolean Changes logging warnings/errors only if the state changes from healthy to unhealthy	2019-01-16 10:19:31 -06:00
Preetha Appan	0c2c0a2d43	Make docker driver logging less redundant	2019-01-16 10:16:57 -06:00
Michael Schurter	1879a7f788	rkt: revert to pre-0.9 --cpu flag See https://github.com/hashicorp/nomad/issues/3394#issuecomment-453296121 for details. During 0.9 development we switched to shares, but we'd prefer to maintain backward compat.	2019-01-15 13:15:28 -08:00
oleksii.shyman	e41fbf7577	Add support for docker runtimes - docker fingerprint issues a docker api system info call to get the list of supported OCI runtimes. - OCI runtimes are reported as comma separated list of names - docker driver is aware of GPU runtime presence - docker driver throws an error when user tries to run container with GPU, when GPU runtime is not present - docker GPU runtime name is configurable	2019-01-15 11:34:47 -08:00
Danielle Tomlinson	b918d25e62	Merge pull request #5192 from hashicorp/dani/executor-close executor: Always close stdout/stderr fifos	2019-01-15 17:49:04 +01:00
Danielle Tomlinson	f120c8f8f6	Merge pull request #5184 from hashicorp/dani/b-logmon-reattach docker: Terminate dockerlogger	2019-01-15 16:48:40 +01:00
Danielle Tomlinson	7f1ff3fab6	executor: Always close stdout/stderr fifos	2019-01-15 16:47:27 +01:00

... 2 3 4 5 6 ...

601 Commits