open-nomad

Commit Graph

Author	SHA1	Message	Date
Mahmood Ali	84ded28c6d	drivers/docker: enforce volumes.enabled (#4983 ) When volumes.enable flag is off in Docker driver, disable all mounts of paths outside alloc dir.	2018-12-11 14:22:50 -05:00
Danielle Tomlinson	d11c62fa3a	Merge pull request #4963 from hashicorp/dani/f-preempt-alloc-wait client: Wait for preemptions to terminate	2018-12-11 18:06:34 +01:00
Danielle Tomlinson	ed1791f4bf	client: Style: use fluent style for building loggers	2018-12-11 18:03:45 +01:00
Danielle Tomlinson	805669ead4	client: Correctly pass a noop PrevAllocMigrator when restoring	2018-12-11 15:46:58 +01:00
Danielle Tomlinson	6fb5ca6ad5	allocrunner: Test alloc runners should include a noop migrator	2018-12-11 13:12:35 +01:00
Danielle Tomlinson	4b4b85e3f4	allocwatcher: Cleanup new migrator/watcher interface	2018-12-11 13:12:35 +01:00
Danielle Tomlinson	83720575de	client: Unify handling of previous and preempted allocs	2018-12-11 13:12:35 +01:00
Michael Schurter	8808ab9cea	Merge pull request #4953 from hashicorp/b-script-context-wrapper consul: add ScriptExecutor context wrapper	2018-12-10 17:22:53 -08:00
Michael Schurter	4c5f3ae82c	Merge pull request #4952 from hashicorp/b-script-context consul: fix script checks exiting after 1 run	2018-12-10 17:22:15 -08:00
Danielle Tomlinson	dff7093243	client: Wait for preempted allocs to terminate When starting an allocation that is preempting other allocs, we create a new group allocation watcher, and then wait for the allocations to terminate in the allocation PreRun hooks. If there's no preempted allocations, then we simply provide a NoopAllocWatcher.	2018-12-11 00:59:18 +01:00
Danielle Tomlinson	2cdef6a7b4	allocwatcher: Add Group AllocWatcher The Group Alloc watcher is an implementation of a PrevAllocWatcher that can wait for multiple previous allocs before terminating. This is to be used when running an allocation that is preempting upstream allocations, and thus only supports being ran with a local alloc watcher. It also currently requires all of its child watchers to correctly handle context cancellation. Should this be a problem, it should be fairly easy to implement a replacement using channels rather than a waitgroup. It obeys the PrevAllocWatcher interface for convenience, but it may be better to extract Migration capabilities into a seperate interface for greater clarity.	2018-12-11 00:58:27 +01:00
Alex Dadgar	457c6eb398	typo	2018-12-10 15:35:26 -08:00
Alex Dadgar	508a3dfa49	merge 087 and 090 changelog	2018-12-10 15:34:21 -08:00
Mahmood Ali	97829a3f02	fix dtestutil.NewDriverHarness ref	2018-12-08 09:58:23 -05:00
Mahmood Ali	021d3720b5	Merge pull request #4950 from hashicorp/b-exc-libcontainer-kill executor: kill all container processes	2018-12-08 09:52:42 -05:00
Nick Ethier	32057b6f7f	Merge pull request #4973 from emate/recover-filerotator-from-io-errors Recover from any possible io error when invoking Write on FileRotator	2018-12-08 00:05:42 -05:00
Alex Dadgar	695fa416a6	Merge pull request #4965 from hashicorp/b-gc-running Don't GC running but desired stop allocations	2018-12-07 13:36:33 -08:00
Marcin Matlaszek	39eec70f31	Recover from any possible io error when invoking Write on FileRotator As of now, FileRotator uses bufio.Write under the hood to write data to configured output file. Due to the way how bufio handles any occurred io error - saves it into `err` variable never resetting it automatically - any operation like `Write`, `Flush` etc will become a no-op, returning the very same, saved error (eg. Out of disk space) even when the problem is fixed (eg. disk space is available again). That automatically means that FileRotator will stop writing any logs, reporting the same error over and over again, even if it's no longer valid. This PR fixes it by resetting the bufio Writer, which resets any errors and tries to write requested data.	2018-12-07 18:22:29 +01:00
Mahmood Ali	7d5b5bb5f9	Merge pull request #4933 from hashicorp/f-mount-device Mount Devices in container based drivers	2018-12-07 10:32:03 -05:00
Mahmood Ali	91a67f347d	Vendor libcontainer/devices	2018-12-07 09:13:27 -05:00
Danielle Tomlinson	8100252116	Merge pull request #4960 from hashicorp/dani/b-gc-tests Re-enable Client GC tests	2018-12-06 23:18:36 +01:00
Mahmood Ali	a7b205daf2	Merge pull request #4955 from hashicorp/fix-docker-tests-20181203 Fix docker driver tests	2018-12-06 16:41:33 -05:00
Danielle Tomlinson	e3621c55fa	gc: Fix maxallocs integration test	2018-12-06 21:50:50 +01:00
Mahmood Ali	9e825f880c	Use absolute path in example device plugin deviceDir is used for specifying mount/device host paths, and those should be absolute paths.	2018-12-06 15:46:35 -05:00
Mahmood Ali	bdc53b1d8e	driver/rkt: mount plugin devices	2018-12-06 15:46:35 -05:00
Mahmood Ali	2c0fd2a902	driver/lxc: mount plugin devices Also, LXC requires target paths to be relative. Container paths in LXC binds should never be absolute paths, so we strip any preceeding `/`, even if a user sets one.	2018-12-06 15:46:35 -05:00
Mahmood Ali	699875eb1c	fixup: add missed docker utils test	2018-12-06 15:46:35 -05:00
Mahmood Ali	e9557ae596	tests: ensure image is loaded as test setup	2018-12-06 15:36:43 -05:00
Michael Lange	81c2d8b4a2	Merge pull request #4967 from hashicorp/b-ui-stat-charts-can-escape-canvas UI: Keep line charts in their canvases at all times	2018-12-06 10:56:37 -08:00
Danielle Tomlinson	62b98e64ca	client/gc: Replace GC integration test with unit The previous integration test was broken during the client refactor, and it seems to be some sort of race with state updating. I'm going to try and construct a replacement test as part of work on performance, but for now, the underlying behaviour is still being tested.	2018-12-06 12:28:23 +01:00
Danielle Tomlinson	f6e474fd55	client: Re-enable GC tests	2018-12-06 12:28:23 +01:00
Danielle Tomlinson	d043532cb0	allocrunner: Basic test alloc runner	2018-12-06 12:28:23 +01:00
Michael Lange	795ea7eade	Grow the default 0 to 1 bounds to the domain of the data when necessary	2018-12-05 22:07:44 -08:00
Alex Dadgar	b18a0f77a2	Merge pull request #4966 from hashicorp/b-failure-event Fix various bugs with task events	2018-12-05 14:43:50 -08:00
Alex Dadgar	b39c21d49c	Fix various bugs with task events Fixes the following: * Emitting events when the task fails to start * Don't double emit events on task shutdown (nomad stop) * Don't emit a OOM kill metric unless actually OOM'd	2018-12-05 14:27:07 -08:00
Alex Dadgar	14a61ea3ea	Don't GC running but desired stop allocations This PR fixes an edge case where we could GC an allocation that was in a desired stop state but had not terminated yet. This can be hit if the client hasn't shutdown the allocation yet or if the allocation is still shutting down (long kill_timeout). Fixes https://github.com/hashicorp/nomad/issues/4940	2018-12-05 13:01:12 -08:00
Mahmood Ali	b55fb642f1	driver/docker: honor plugin devices	2018-12-04 21:31:28 -05:00
Mahmood Ali	a580cef986	refactor device manipulation	2018-12-04 20:55:59 -05:00
Mahmood Ali	3a18105d06	drivers/exec: refactor stop/kill tests Simplify the tests to do all assertions within the main goroutine and account for status propagation delay.	2018-12-04 20:34:43 -05:00
Mahmood Ali	adb4d69576	Merge pull request #4956 from hashicorp/b-vault-client-tweaks-followup server/vault: Lock Vault expiration tracking	2018-12-04 19:46:59 -05:00
Mahmood Ali	366f478f8f	Merge pull request #4959 from hashicorp/fix-rkt-tests-20181204 tests: fix rkt tests	2018-12-04 19:46:41 -05:00
Mahmood Ali	428d35a5a9	executor: Keep 0.8.6 exit code for wait() failures 0.8.6 uses exit code 1 when `proc.Wait()` fails: https://github.com/hashicorp/nomad/blob/v0.8.6/client/driver/executor/executor.go#L442	2018-12-04 19:38:25 -05:00
Mahmood Ali	8df9de6fd5	driver/rkt: use rkt environment The rkt command itself needs an environment with PATH set to find iptables.	2018-12-04 14:00:45 -05:00
Preetha	8068d9f64e	Merge pull request #4949 from hashicorp/b-neg-running-summary Add guards around subtracting summary count	2018-12-04 12:52:58 -06:00
Mahmood Ali	f8efc40b8b	tests: stop integration tests tasks explicitly Also update the new recommended `nomad job` subcommands	2018-12-04 11:50:59 -05:00
Dan Brown	8aebe8c47d	Add Reference Architecture and Deployment Guide (#4768 ) * Add Nomad RA * Add deployment guide and nav * Deployment Guide update * Minor typo fixes * Update diagrams * Fixes for review * Link fixes and typo fix * Edits following review - Update image text from "zone" to "datacenter" to match Nomad terminology - Clean up text based on Preetha's feedback * Text updates Based on feedback from Rob * Update diagrams * fixing spelling * Add suggestions from Preetha and Omar	2018-12-04 11:49:35 -05:00
Mahmood Ali	06a5cadf35	drivers/rkt: use image isolation for rkt	2018-12-04 11:40:10 -05:00
Mahmood Ali	178365848e	tests: don't assert in WaitForResult WaitForResult expects body to fail and retries few times before giving up. Assertions inside the testfn body causes it to terminate abruptly without retrying.	2018-12-04 11:40:10 -05:00
Mahmood Ali	50e38104a5	server/nomad: Lock Vault expiration tracking `currentExpiration` field is accessed in multiple goroutines: Stats and renewal, so needs locking. I don't anticipate high contention, so simple mutex suffices.	2018-12-04 09:29:48 -05:00
Mahmood Ali	f8ceeebf11	no t.Parallel() in excutor table driven tests (#4948 ) When `t.Parallel()` is used inside a `t.Run()` sub-set, the closure doesn't behave as expected, and some cases effectively get skipped. More details can be found in https://gist.github.com/posener/92a55c4cd441fc5e5e85f27bca008721	2018-12-04 09:04:04 -05:00

1 2 3 4 5 ...

13454 Commits All Branches Search

13454 Commits

All Branches