Mahmood Ali
fa9b9028a5
Use max 3 precision in displaying floats
...
When formating floats in `nomad node status`, use a maximum precision of
3.
2018-12-10 12:18:24 -05:00
Mahmood Ali
14668f48d1
device attributes in nomad node status -verbose
...
This reports device attributes like the following:
```
$ nomad node status -self -verbose
ID = f7adb958-29e1-2a5a-2303-9d61ffaab33a
Name = mars.local
Class = <none>
DC = dc1
Drain = false
Eligibility = eligible
Status = ready
Uptime = 12h40m13s
Drivers
Driver Detected Healthy Message Time
docker true true healthy 2018-12-10T11:47:19-05:00
...
Attributes
cpu.arch = amd64
cpu.frequency = 2200
cpu.modelname = Intel(R) Core(TM) i7-8750H CPU @ 2.20GHz
cpu.numcores = 12
...
Device Group Attributes
Device Group = nomad/file/mock
block_device = sda1
filesystem = ext4
size = 63.2 GB
Meta
```
2018-12-10 12:18:24 -05:00
Mahmood Ali
9f69b8bfec
Rename helper_stats -> helper_devices
2018-12-10 12:18:24 -05:00
Mahmood Ali
ced76978bb
devices/nvidia: memory state as the summary stat
2018-12-10 12:18:24 -05:00
Mahmood Ali
97829a3f02
fix dtestutil.NewDriverHarness ref
2018-12-08 09:58:23 -05:00
Mahmood Ali
021d3720b5
Merge pull request #4950 from hashicorp/b-exc-libcontainer-kill
...
executor: kill all container processes
2018-12-08 09:52:42 -05:00
Nick Ethier
35268fdb54
executor: misspell
2018-12-08 01:52:06 -05:00
Nick Ethier
32057b6f7f
Merge pull request #4973 from emate/recover-filerotator-from-io-errors
...
Recover from any possible io error when invoking Write on FileRotator
2018-12-08 00:05:42 -05:00
Preetha Appan
f60c52c8ba
Score combinations of allocs from multiple devices for preemption
2018-12-07 18:35:47 -06:00
Chris Baker
4bbb8106c1
updated memberlist dependency to latest, which is missing NMD-1173 error
2018-12-07 22:15:05 +00:00
Chris Baker
59beae35df
nomad/rpc listener: modified to throttle logging on "permanent" Accept() errors as well (with a higher delay cap)
2018-12-07 22:14:15 +00:00
Alex Dadgar
695fa416a6
Merge pull request #4965 from hashicorp/b-gc-running
...
Don't GC running but desired stop allocations
2018-12-07 13:36:33 -08:00
Chris Baker
707bac0a7b
rpc accept loop: added backoff on logging for failed connections, in case there is a fast fail loop (NMD-1173)
2018-12-07 20:12:55 +00:00
Nick Ethier
86e9c11ec2
executor: don't drop errors when configuring libcontainer cfg, add nil check on resources
2018-12-07 14:03:42 -05:00
Marcin Matlaszek
39eec70f31
Recover from any possible io error when invoking Write on FileRotator
...
As of now, FileRotator uses bufio.Write under the hood to write data to
configured output file. Due to the way how bufio handles any occurred io
error - saves it into `err` variable never resetting it automatically -
any operation like `Write`, `Flush` etc will become a no-op, returning the very same,
saved error (eg. Out of disk space) even when the problem is fixed (eg. disk
space is available again).
That automatically means that FileRotator will stop writing any logs,
reporting the same error over and over again, even if it's no longer
valid.
This PR fixes it by resetting the bufio Writer, which resets any errors
and tries to write requested data.
2018-12-07 18:22:29 +01:00
Mahmood Ali
7d5b5bb5f9
Merge pull request #4933 from hashicorp/f-mount-device
...
Mount Devices in container based drivers
2018-12-07 10:32:03 -05:00
Mahmood Ali
91a67f347d
Vendor libcontainer/devices
2018-12-07 09:13:27 -05:00
Nick Ethier
47df1dde10
Merge branch 'master' into f-grpc-executor
2018-12-06 21:42:38 -05:00
Nick Ethier
19a695308f
executor: fix tests
2018-12-06 21:39:53 -05:00
Nick Ethier
913efed9f5
executor: fix broken non-linux build
2018-12-06 21:33:20 -05:00
Nick Ethier
2283cb2c39
executor: use drivers.Resources as resource model
2018-12-06 21:22:02 -05:00
Nick Ethier
29ef54c0ee
executor: merge plugin shim with executor package
2018-12-06 21:13:45 -05:00
Nick Ethier
71353a88d4
executor: remove structs package
2018-12-06 20:54:14 -05:00
Alex Dadgar
c918a96490
Warn if IOPS is being used
2018-12-06 16:17:09 -08:00
Alex Dadgar
1e3c3cb287
Deprecate IOPS
...
IOPS have been modelled as a resource since Nomad 0.1 but has never
actually been detected and there is no plan in the short term to add
detection. This is because IOPS is a bit simplistic of a unit to define
the performance requirements from the underlying storage system. In its
current state it adds unnecessary confusion and can be removed without
impacting any users. This PR leaves IOPS defined at the jobspec parsing
level and in the api/ resources since these are the two public uses of
the field. These should be considered deprecated and only exist to allow
users to stop using them during the Nomad 0.9.x release. In the future,
there should be no expectation that the field will exist.
2018-12-06 15:09:26 -08:00
Danielle Tomlinson
8100252116
Merge pull request #4960 from hashicorp/dani/b-gc-tests
...
Re-enable Client GC tests
2018-12-06 23:18:36 +01:00
Mahmood Ali
a7b205daf2
Merge pull request #4955 from hashicorp/fix-docker-tests-20181203
...
Fix docker driver tests
2018-12-06 16:41:33 -05:00
Danielle Tomlinson
e3621c55fa
gc: Fix maxallocs integration test
2018-12-06 21:50:50 +01:00
Mahmood Ali
9e825f880c
Use absolute path in example device plugin
...
deviceDir is used for specifying mount/device host paths, and those
should be absolute paths.
2018-12-06 15:46:35 -05:00
Mahmood Ali
bdc53b1d8e
driver/rkt: mount plugin devices
2018-12-06 15:46:35 -05:00
Mahmood Ali
2c0fd2a902
driver/lxc: mount plugin devices
...
Also, LXC requires target paths to be relative. Container paths in LXC
binds should never be absolute paths, so we strip any preceeding `/`,
even if a user sets one.
2018-12-06 15:46:35 -05:00
Mahmood Ali
699875eb1c
fixup: add missed docker utils test
2018-12-06 15:46:35 -05:00
Mahmood Ali
e9557ae596
tests: ensure image is loaded as test setup
2018-12-06 15:36:43 -05:00
Alex Dadgar
c4b5f80918
Make alloc health watcher a postrun hook rather than shutdown hook
2018-12-06 12:30:31 -08:00
Michael Lange
81c2d8b4a2
Merge pull request #4967 from hashicorp/b-ui-stat-charts-can-escape-canvas
...
UI: Keep line charts in their canvases at all times
2018-12-06 10:56:37 -08:00
Danielle Tomlinson
62b98e64ca
client/gc: Replace GC integration test with unit
...
The previous integration test was broken during the client refactor, and
it seems to be some sort of race with state updating.
I'm going to try and construct a replacement test as part of work on
performance, but for now, the underlying behaviour is still being
tested.
2018-12-06 12:28:23 +01:00
Danielle Tomlinson
f6e474fd55
client: Re-enable GC tests
2018-12-06 12:28:23 +01:00
Danielle Tomlinson
d043532cb0
allocrunner: Basic test alloc runner
2018-12-06 12:28:23 +01:00
Michael Lange
795ea7eade
Grow the default 0 to 1 bounds to the domain of the data when necessary
2018-12-05 22:07:44 -08:00
Alex Dadgar
b18a0f77a2
Merge pull request #4966 from hashicorp/b-failure-event
...
Fix various bugs with task events
2018-12-05 14:43:50 -08:00
Alex Dadgar
b39c21d49c
Fix various bugs with task events
...
Fixes the following:
* Emitting events when the task fails to start
* Don't double emit events on task shutdown (nomad stop)
* Don't emit a OOM kill metric unless actually OOM'd
2018-12-05 14:27:07 -08:00
Alex Dadgar
14a61ea3ea
Don't GC running but desired stop allocations
...
This PR fixes an edge case where we could GC an allocation that was in a
desired stop state but had not terminated yet. This can be hit if the
client hasn't shutdown the allocation yet or if the allocation is still
shutting down (long kill_timeout).
Fixes https://github.com/hashicorp/nomad/issues/4940
2018-12-05 13:01:12 -08:00
Nick Ethier
57ffece7f8
executor: update test references
2018-12-05 11:07:48 -05:00
Nick Ethier
02f4b0fac5
executor: update driver references
2018-12-05 11:04:18 -05:00
Nick Ethier
8b20de4801
executor: use grpc instead of netrpc as plugin protocol
...
* Added protobuf spec for executor
* Seperated executor structs into their own package
2018-12-05 11:03:56 -05:00
Mahmood Ali
b55fb642f1
driver/docker: honor plugin devices
2018-12-04 21:31:28 -05:00
Mahmood Ali
a580cef986
refactor device manipulation
2018-12-04 20:55:59 -05:00
Mahmood Ali
3a18105d06
drivers/exec: refactor stop/kill tests
...
Simplify the tests to do all assertions within the main goroutine and
account for status propagation delay.
2018-12-04 20:34:43 -05:00
Mahmood Ali
adb4d69576
Merge pull request #4956 from hashicorp/b-vault-client-tweaks-followup
...
server/vault: Lock Vault expiration tracking
2018-12-04 19:46:59 -05:00
Mahmood Ali
366f478f8f
Merge pull request #4959 from hashicorp/fix-rkt-tests-20181204
...
tests: fix rkt tests
2018-12-04 19:46:41 -05:00