open-nomad/drivers
Mahmood Ali aa59280edc docker: periodically reconcile containers
When running at scale, it's possible that Docker Engine starts
containers successfully but gets wedged in a way where API call fails.
The Docker Engine may remain unavailable for arbitrary long time.

Here, we introduce a periodic reconcilation process that ensures that any
container started by nomad is tracked, and killed if is running
unexpectedly.

Basically, the periodic job inspects any container that isn't tracked in
its handlers.  A creation grace period is used to prevent killing newly
created containers that aren't registered yet.

Also, we aim to avoid killing unrelated containters started by host or
through raw_exec drivers.  The logic is to pattern against containers
environment variables and mounts to infer if they are an alloc docker
container.

Lastly, the periodic job can be disabled to avoid any interference if
need be.
2019-10-17 08:36:01 -04:00
..
docker docker: periodically reconcile containers 2019-10-17 08:36:01 -04:00
exec driver: set correct network isolation caps for exec and java dr… (#6368) 2019-09-25 11:48:14 -04:00
java driver/java: pass task network isolation to executor 2019-09-27 08:26:54 -04:00
mock ar: initial driver based network management 2019-07-31 01:03:17 -04:00
qemu fix qemu and update docker with tests 2019-09-04 11:27:51 -04:00
rawexec tests: enable raw_exec driver 2019-08-29 20:26:50 -04:00
rkt plugins/driver: make DriverNetworkManager interface optional 2019-07-31 01:03:19 -04:00
shared volumes: Add support for mount propagation 2019-10-14 14:09:58 +02:00