open-nomad

Author	SHA1	Message	Date
Seth Hoenig	113b7eb727	client: cgroups v2 code review followup	2022-03-24 13:40:42 -05:00
Seth Hoenig	2e5c6de820	client: enable support for cgroups v2 This PR introduces support for using Nomad on systems with cgroups v2 [1] enabled as the cgroups controller mounted on /sys/fs/cgroups. Newer Linux distros like Ubuntu 21.10 are shipping with cgroups v2 only, causing problems for Nomad users. Nomad mostly "just works" with cgroups v2 due to the indirection via libcontainer, but not so for managing cpuset cgroups. Before, Nomad has been making use of a feature in v1 where a PID could be a member of more than one cgroup. In v2 this is no longer possible, and so the logic around computing cpuset values must be modified. When Nomad detects v2, it manages cpuset values in-process, rather than making use of cgroup heirarchy inheritence via shared/reserved parents. Nomad will only activate the v2 logic when it detects cgroups2 is mounted at /sys/fs/cgroups. This means on systems running in hybrid mode with cgroups2 mounted at /sys/fs/cgroups/unified (as is typical) Nomad will continue to use the v1 logic, and should operate as before. Systems that do not support cgroups v2 are also not affected. When v2 is activated, Nomad will create a parent called nomad.slice (unless otherwise configured in Client conifg), and create cgroups for tasks using naming convention <allocID>-<task>.scope. These follow the naming convention set by systemd and also used by Docker when cgroups v2 is detected. Client nodes now export a new fingerprint attribute, unique.cgroups.version which will be set to 'v1' or 'v2' to indicate the cgroups regime in use by Nomad. The new cpuset management strategy fixes #11705, where docker tasks that spawned processes on startup would "leak". In cgroups v2, the PIDs are started in the cgroup they will always live in, and thus the cause of the leak is eliminated. [1] https://www.kernel.org/doc/html/latest/admin-guide/cgroup-v2.html Closes #11289 Fixes #11705 #11773 #11933	2022-03-23 11:35:27 -05:00
Dave May	3c04d7927b	cli: refactor operator debug capture (#11466 ) * debug: refactor Consul API collection * debug: refactor Vault API collection * debug: cleanup test timing * debug: extend test to multiregion * debug: save cmdline flags in bundle * debug: add cli version to output * Add changelog entry	2021-11-05 19:43:10 -04:00
Nick Ethier	155a2ca5fb	client/ar: thread through cpuset manager	2021-04-13 13:28:36 -04:00
Yoan Blanc	225c9c1215	fixup! vendor: explicit use of hashicorp/go-msgpack Signed-off-by: Yoan Blanc <yoan@dosimple.ch>	2020-03-31 09:48:07 -04:00
Yoan Blanc	761d014071	vendor: explicit use of hashicorp/go-msgpack Signed-off-by: Yoan Blanc <yoan@dosimple.ch>	2020-03-31 09:45:21 -04:00
Mahmood Ali	a8d6950007	Remove rkt as a built-in driver Rkt has been archived and is no longer an active project: * https://github.com/rkt/rkt * https://github.com/rkt/rkt/issues/4024 The rkt driver will continue to live as an external plugin.	2020-02-26 22:16:41 -05:00
Mahmood Ali	4b2ba62e35	acl: check ACL against object namespace Fix a bug where a millicious user can access or manipulate an alloc in a namespace they don't have access to. The allocation endpoints perform ACL checks against the request namespace, not the allocation namespace, and performs the allocation lookup independently from namespaces. Here, we check that the requested can access the alloc namespace regardless of the declared request namespace. Ideally, we'd enforce that the declared request namespace matches the actual allocation namespace. Unfortunately, we haven't documented alloc endpoints as namespaced functions; we suspect starting to enforce this will be very disruptive and inappropriate for a nomad point release. As such, we maintain current behavior that doesn't require passing the proper namespace in request. A future major release may start enforcing checking declared namespace.	2019-10-08 12:59:22 -04:00
Michael Schurter	59e0b67c7f	connect: task hook for bootstrapping envoy sidecar Fixes #6041 Unlike all other Consul operations, boostrapping requires Consul be available. This PR tries Consul 3 times with a backoff to account for the group services being asynchronously registered with Consul.	2019-08-22 08:15:32 -07:00
Mahmood Ali	33ff8c3e8d	tests: expect Docker on AppVeyor Prepare to run docker on AppVeyor Windows environment	2019-02-20 07:41:47 -05:00
Mahmood Ali	0ba7b0c132	tests: helper function for checking docker presense	2019-01-07 08:27:06 -05:00
Alex Dadgar	de98774f2c	Add test and docs	2018-05-31 18:05:03 -07:00
Michael Schurter	d687761ebf	rkt: test Stats() and always run tests Remove the NOMAD_TEST_RKT flag as a guard for rkt tests. Still require Linux, root, and rkt to be installed. Only check for rkt installation once in hopes of speeding up rkt tests a bit.	2018-04-24 11:05:42 -07:00
Michael Schurter	5032bf4f5a	Skip tests that require root when not root Also skip Chown on allocdir migration on Windows and when non-root. Windows doesn't support it, and it will always fail as a non-root user.	2017-12-12 16:58:27 -08:00
Alex Dadgar	99c81b5848	Skip if no docker	2017-10-19 16:55:10 -07:00
Alex Dadgar	c62cd5cc55	Revendor docker client	2017-02-14 17:34:05 -08:00
Diptanu Choudhury	e893e71e21	Moved the dockerIsConnected to testutils	2016-03-25 17:15:51 -07:00
Alex Dadgar	f210fcd1a6	Merge pull request #380 from hashicorp/f-daemonize Improve spawn-daemon and Nomad Client usage of it	2015-11-04 16:44:50 -08:00
Alex Dadgar	d83777f198	Make a basic executor that can be shared and fix some fingerprinting/tests	2015-11-03 12:47:48 -08:00
Alex Dadgar	2781cbbde1	Exec driver only applies on linux as root	2015-10-28 17:22:04 -07:00
Alex Dadgar	9f7dcc4ced	Use same binary as Fingerprint in the QemuCompatible function	2015-10-28 10:28:53 -07:00
Alex Dadgar	a5a1e45f4b	Get Qemu to fingerprint and test properly on both windows and linux	2015-10-27 15:27:11 -07:00
Abhishek Chanda	70293e9bc8	Run gofmt	2015-10-26 19:24:37 +00:00
Abhishek Chanda	6ecab13b5d	Cleanup tests - Consolidate checking if non-windows and if qemu is installed - Fix non-windows check	2015-10-23 14:19:22 -07:00
Abhishek Chanda	ba362fae07	Run gofmt	2015-10-07 22:24:16 +00:00
Abhishek Chanda	4be849445d	Fix function call Make it skip if rkt is not installed	2015-10-06 15:56:39 -07:00
Abhishek Chanda	528632da3d	Add missing import and remove unsued one	2015-10-06 15:56:39 -07:00
Abhishek Chanda	ab6d756dfe	Remove a stray comment	2015-10-06 15:56:39 -07:00
Abhishek Chanda	b6b7d9e875	Add a test fort he rkt driver	2015-10-06 15:56:39 -07:00
Alex Dadgar	3cea4288b9	Merge qemu test	2015-09-25 16:49:14 -07:00
Alex Dadgar	6725cbb3f5	Mount shared alloc dir, modified API and tests	2015-09-25 16:46:41 -07:00
Alex Dadgar	e095664c49	Guard tests	2015-09-22 17:10:03 -07:00

32 commits