open-nomad

Commit Graph

Author	SHA1	Message	Date
Charlie Voiselle	9dfe4aa7c0	Set RequireRoot to be a test helper. (#16641 )	2023-04-06 14:34:36 -04:00
Seth Hoenig	d11fe234e4	e2e: swap assert for test package in e2eutil/jobs.go (#16820 )	2023-04-06 10:02:27 -05:00
James Rasell	15eee99db4	client: ensure envoy version hook uses all pointer receiver funcs. (#16813 )	2023-04-06 14:47:00 +01:00
Tim Gross	09c19fa44a	E2E: test enforcement of ACL system (#16796 ) This changeset provides a matrix test of ACL enforcement across several dimensions: * anonymous vs bogus vs valid tokens * permitted vs not permitted by policy * request sent to server vs sent to client (and forwarded)	2023-04-06 09:11:20 -04:00
Tim Gross	e117ff3877	docs: remove reference to vSphere from CSI concepts docs (#16765 ) The vSphere plugin is exclusive to k8s because it relies on k8s-APIs (and crashes without them being present). Upstream unfortunately will not support Nomad, so we shouldn't refer to it in our concept docs here.	2023-04-05 15:20:24 -04:00
Tim Gross	602a2f70dc	agent: add top-level warning if mTLS is not configured (#16800 ) Nomad's security model requires mTLS in order to secure client-to-server and server-to-server communications. Configuring ACLs alone is not enough. Loudly warn the user if mTLS is not configured in non-dev modes.	2023-04-05 14:43:45 -04:00
Tim Gross	6f2b9266bc	Merge pull request #16794 from hashicorp/post-1.5.3-release Post 1.5.3 release	2023-04-05 13:02:37 -04:00
the-nando	f541f2e59b	Do not set attributes when spawning the getter child (#16791 ) * Do not set attributes when spawning the getter child * Cleanup * Cleanup --------- Co-authored-by: the-nando <the-nando@invalid.local>	2023-04-05 11:47:51 -05:00
Seth Hoenig	378c3582ff	deps: update go-m1cpu with fix for BigSur users (#16793 ) https://github.com/shoenig/go-m1cpu/releases/tag/v0.1.5 which may also help with the Nix package for Nomad, which makes use of the older IOKit framework version https://github.com/shoenig/go-m1cpu/issues/5	2023-04-05 11:44:31 -05:00
Tim Gross	c3d9c598f5	Merge release 1.5.3 files	2023-04-05 12:32:00 -04:00
hc-github-team-nomad-core	3578078caf	Prepare for next release	2023-04-05 12:31:42 -04:00
hc-github-team-nomad-core	b64ee2726d	Generate files for 1.5.3 release	2023-04-05 12:31:30 -04:00
Tim Gross	66a01bb35a	upgrade go to 1.20.3	2023-04-05 12:18:19 -04:00
Tim Gross	8278f23042	acl: fix ACL bypass for anon requests that pass thru client HTTP Requests without an ACL token that pass thru the client's HTTP API are treated as though they come from the client itself. This allows bypass of ACLs on RPC requests where ACL permissions are checked (like `Job.Register`). Invalid tokens are correctly rejected. Fix the bypass by only setting a client ID on the identity if we have a valid node secret. Note that this changeset will break rate metrics for RPCs sent by clients without a client secret such as `Node.GetClientAllocs`; these requests will be recorded as anonymous. Future work should: * Ensure the node secret is sent with all client-driven RPCs except `Node.Register` which is TOFU. * Create a new `acl.ACL` object from client requests so that we can enforce ACLs for all endpoints in a uniform way that's less error-prone.~	2023-04-05 12:17:51 -04:00
Juana De La Cuesta	9b4871fece	Prevent kill_timeout greater than progress_deadline (#16761 ) * func: add validation for kill timeout smaller than progress dealine * style: add changelog * style: typo in changelog * style: remove refactored test * Update .changelog/16761.txt Co-authored-by: James Rasell <jrasell@users.noreply.github.com> * Update nomad/structs/structs.go Co-authored-by: James Rasell <jrasell@users.noreply.github.com> --------- Co-authored-by: James Rasell <jrasell@users.noreply.github.com>	2023-04-04 18:17:10 +02:00
Seth Hoenig	15a2d912b3	cleanup: use jobID name rather than jobName in job endpoints (#16777 ) These endpoints all refer to JobID by the time you get to the RPC request layer, but the HTTP handler functions call the field JobName, which is a different field (... though often with the same value).	2023-04-04 09:11:58 -05:00
James Rasell	bcfb4ea1f2	cli: fix up failing quota inspect enterprise test. (#16781 )	2023-04-04 15:02:40 +01:00
James Rasell	cb6ba80f0f	cli: stream both stdout and stderr when following an alloc. (#16556 ) This update changes the behaviour when following logs from an allocation, so that both stdout and stderr files streamed when the operator supplies the follow flag. The previous behaviour is held when all other flags and situations are provided. Co-authored-by: Luiz Aoqui <luiz@hashicorp.com>	2023-04-04 10:42:27 +01:00
Mike Nomitch	b5a1051fe6	Merge pull request #16575 from hashicorp/docs-add-roadmap-project Adds public roadmap project to readme	2023-04-03 08:21:13 -07:00
Tim Gross	118b703164	CSI: set mounts in alloc hook resources atomically (#16722 ) The allocrunner has a facility for passing data written by allocrunner hooks to taskrunner hooks. Currently the only consumers of this facility are the allocrunner CSI hook (which writes data) and the taskrunner volume hook (which reads that same data). The allocrunner hook for CSI volumes doesn't set the alloc hook resources atomically. Instead, it gets the current resources and then writes a new version back. Because the CSI hook is currently the only writer and all readers happen long afterwards, this should be safe but #16623 shows there's some sequence of events during restore where this breaks down. Refactor hook resources so that hook data is accessed via setters and getters that hold the mutex.	2023-04-03 11:03:36 -04:00
Tim Gross	0c582a2c94	docs: fix use of gpg to avoid teeing binary to terminal (#16767 )	2023-04-03 10:54:21 -04:00
Tim Gross	ffd5435ceb	docs: fix install instructions for apt (#16764 ) The workflow described in the docs for apt installation is deprecated. Update to match the workflow described in the Tutorials and official packaging guide.	2023-04-03 10:06:59 -04:00
Georgy Buranov	ca80546ef7	take maximum processor Mhz (#16740 ) * take maximum processor Mhz * remove break * cl: add cl for 16740 --------- Co-authored-by: Seth Hoenig <shoenig@duck.com>	2023-03-31 11:25:32 -05:00
Juana De La Cuesta	89baa13b14	Update quota name on failing test for quota status (#16662 ) * fix: update quota name on test * Update quota_status_test.go * Update quota_status_test.go * fix: simplify template call for quota status	2023-03-31 18:07:21 +02:00
Juana De La Cuesta	1fc13b83d8	style: update documentation (#16729 )	2023-03-31 16:38:16 +02:00
Daniel Bennett	c9adc22eec	Update enterprise licensing documentation (#16615 ) updated various docs for new expiration behavior and new command `nomad license inspect` to validate pre-upgrade	2023-03-30 16:40:19 -05:00
Daniel Bennett	c42950e342	ent: move all license info into LicenseConfig{} (#16738 ) and add new TestConfigForServer() to get a valid nomad.Config to use in tests Co-authored-by: Luiz Aoqui <luiz@hashicorp.com>	2023-03-30 16:15:05 -05:00
Horacio Monsalvo	20372b1721	connect: add meta on ConsulSidecarService (#16705 ) Co-authored-by: Sol-Stiep <sol.stiep@southworks.com>	2023-03-30 16:09:28 -04:00
Luiz Aoqui	fa4ee68c6a	ci: use `BACKPORT_MERGE_COMMIT` option (#16730 ) Instead of attempting to pick each individual commit in a PR using `BACKPORT_MERGE_COMMIT` only picks the commit that was merged into `main`. This reduces the amount of work done during a backport, generating cleaner merges and avoiding potential issues on specific commits. With this setting PRs that are not squashed will fail to backport and must be handled manually, but those are considered exceptions.	2023-03-30 11:49:46 -04:00
Piotr Kazmierczak	1470d2ff62	Merge pull request #15897 from hashicorp/f-sso-jwt-auth-method acl: JWT as SSO auth method	2023-03-30 17:07:50 +02:00
Piotr Kazmierczak	1a5eba24a6	acl: set minACLJWTAuthMethodVersion to 1.5.3 and adjust code comment	2023-03-30 15:30:42 +02:00
Phil Renaud	e9a114e249	[ui] Web sign-in with JWT (#16625 ) * Bones of JWT detection * JWT to token pipeline complete * Some live-demo fixes for template language * findSelf and loginJWT funcs made async * Acceptance tests and mirage mocks for JWT login * [ui] Allow for multiple JWT auth methods in the UI (#16665) * Split selectable jwt methods * repositions the dropdown to be next to the input field	2023-03-30 09:40:12 +02:00
Piotr Kazmierczak	d98c8f6759	acl: rebased on main and changed the gate to 1.5.3-dev	2023-03-30 09:40:12 +02:00
Piotr Kazmierczak	acfc266c30	acl: JWT changelog entry and typo fix	2023-03-30 09:40:11 +02:00
Piotr Kazmierczak	4609119fb5	acl: JWT auth CLI (#16532 )	2023-03-30 09:39:56 +02:00
Piotr Kazmierczak	16b6bd9ff2	acl: fix canonicalization of JWT auth method mock (#16531 )	2023-03-30 09:39:56 +02:00
Piotr Kazmierczak	2b353902a1	acl: HTTP endpoints for JWT auth (#16519 )	2023-03-30 09:39:56 +02:00
Piotr Kazmierczak	e48c48e89b	acl: RPC endpoints for JWT auth (#15918 )	2023-03-30 09:39:56 +02:00
Piotr Kazmierczak	a9230fb0b7	acl: JWT auth method	2023-03-30 09:39:56 +02:00
Tim Gross	76284a09a0	docker: move pause container recovery to after `SetConfig` (#16713 ) When we added recovery of pause containers in #16352 we called the recovery function from the plugin factory function. But in our plugin setup protocol, a plugin isn't ready for use until we call `SetConfig`. This meant that recovering pause containers was always done with the default config. Setting up the Docker client only happens once, so setting the wrong config in the recovery function also means that all other Docker API calls will use the default config. Move the `recoveryPauseContainers` call into the `SetConfig`. Fix the error handling so that we return any error but also don't log when the context is canceled, which happens twice during normal startup as we fingerprint the driver.	2023-03-29 16:20:37 -04:00
dependabot[bot]	afa9608475	build(deps): bump github.com/opencontainers/runc from 1.1.4 to 1.1.5 (#16712 ) * build(deps): bump github.com/opencontainers/runc from 1.1.4 to 1.1.5 Bumps [github.com/opencontainers/runc](https://github.com/opencontainers/runc) from 1.1.4 to 1.1.5. - [Release notes](https://github.com/opencontainers/runc/releases) - [Changelog](https://github.com/opencontainers/runc/blob/v1.1.5/CHANGELOG.md) - [Commits](https://github.com/opencontainers/runc/compare/v1.1.4...v1.1.5) --- updated-dependencies: - dependency-name: github.com/opencontainers/runc dependency-type: direct:production ... Signed-off-by: dependabot[bot] <support@github.com> * changelog entry --------- Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> Co-authored-by: Tim Gross <tgross@hashicorp.com>	2023-03-29 15:05:05 -04:00
Juana De La Cuesta	dd770027df	fix: clean the output writter to avoid duplicates when testing for json output (#16619 )	2023-03-29 12:05:23 +02:00
Max Fröhlich	ba590b081e	docs: mention Nomad Admission Control Proxy (#16702 )	2023-03-28 15:18:26 -04:00
Tim Gross	f22ff2b847	docs: clarify capabilities options for `docker` driver (#16693 ) The `docker` driver cannot expand capabilities beyond the default set when the task is a non-root user. Clarify this in the documentation of `allow_caps` and update the `cap_add` and `cap_drop` to match the `exec` driver, which has more clear language overall.	2023-03-28 13:32:08 -04:00
Elvis Pranskevichus	11a9bb6ce7	drivers/exec: Fix handling of capabilities for unprivileged tasks (#16643 ) Currently, the `exec` driver is only setting the Bounding set, which is not sufficient to actually enable the requisite capabilities for the task process. In order for the capabilities to survive `execve` performed by libcontainer, the `Permitted`, `Inheritable`, and `Ambient` sets must also be set. Per CAPABILITIES (7): > Ambient: This is a set of capabilities that are preserved across an > execve(2) of a program that is not privileged. The ambient capability > set obeys the invariant that no capability can ever be ambient if it > is not both permitted and inheritable.	2023-03-28 12:12:55 -04:00
James Rasell	17fd1a2e35	dev: make cni, consul, dev, docker, and vault scripts Lima compat. (#16689 )	2023-03-28 16:21:14 +01:00
Seth Hoenig	87f4b71df0	client/fingerprint: correctly fingerprint E/P cores of Apple Silicon chips (#16672 ) * client/fingerprint: correctly fingerprint E/P cores of Apple Silicon chips This PR adds detection of asymetric core types (Power & Efficiency) (P/E) when running on M1/M2 Apple Silicon CPUs. This functionality is provided by shoenig/go-m1cpu which makes use of the Apple IOKit framework to read undocumented registers containing CPU performance data. Currently working on getting that functionality merged upstream into gopsutil, but gopsutil would still not support detecting P vs E cores like this PR does. Also refactors the CPUFingerprinter code to handle the mixed core types, now setting power vs efficiency cpu attributes. For now the scheduler is still unaware of mixed core types - on Apple platforms tasks cannot reserve cores anyway so it doesn't matter, but at least now the total CPU shares available will be correct. Future work should include adding support for detecting P/E cores on the latest and upcoming Intel chips, where computation of total cpu shares is currently incorrect. For that, we should also include updating the scheduler to be core-type aware, so that tasks of resources.cores on Linux platforms can be assigned the correct number of CPU shares for the core type(s) they have been assigned. node attributes before cpu.arch = arm64 cpu.modelname = Apple M2 Pro cpu.numcores = 12 cpu.reservablecores = 0 cpu.totalcompute = 1000 node attributes after cpu.arch = arm64 cpu.frequency.efficiency = 2424 cpu.frequency.power = 3504 cpu.modelname = Apple M2 Pro cpu.numcores.efficiency = 4 cpu.numcores.power = 8 cpu.reservablecores = 0 cpu.totalcompute = 37728 * fingerprint/cpu: follow up cr items	2023-03-28 08:27:58 -05:00
James Rasell	a18e480a57	dev: modify Go install to support arch64 and non-vagrant machines. (#16651 )	2023-03-28 14:18:48 +01:00
Tim Gross	78acc75b57	docs: add notes about keyring to snapshot restore (#16663 ) When cluster administrators restore from Raft snapshot, they also need to ensure the keyring is in place. For on-prem users doing in-place upgrades this is less of a concern but for typical cloud workflows where the whole host is replaced, it's an important warning (at least until #14852 has been implemented).	2023-03-28 08:31:01 -04:00
Tim Gross	a953456460	docs: fix template retry attempts default documentation (#16667 ) The configuration docs for `client.template.vault_retry`, `consul_retry`, and `nomad_retry` incorrectly document the default number of attempts to be unlimited (0). When we added these config blocks, we defaulted the fields to `nil` for backwards compatibility, which causes them to fall back to the default consul-template configuration values.	2023-03-28 08:27:06 -04:00

1 2 3 4 5 ...

24599 Commits All Branches Search

24599 Commits

All Branches