open-nomad

Author	SHA1	Message	Date
Kevin Schoonover	5d9a506bc0	agent: support multiple http address in addresses.http (#11582 )	2022-01-03 09:33:53 -05:00
Tim Gross	395628efe1	api: paginate deployment list and accept wildcard namespace (#11743 ) Add `per_page` and `next_token` handling to `Deployment.List` RPC, and allow the use of a wildcard namespace for namespace filtering.	2022-01-03 08:36:02 -05:00
Jeff Escalante	60e7a186e7	add enterprise downloads page (#11750 )	2021-12-25 14:42:12 -05:00
Noel Quiles	e748508e67	website: Upgrade deps (#11709 ) * Update @hashicorp/react-subnav * Update <Subnav /> & <ProductDownloadsPage />	2021-12-23 16:18:57 -05:00
Tim Gross	265e488ab4	task runner: fix goroutine leak in prestart hook (#11741 ) The task runner prestart hooks take a `joincontext` so they have the option to exit early if either of two contexts are canceled: from killing the task or client shutdown. Some tasks exit without being shutdown from the server, so neither of the joined contexts ever gets canceled and we leak the `joincontext` (48 bytes) and its internal goroutine. This primarily impacts batch jobs and any task that fails or completes early such as non-sidecar prestart lifecycle tasks. Cancel the `joincontext` after the prestart call exits to fix the leak.	2021-12-23 11:50:51 -05:00
Tim Gross	430d94b81d	deps: upgrade go-getter to 1.5.10 (#11740 ) The `go-getter` library was updated to 1.5.9 in #11481 to pick up a bug fix for automatically unpacking uncompressed tar archives. But this version had a regression in git `ref` param behavior and was patched in 1.5.10.	2021-12-23 10:37:52 -05:00
Luiz Aoqui	4bdd2c84e3	fix host network reserved port fingerprint (#11728 )	2021-12-22 15:29:54 -05:00
Tim Gross	2f6b9b0772	scheduler: tooling for scheduler benchmarking (#11725 ) Adds a package `scheduler/benchmarks` with some examples of profiling and benchmarking the scheduler, along with helpers for loading real-world data for profiling. This tooling comes out of work done for #11712. These test benchmarks have not been added to CI because these particular profiles are mostly examples and the runs will add an excessive amount of time to CI runs for code that rarely changes in a way that has any chance of impacting performance.	2021-12-22 10:05:44 -05:00
Alex Carpenter	d1b577330a	Merge pull request #11669 from hashicorp/ac.home-redirect fix: redirects website `/home` to `/`	2021-12-22 09:43:05 -05:00
Luiz Aoqui	40093f97cd	api: support namespace wildcard in CSI volume list (#11724 )	2021-12-21 17:19:45 -05:00
Shishir	65eab35412	Add support for setting pids_limit in docker plugin config. (#11526 )	2021-12-21 13:31:34 -05:00
Tim Gross	b0c3b99b03	scheduler: fix quadratic performance with spread blocks (#11712 ) When the scheduler picks a node for each evaluation, the `LimitIterator` provides at most 2 eligible nodes for the `MaxScoreIterator` to choose from. This keeps scheduling fast while producing acceptable results because the results are binpacked. Jobs with a `spread` block (or node affinity) remove this limit in order to produce correct spread scoring. This means that every allocation within a job with a `spread` block is evaluated against _all_ eligible nodes. Operators of large clusters have reported that jobs with `spread` blocks that are eligible on a large number of nodes can take longer than the nack timeout to evaluate (60s). Typical evaluations are processed in milliseconds. In practice, it's not necessary to evaluate every eligible node for every allocation on large clusters, because the `RandomIterator` at the base of the scheduler stack produces enough variation in each pass that the likelihood of an uneven spread is negligible. Note that feasibility is checked before the limit, so this only impacts the number of _eligible_ nodes available for scoring, not the total number of nodes. This changeset sets the iterator limit for "large" `spread` block and node affinity jobs to be equal to the number of desired allocations. This brings an example problematic job evaluation down from ~3min to ~10s. The included tests ensure that we have acceptable spread results across a variety of large cluster topologies.	2021-12-21 10:10:01 -05:00
Andy Assareh	8ba4e063e2	Mesh Gateway doc enhancements (#11354 ) * Mesh Gateway doc enhancements 1. I believe this line should be corrected to add mesh as one of the choices 2. I found that we are not setting this meta, and it is a required element for wan federation. I believe it would be helpful and potentially time saving to note that right here.	2021-12-20 17:10:44 -05:00
Guilherme	ae05515b50	Fix 'check calculations' link (#11420 )	2021-12-20 17:09:15 -05:00
Luiz Aoqui	3d3b5a2c8e	changelog: add entries for #11555 , #11557 , and #11687 (#11706 )	2021-12-20 13:45:20 -05:00
Tim Gross	e046bb31e9	api: respect wildcard in evaluations list API (#11710 )	2021-12-20 12:23:50 -05:00
Jai	90cdc7b47a	Merge pull request #11578 from hashicorp/f-ui/clickable-links-allocs clickable links in allocations chart	2021-12-20 10:08:01 -05:00
James Rasell	45f4689f9c	chore: fixup inconsistent method receiver names. (#11704 )	2021-12-20 11:44:21 +01:00
Jai	93d5ef596f	Merge pull request #11545 from hashicorp/f-ui/add-alloc-filters-on-table Add Allocation Filters in Client View	2021-12-18 09:39:53 -05:00
Jai	2b7fb2c5bd	Merge pull request #11544 from hashicorp/f-ui/add-filters-to-allocs Add filters to Allocations	2021-12-18 09:38:28 -05:00
Luiz Aoqui	dad4a0984b	ui: fix action call to set filter query param	2021-12-17 20:41:53 -05:00
Luiz Aoqui	1c7c1f3222	ui: fix volume serializer tests	2021-12-17 20:23:28 -05:00
Luiz Aoqui	5054a6d97f	ui: fix allocation serializer tests	2021-12-17 20:02:59 -05:00
Luiz Aoqui	c7dafba66e	ui: fix job allocation filter by status, remove version filter, and add tests	2021-12-17 19:50:43 -05:00
Luiz Aoqui	fdc023019e	ui: fix file formating	2021-12-17 19:47:25 -05:00
Luiz Aoqui	e067b3d75f	changelog: fix entry for #11544	2021-12-17 18:57:54 -05:00
Luiz Aoqui	e6a202b408	ui: fix linting	2021-12-17 18:55:41 -05:00
Luiz Aoqui	a1c4536523	changelog: add entry for #11545	2021-12-17 18:49:56 -05:00
Luiz Aoqui	dbb47b5bc9	ui: fix task group alloc filter and add tests	2021-12-17 18:49:47 -05:00
Luiz Aoqui	1ae40fcc37	ui: display empty message in the client details page if there are no allocations to show	2021-12-17 18:49:47 -05:00
Luiz Aoqui	660f4a8b1e	ui: fix client details page alloc status filter and replace task group with namespace and job	2021-12-17 18:49:42 -05:00
Jai	416a40bbcc	fix: more descriptive parameters in sort function Co-authored-by: Luiz Aoqui <luiz@hashicorp.com>	2021-12-17 09:46:29 -05:00
Jai	e813f27b7b	fix: remove eslint disable indent	2021-12-17 09:36:42 -05:00
Michael Schurter	434832d2c8	Merge pull request #11697 from hashicorp/f-raft-state-err cli: return error from raft commands if db is open	2021-12-16 14:18:15 -08:00
Michael Schurter	4949488217	cli: return error from raft commands if db is open Before this change trying to run `nomad operator raft {info,logs}` on an inuse raft.db would cause the command to block until the agent using raft.db is closed. After this change the command will block for 1s before returning a (hopefully) helpful error message. This change also sets the ReadOnly mode on the underlying BoltDb to ensure diagnostics make no changes to the underlying store. We have no evidence this has ever occurred, but it seems like a useful safety measure. No changelog added since this is a minor tweak in a "new" feature (it was hidden in previous relases).	2021-12-16 11:41:01 -08:00
Luiz Aoqui	a46d799f2a	docs: add v1.2.0 upgrade guide about Nomad UI ACL change for job details page (#11689 )	2021-12-16 14:32:20 -05:00
Luiz Aoqui	4b39494cd1	docs: add more references and examples to the `template` block (#11691 )	2021-12-16 14:14:01 -05:00
Noel Quiles	3759dd09f1	website: Disable alert banner (#11688 )	2021-12-16 13:43:47 -05:00
Tim Gross	c7cc3cf4dc	cli: stream raft logs to operator raft logs subcommand (#11684 ) The `nomad operator raft logs` command uses a raft helper that reads in the logs from raft and serializes them to JSON. The previous implementation returned the slice of all logs and then serializes the entire object. Update the helper to stream the log entries and then serialize them as newline-delimited JSON.	2021-12-16 13:38:58 -05:00
Jai Bhagat	9d9042f95e	feat: add sliceClick to job-page/summary	2021-12-16 11:24:03 -05:00
Jai Bhagat	d508c10d8d	chore: prettify job-page/summary	2021-12-16 11:23:05 -05:00
Tim Gross	f2615992a4	cli: unhide advanced operator raft debugging commands (#11682 ) The `nomad operator raft` and `nomad operator snapshot state` subcommands for inspecting on-disk raft state were hidden and undocumented. Expose and document these so that advanced operators have support for these tools.	2021-12-16 10:32:11 -05:00
Tim Gross	536e3c5282	`nomad eval list` command (#11675 ) Use the new filtering and pagination capabilities of the `Eval.List` RPC to provide filtering and pagination at the command line. Also includes note that `nomad eval status -json` is deprecated and will be replaced with a single evaluation view in a future version of Nomad.	2021-12-15 11:58:38 -05:00
Tim Gross	f8a133a810	cli: ensure `-stale` flag is respected by `nomad operator debug` (#11678 ) When a cluster doesn't have a leader, the `nomad operator debug` command can safely use stale queries to gracefully degrade the consistency of almost all its queries. The query parameter for these API calls was not being set by the command. Some `api` package queries do not include `QueryOptions` because they target a specific agent, but they can potentially be forwarded to other agents. If there is no leader, these forwarded queries will fail. Provide methods to call these APIs with `QueryOptions`.	2021-12-15 10:44:03 -05:00
Luiz Aoqui	05bb65779c	api: return error when `LicenseGet` status is not `200` (#11644 )	2021-12-14 19:47:09 -05:00
Noel Quiles	235a778a56	website: Copy updates (#11677 )	2021-12-14 16:35:21 -05:00
Noel Quiles	2cd9fc5825	website: Update website Docker image (#11667 )	2021-12-13 16:40:46 -05:00
Kevin Wang	a62362966c	feat: versioned docs (#11407 )	2021-12-13 16:21:57 -05:00
Tim Gross	a0cf5db797	provide `-no-shutdown-delay` flag for job/alloc stop (#11596 ) Some operators use very long group/task `shutdown_delay` settings to safely drain network connections to their workloads after service deregistration. But during incident response, they may want to cause that drain to be skipped so they can quickly shed load. Provide a `-no-shutdown-delay` flag on the `nomad alloc stop` and `nomad job stop` commands that bypasses the delay. This sets a new desired transition state on the affected allocations that the allocation/task runner will identify during pre-kill on the client. Note (as documented here) that using this flag will almost always result in failed inbound network connections for workloads as the tasks will exit before clients receive updated service discovery information and won't be gracefully drained.	2021-12-13 14:54:53 -05:00
Alex Carpenter	37cb8ffc1a	fix: redirects /home to /	2021-12-13 14:28:56 -05:00

1 2 3 4 5 ...

22166 commits