open-nomad

Author	SHA1	Message	Date
Seth Hoenig	d7aa37a5c9	e2e: explicitly wait on task status in chroot download exec test (#15145 ) Also add some debug log lines for this test, because it doesn't make sense for the allocation to be complete yet a task in the allocation to be not started yet, which is what the test failures are implying.	2022-11-04 09:50:11 -05:00
Michael Schurter	9cac60dbed	test: use port collision instead of cpu exhaustion (#14994 ) Originally this test relied on Job 1 blocking Job 2 until Job 1 had a terminal ClientStatus. Job 2 ensured it would get blocked using 2 mechanisms: 1. A constraint requiring it is placed on the same node as Job 1. 2. Job 2 would require all unreserved CPU on the node to ensure it would be blocked until Job 1's resources were free. That 2nd assertion breaks if any previous job is still running on the target node! That seems very likely to happen in the flaky world of our e2e tests. In fact there may be some jobs we intentionally want running throughout; in hindsight it was never safe to assume my test would be the only thing scheduled when it ran. Ports to the rescue! Reserving a static port means that both Job 2 will now block on Job 1 being terminal. It will only conflict with other tests if those tests use that port on every node. I ensured no existing tests were using the port I chose. Other changes: - Gave job a bit more breathing room resource-wise. - Tightened timings a bit since previous failure ran into the `go test` time limit. - Cleaned up the DumpEvals output. It's quite nice and handy now!	2022-10-21 07:53:26 -07:00
Michael Schurter	21eced0a4e	test: extend timing and output of overlap e2e test (#14894 ) Keeps failing in the nightly e2e test with unhelpful output like: ``` Failed === RUN TestOverlap overlap_test.go:92: Followup job overlap93ee1d2b blocked. Sleeping for the rest of overlap48c26c39's shutdown_delay (9.2/10s) overlap_test.go:105: 1500/2000 retries reached for github.com/hashicorp/nomad/e2e/overlap.TestOverlap (err=timed out before an allocation was found for overlap93ee1d2b) overlap_test.go:105: timeout: timed out before an allocation was found for overlap93ee1d2b --- FAIL: TestOverlap (38.96s) ``` I have not been able to replicate it in my own e2e cluster, so I added the EvalDump helper to add detailed eval information like: ``` === RUN TestOverlap 1/1 Job overlap7b0e90ec Eval c38c9919-a4f0-5baf-45f7-0702383c682a Type: service TriggeredBy: job-register Deployment: Status: pending () NextEval: PrevEval: BlockedEval: -- No placement failures -- QueuedAllocs: SnapshotIdx: 0 CreateIndex: 96 ModifyIndex: 96 ... ``` Hopefully helpful when debugging other tests as well!	2022-10-14 14:15:07 -07:00
Piotr Kazmierczak	b63944b5c1	cleanup: replace TypeToPtr helper methods with pointer.Of (#14151 ) Bumping compile time requirement to go 1.18 allows us to simplify our pointer helper methods.	2022-08-17 18:26:34 +02:00
Seth Hoenig	3371214431	core: implement system batch scheduler This PR implements a new "System Batch" scheduler type. Jobs can make use of this new scheduler by setting their type to 'sysbatch'. Like the name implies, sysbatch can be thought of as a hybrid between system and batch jobs - it is for running short lived jobs intended to run on every compatible node in the cluster. As with batch jobs, sysbatch jobs can also be periodic and/or parameterized dispatch jobs. A sysbatch job is considered complete when it has been run on all compatible nodes until reaching a terminal state (success or failed on retries). Feasibility and preemption are governed the same as with system jobs. In this PR, the update stanza is not yet supported. The update stanza is sill limited in functionality for the underlying system scheduler, and is not useful yet for sysbatch jobs. Further work in #4740 will improve support for the update stanza and deployments. Closes #2527	2021-08-03 10:30:47 -04:00
Seth Hoenig	7f1191111d	e2e: add tests for consul namespaces from nomad oss This PR adds a set of tests to the Consul test suite for testing Nomad OSS's behavior around setting Consul Namespace on groups, which is to ignore the setting (as Consul Namespaces are currently an Enterprise feature). Tests are generally a reduced facsimile of existing tests, modified to check behavior of when group.consul.namespace is set and not set. Verification is oriented around what happens in Consul; the in-depth functional correctness of these features is left to the original tests. Nomad ENT will get its own version of these tests in `namespaces_ent.go`.	2021-04-16 15:32:37 -06:00
Chris Baker	ce68ee164b	Version 1.0.3 -----BEGIN PGP SIGNATURE----- Version: GnuPG v1 iQEcBAABAgAGBQJgEuOKAAoJEFGFLYc0j/xMxF8H/3TTU6Tu+Xm0YvcsDaYDphZ/ X7KQBV0aFiuL5VkTw4PzKEsgryIy9/sqEPyxxyKRowAmos9qhiusjNAIfqdP4TF8 tdZmTedkfWir9uPD+hyv/LXpwbQ2T8kTwS3xHTYvaOmaCxZr710FEn+imnMk1AUn Xs5itkd/CYGr0nBLm+I5GutWSDPmL7Uw8J5Z30fFyoaxoCPAbCWQQNk793SCRUc5 f/uo18V2tFInmQ+3sAdnM4gPewyStK/a5VvzWavL9fVDtYK83wlqWSchTXY5jpVz zNEzt/rYhbBzakPQQKb5zieblh2iGI8aHWpD5w4WduqO2Sg6B/5lAeNZIlW0UJg= =2g3c -----END PGP SIGNATURE----- Merge tag 'v1.0.3' into post-release-1.0.3 Version 1.0.3	2021-01-29 19:30:08 +00:00
Chris Baker	aa55df0413	additional e2e utils for multi-task allocs	2021-01-28 12:03:19 +00:00
Mahmood Ali	94ad40907c	e2e: prefer testutil.WaitForResultRetries Prefer testutil.WaitForResultRetries that emits more descriptive errors on failures. `require.Evatually` fails with opaque "Condition never satisfied" error message.	2021-01-26 10:01:14 -05:00
Seth Hoenig	536747f216	e2e: use jobspec2 Parse for parsing jobfile in e2e utils We directly parse job files in e2eutil, but currently using jobspec package. Instead, use the Parse method from the jobspec2 package so we can parse job files with new features.	2021-01-13 14:00:40 -06:00
Michael Schurter	66bc07d01a	test: deflake consul e2e tests Modernize test patterns by removing gomega and avoiding the mock_driver.	2020-08-19 14:29:22 -07:00
Seth Hoenig	a9991e9ab9	e2e: add tests for connect native Adds 2 tests around Connect Native. Both make use of the example connect native services in https://github.com/hashicorp/nomad-connect-examples One of them runs without Consul ACLs enabled, the other with.	2020-07-01 15:54:28 -05:00
Tim Gross	73dc2ad443	e2e/csi: add waiting for alloc stop	2020-04-06 10:15:55 -04:00
Tim Gross	d81797ea33	e2e: improve test reliability for CSI (#7616 ) This changeset: * adds eval status to the error messages emitted when we have placement failure in tests. The implementation here isn't quite perfect but it's a lot better than "condition not met". * enforces the ordering of teardown of the CSI test * doesn't pass the purge flag to one of the two CSI tests, so that we exercise both code paths.	2020-04-03 15:52:58 -04:00
Tim Gross	4c51687cbf	e2e: remove gometa from e2eutils (#7610 )	2020-04-03 10:22:22 -04:00
Drew Bailey	7bee040e61	simplify job, better error	2020-02-04 13:59:39 -05:00
Drew Bailey	a716d57ad7	clean up	2020-02-04 11:59:28 -05:00
Drew Bailey	75053a0d10	get test passing, new util func to wait for not pending	2020-02-04 11:56:37 -05:00
Drew Bailey	5117a22c30	add e2e test for system sched ineligible nodes	2020-02-04 11:56:33 -05:00
Seth Hoenig	057179edea	e2e: remove leftover debug println statement	2020-02-03 11:15:38 -06:00
Seth Hoenig	9b20ca5b25	e2e: setup consul ACLs a little more correctly	2020-01-31 19:06:11 -06:00
Tim Gross	55ee7a220b	e2e: fixes for race conditions in testing (#6300 ) - In script checks, ensure we're running `Exec` against the new running allocation and not the earlier stopped one. - In script checks, allow `Exec` calls to error due to lack of pty when we use the exec to kill the task. - In `utils.go/RegisterAllocs`, force query for allocations to wait on wait index returned by registration call.	2019-09-10 13:45:16 -04:00
Lang Martin	071dccfcce	e2e/deployment DeploymentsForJob fail instead of nil, error passing	2019-06-04 14:31:42 -04:00
Lang Martin	1635fa3c00	e2e/deployment find the second deployment, use its status	2019-06-04 13:41:52 -04:00
Lang Martin	fe69f89476	e2e add deployment to the list of e2e tests, minor fixes	2019-05-22 12:34:57 -04:00
Lang Martin	97fd114535	e2e utils remove ineffectual assignment of allocs	2019-05-22 12:34:57 -04:00
Lang Martin	824d1366dd	e2e utils error format arg match	2019-05-22 12:32:08 -04:00
Lang Martin	d73606e54e	e2e util split new alloc and await placement, new WaitForDeployment	2019-05-22 12:32:08 -04:00
Michael Schurter	cd87afd15f	e2e: add NomadAgent and basic client state test The e2e test code is absolutely hideous and leaks processes and files on disk. NomadAgent seems useful, but the clientstate e2e tests are very messy and slow. The last test "Corrupt" is probably the most useful as it explicitly corrupts the state file whereas the other tests attempt to reproduce steps thought to cause corruption in earlier releases of Nomad.	2019-03-21 07:14:34 -07:00
Michael Schurter	ce4a828fd1	Apply suggestions from code review Co-Authored-By: nickethier <ncethier@gmail.com>	2019-01-23 14:09:49 -05:00
Nick Ethier	c30965eaf9	e2e: add tests for nomad driver upgrade path	2019-01-17 23:32:45 -05:00
Michael Schurter	b09c68ceaf	e2e: wait for at least N nodes to be ready Before it was exactly N nodes which limited test portability between clusters.	2019-01-08 14:39:37 -08:00
Mahmood Ali	606ab23235	goimport file	2019-01-04 08:53:50 -05:00
Preetha Appan	378dd74d2a	Added waiting on client node ready state before running e2e tests	2019-01-03 16:16:20 -06:00
Preetha Appan	f458cb63dd	Increase alloc wait timeout in e2e test	2019-01-03 14:02:02 -06:00
Preetha Appan	d182c0f5cd	Increase timeout in e2e test	2019-01-03 11:22:21 -06:00
Danielle Tomlinson	d3b41a26c4	e2e: goimports e2eutil/utils.go	2019-01-03 13:31:49 +01:00
Preetha Appan	1bebce3525	new e2e test for spread, and refactor affinity tests to share util methods	2018-12-19 21:25:32 -06:00

38 commits