open-nomad

Author	SHA1	Message	Date
Tim Gross	99c2a2df00	e2e: reduce risk of flaky Ubuntu AMI build (#9207 ) The base Ubuntu AMI modifies apt sources during cloud-init. But the Packer build can potentially start the setup script before that work is done, resulting in errors trying to install base system dependencies like `dnsmasq`. Delay the setup long enough to lose the race with cloud-init.	2020-10-28 15:13:44 -04:00
Tim Gross	7e4a35ad7e	e2e: use more specific names for OS/distros (#9204 ) We intend to expand the nightly E2E test to cover multiple distros and platforms. Change the naming structure for "Linux client" to the more precise "Ubuntu Bionic", and "Windows" to "Windows 2016" to make it easier to add new targets without additional refactoring.	2020-10-28 12:58:00 -04:00
Tim Gross	be3f54d296	e2e: make dev cluster the default Terraform vars file (#9202 ) Most of the time that a human is running the TF provisioning, they want the "dev cluster" which is going to deploy an OSS sha, with fewer targets and configuration alternatives. But the default `terraform.tfvars` is the nightly E2E run. Because the nightly run is automated, there's no reason we can't have it pick a non-default `terraform.full.tfvars` file and have the default be the dev cluster.	2020-10-28 10:01:42 -04:00
Tim Gross	4fe1edfd63	Revert "e2e: fix destination of templates in VaultSecrets test (#9146 )" (#9163 ) This reverts commit 8aed53c177aea024d4f24d1fbb4d6e0881f04eab.	2020-10-23 09:01:25 -04:00
Tim Gross	1fb1c9c5d4	artifact/template: make destination path absolute inside taskdir (#9149 ) Prior to Nomad 0.12.5, you could use `${NOMAD_SECRETS_DIR}/mysecret.txt` as the `artifact.destination` and `template.destination` because we would always append the destination to the task working directory. In the recent security patch we treated the `destination` absolute path as valid if it didn't escape the working directory, but this breaks backwards compatibility and interpolation of `destination` fields. This changeset partially reverts the behavior so that we always append the destination, but we also perform the escape check on that new destination after interpolation so the security hole is closed. Also, ConsulTemplate test should exercise interpolation	2020-10-22 15:47:49 -04:00
Tim Gross	344e821ace	e2e: fix destination of templates in VaultSecrets test (#9146 ) The `$NOMAD_SECRETS_DIR` environment variable is rendered as `/secrets`, which prior to the recent security patch would unintentionally escape the file sandbox and get dropped in a directory named `/secrets` where the Nomad client binary was running. The `VaultSecrets` test was accidentally relying on this behavior and that causes the test to fail.	2020-10-22 13:00:08 -04:00
Tim Gross	9fa38bac98	e2e: path fixes for local_binary uploads (#9137 ) When uploading a local binary for provisioning, the location that we pass into the provisioning script needs to be where we uploaded it to, not the source on our laptop. Also, the null_resource for uploading needs to read in the private key, not its path.	2020-10-21 10:20:22 -04:00
Drew Bailey	8451de99b2	adds two base event stream e2e tests (#9126 ) * adds two base event stream e2e tests test evaluation filter keys are included * Apply suggestions from code review Co-authored-by: Tim Gross <tgross@hashicorp.com> * gc aftereach Co-authored-by: Tim Gross <tgross@hashicorp.com>	2020-10-20 08:26:21 -04:00
Tim Gross	8fcdbe0592	e2e: add reporting to flaky spread test (#9115 ) The spread test is infrequently flaky and it's hard to extract what's actually happening. If the test fails, dump all the allocation metrics so that we can debug the behavior.	2020-10-16 11:01:07 -04:00
Tim Gross	54d7f57662	e2e: fix flaky TaskEventsTest (#9114 ) Assert that we get at least N task events, rather than exactly N. When a task within an allocation dies, a sibling task can get an Allocation Unhealthy event after it's also killed, even though it's not the origin of the event.	2020-10-16 10:22:40 -04:00
Tim Gross	e0ff06be2f	e2e: networking test job needs to outlast assert (#9113 ) The `e2ejob` utility asserts that a job is running for 5s, but with a sleep time of 5s, the networking job can race with that check. Sleeping for a longer period should guarantee that we're running long enough to pass the assert. Also constrains the job to Linux because our Windows test targets don't yet support Docker (LCOW), and expand the set of DCs we can safely land on.	2020-10-16 10:13:16 -04:00
Chris Baker	0a85d2bd24	Merge pull request #9089 from hashicorp/b-explicit-rune fix go 1.15 pickiness	2020-10-14 10:37:36 -05:00
Tim Gross	fe88003f29	e2e: eliminate race condition causing rescheduling test flake (#9085 ) The autorevert test checks for reverted allocations to be placed and running before checking the deployment status, but the deployment can be completed and marked "successful" before we check it for "running" status. Instead, just wait for it to be marked "successful" and assert we have the expected count of deployment statuses.	2020-10-14 11:35:30 -04:00
Tim Gross	76f1f5e5df	e2e: use AMI filter for Ubuntu packer image (#9086 ) Instead of hard-coding the base AMI for our Packer image for Ubuntu, use the latest from Canonical so that we always have their current kernel patches.	2020-10-14 11:22:33 -04:00
Chris Baker	d4bae840b2	fix go 1.15 pickiness	2020-10-14 15:19:54 +00:00
Nick Ethier	f5250499b9	e2e/networking: use correct dc (#9088 )	2020-10-14 11:14:09 -04:00
Tim Gross	115edb53a0	e2e: add flag to opt-in to creating EBS/EFS volumes (#9082 ) For everyday developer use, we don't need volumes for testing CSI. Providing a flag to opt-in speeds up deploying dev clusters and slightly reduces infra costs. Skip CSI test if missing volume specs.	2020-10-14 10:29:33 -04:00
Tim Gross	65282a7cf1	E2E: vault secrets (#9081 ) * rename vault API compatibility test for clarity * exercise vault secrets lease renewal	2020-10-14 08:43:28 -04:00
Nick Ethier	d45be0b5a6	client: add NetworkStatus to Allocation (#8657 )	2020-10-12 13:43:04 -04:00
Yoan Blanc	891accb89a	use allow/deny instead of the colored alternatives (#9019 ) Signed-off-by: Yoan Blanc <yoan@dosimple.ch>	2020-10-12 08:47:05 -04:00
Tim Gross	474c18102d	e2e: extend ConsulTemplate test and fix flakiness (#8997 ) Add service discovery integration to the existing consul-template E2E test, and verify both service and key updates force re-rendering. Fixes flakiness by using the longer default wait config we use elsewhere. Removes our last direct dependency on gomega.	2020-10-05 10:51:55 -04:00
Tim Gross	727277793b	e2e: bootstrap vault and provision Nomad with vault tokens (#9010 ) Provisions vault with the policies described in the Nomad Vault integration guide, and drops a configuration file for Nomad vault server configuration with its token. The vault root token is exposed to the E2E runner so that tests can write additional policies to vault.	2020-10-05 09:28:37 -04:00
Tim Gross	b6292528fe	e2e: tfvars.dev file must override default tfvars file (#9005 ) The `-var-file` flag for loading variables into Terraform overlays the default variables file if present. This means that variables that are set in the default variables file will take precedence if the overlay file does not have them set. Set `nomad_acls` and `nomad_enteprise` to `false` in the dev cluster.	2020-10-02 08:02:37 -04:00
Tim Gross	4bab91b81b	e2e: ensure tests are constrained to Linux (#8990 ) Until we have LCOW support in the E2E environment (which requires a Windows 2019 test target), we need to constrain E2E tests to the appropriate kernel	2020-09-30 09:43:30 -04:00
Tim Gross	e49410e97b	e2e: cleanup errors should use assert, not require (#8989 ) The E2E framework wraps testify's `require` so that by default we can stop tests on errors, but the cleanup functions should use `assert` so that we continue to try to cleanup the test environment even if there's a failure.	2020-09-30 09:00:37 -04:00
Tim Gross	fa1fa623f2	e2e: rework rescheduling progress deadline test (#8958 ) Eliminate sources of randomness in the progress deadline test and clarify the purpose of the test to check for progress deadline updates.	2020-09-29 11:02:16 -04:00
Tim Gross	6489c5f626	e2e: namespace support for CLI helpers (#8978 ) Required to support tests for namespaces and other ENT features.	2020-09-28 16:37:34 -04:00
Tim Gross	6bed4ec45b	e2e: ENT placeholder for namespace/quotas tests (#8973 )	2020-09-28 11:23:37 -04:00
Tim Gross	1311f32f1b	e2e: test for host volumes and Docker volumes (#8972 ) Exercises host volume and Docker volume functionality for the `exec` and `docker` task driver, particularly around mounting locations within the container and how this can be used with `template`.	2020-09-28 11:14:13 -04:00
Tim Gross	566dae7b19	e2e: add flag to bootstrap Nomad ACLs (#8961 ) Adds a `nomad_acls` flag to our Terraform stack that bootstraps Nomad ACLs via a `local-exec` provider. There's no way to set the `NOMAD_TOKEN` in the Nomad TF provider if we're bootstrapping in the same Terraform stack, so instead of using `resource.nomad_acl_token`, we also bootstrap a wide-open anonymous policy. The resulting management token is exported as an environment var with `$(terraform output environment)` and tests that want stricter ACLs will be able to write them using that token. This should also provide a basis to do similar work with Consul ACLs in the future.	2020-09-28 09:22:36 -04:00
Tim Gross	15d3f5ea7e	e2e: remove unused migrations test (#8955 ) The areas of the code this test exercised were merged in with the node drain tests.	2020-09-23 14:50:15 -04:00
Tim Gross	147b16243d	e2e: use more recent instance type (#8954 ) Newer EC2 instances are both cheaper and have generally better performance. The dnsmasq configuration had a hard-coded interface name, so in order to accomodate instances with more recent networking that result in so-called predictable interface names, the dnsmasq configuration needs to be replaced at runtime with userdata to select the default interface.	2020-09-23 14:27:52 -04:00
Tim Gross	1fc525ec1e	e2e: add flags for provisioning Nomad Enterprise (#8929 )	2020-09-23 10:39:04 -04:00
Tim Gross	9cbc604308	e2e: node drain tests (#8906 ) Exercise the `nomad node drain` features, driving them via the new CLI helpers.	2020-09-21 11:52:11 -04:00
Tim Gross	34093f7747	e2e: reschedule tests should check for non-zero rescheduled allocs (#8927 ) The conditional around some of the rescheduling tests was backwards, where we were waiting for allocations to be rescheduled but testing for a count of 0. The test was passing but flaky because if the check happened quickly enough before the scheduler rescheduled the allocations, it would pass.	2020-09-21 08:17:24 -04:00
Tim Gross	3da61545d5	make sure dev-cluster has the option to run windows config (#8928 )	2020-09-18 16:41:35 -04:00
Tim Gross	ea1f6408bf	e2e: remove unused framework provisioning code (#8908 )	2020-09-18 11:46:47 -04:00
Tim Gross	c413fa5e49	e2e: test script for Terraform logic (#8907 )	2020-09-18 11:46:40 -04:00
Tim Gross	9d37233eaf	e2e: provision cluster entirely through Terraform (#8748 ) Have Terraform run the target-specific `provision.sh`/`provision.ps1` script rather than the test runner code which needs to be customized for each distro. Use Terraform's detection of variable value changes so that we can re-run the provisioning without having to re-install Nomad on those specific hosts that need it changed. Allow the configuration "profile" (well-known directory) to be set by a Terraform variable. The default configurations are installed during Packer build time, and symlinked into the live configuration directory by the provision script. Detect changes in the file contents so that we only upload custom configuration files that have changed between Terraform runs	2020-09-18 11:27:24 -04:00
Tim Gross	990fcf7be4	e2e: documentation and minor tweaks to configs (#8912 ) * remove outdated references to envchain in documentation * add new host volume locations in userdata * don't exit the entire script during provisioning, just return	2020-09-17 09:20:18 -04:00
Tim Gross	d7a013b6f5	e2e: refactor CLI utils out of rescheduling test (#8905 ) The CLI helpers in the rescheduling test were intended for shared use, but until some other tests were written we didn't want to waste time making them generic. This changeset refactors them and adds some new helpers associated with the node drain tests (under separate PR).	2020-09-16 16:10:06 -04:00
Tim Gross	bd889c82aa	e2e: constrain rescheduling test workloads to Linux (#8872 ) The rescheduling test workloads were created before we had Windows targets in the E2E nightly run. When these were recently ported to the e2e framework they were missing the constraint to Linux machines. Also added a little extra time to polling to avoid some flakiness on the first run, and a minor readability adjustment to the job names.	2020-09-11 09:21:28 -04:00
Tim Gross	572ae37856	Merge pull request #8860 E2E: rescheduling tests	2020-09-10 13:43:55 -04:00
Tim Gross	294c7149a2	e2e: rescheduling tests Ports the rescheduling tests (which aren't running in CI) into the current test framework so that they're run on nightly, and exercises the new CLI helpers.	2020-09-10 13:00:37 -04:00
Tim Gross	28e9bbbbf4	e2e: helper for sending CLI commands and parsing output The E2E suite exercises the API, but not the CLI. This changeset adds a helper function to send commands via a locally-built Nomad binary (which we'll need to add to the E2E setup), and some helpers to parse the resulting structured outputs in a way that tests can consume.	2020-09-10 13:00:32 -04:00
Michael Schurter	5f3a71d0b9	docs: update scripts to 0.12.4	2020-09-09 15:22:37 -07:00
James Rasell	76b03d3a2f	e2e: fix failure in running metrics test suite jobs. When running the Fabio and Prometheus jobs for the metrics suite it seems the outer directory is required in the call when registering the job. error: "e2e/input/fabio.nomad: no such file or directory"	2020-09-09 08:40:35 +02:00
Tim Gross	f499b44101	e2e: move setup jobs for metrics test into that suite (#8842 ) The fabio and prometheus workloads are specific to the metrics test and aren't used by any other test suite.	2020-09-08 13:21:44 -04:00
Tim Gross	a47b1c1081	e2e: move configurations into profile-specific directories (#8828 ) This changeset stages upcoming E2E provisioning improvements work. It splits the existing shared configuration directory into 3 profiles: * "full-cluster": the set of configurations currently in use * "dev-cluster": a simplified set of mostly existing configurations that weren't in use. * "custom": an empty profile for developers to keep non-standard configurations during complex feature development. The tooling to switch between profiles will be in a later changeset. Also drops some unused configuration knobs from the provisioning scripts to make the next stage of work easier.	2020-09-04 11:23:32 -04:00
Tim Gross	93c1093274	e2e: remove unused EBS volumes and depends_on (#8827 ) Our provisioning process for E2E doesn't require the `depends_on` fields to be set for client instances, so dropping that field allows all instances to be started in parallel. We don't use the extra EBS volumes (they aren't even mounted), so remove them to reduce costs.	2020-09-04 10:25:59 -04:00

1 2 3 4 5 ...

337 commits