open-nomad

Author	SHA1	Message	Date
Tim Gross	f4ccb360ef	E2E: use remote-exec via TF0.14.7+ The E2E provisioning used local-exec to call ssh in a for loop in a hacky workaround https://github.com/hashicorp/terraform/issues/25634, which prevented remote-exec from working on Windows. Move to a newer version of Terraform that fixes the remote-exec bug to make provisioning more reliable and observable. Note that Windows remote-exec needs to include the `powershell` call itself, unlike Unix-alike remote-exec.	2021-04-08 16:03:06 -04:00
Tim Gross	da89103c5c	E2E: extend CSI test to cover create and snapshot workflows Split the EBS and EFS tests out into their own test cases: * EBS exercises the Controller RPCs, including the create/snapshot workflow. * EFS exercises only the Node RPCs, and assumes we have an existing volume that gets registered, rather than created.	2021-04-08 12:55:36 -04:00
Drew Bailey	64084f3209	e2e allow setting an enterprise license environment variable (#10233 ) * allow setting an enterprise license environment variable * update comment * address pr comments	2021-03-25 14:35:55 -04:00
Tim Gross	fa25e048b2	CSI: unique volume per allocation Add a `PerAlloc` field to volume requests that directs the scheduler to test feasibility for volumes with a source ID that includes the allocation index suffix (ex. `[0]`), rather than the exact source ID. Read the `PerAlloc` field when making the volume claim at the client to determine if the allocation index suffix (ex. `[0]`) should be added to the volume source ID.	2021-03-18 15:35:11 -04:00
Tim Gross	2a2e36690a	docs: swap master for main in Nomad repo	2021-03-08 14:26:31 -05:00
Mahmood Ali	ff8d67fae2	Merge pull request #9935 from hashicorp/e2e-segment-e2e-clusters e2e: segment e2e clusters	2021-03-01 09:23:21 -05:00
Seth Hoenig	d2cd605995	dist: place systemd unit options correctly This PR places StartLimitIntervalSec and StartLimitBurst in the Unit section of systemd unit files, rather than the Service section. https://www.freedesktop.org/software/systemd/man/systemd.unit.html Fixes #10065	2021-02-22 19:23:00 -06:00
Drew Bailey	c152757d38	E2e/fix periodic (#10047 ) * fix periodic * update periodic to not use template nomad job inspect no longer returns an apiliststub so the required fields to query job summary are no longer there, parse cli output instead * rm tmp makefile entry * fix typo * revert makefile change	2021-02-18 12:21:53 -05:00
Seth Hoenig	45e0e70a50	consul/connect: enable custom sidecars to use expose checks This PR enables jobs configured with a custom sidecar_task to make use of the `service.expose` feature for creating checks on services in the service mesh. Before we would check that sidecar_task had not been set (indicating that something other than envoy may be in use, which would not support envoy's expose feature). However Consul has not added support for anything other than envoy and probably never will, so having the restriction in place seems like an unnecessary hindrance. If Consul ever does support something other than Envoy, they will likely find a way to provide the expose feature anyway. Fixes #9854	2021-02-09 10:49:37 -06:00
Chris Baker	b1bb8a760e	e2e packer build: upgrade jdk to java 14	2021-02-02 17:33:48 +00:00
Mahmood Ali	45889f9f55	e2e: segment e2e clusters Ensure that the e2e clusters are isolated and never attempt to autojoin with another e2e cluster. This ensures that each cluster servers have a unique `ConsulAutoJoin`, to be used for discovery.	2021-02-01 08:04:21 -05:00
Mahmood Ali	0aafd9af64	e2e: Fix build script and pass shellcheck	2021-01-26 09:11:37 -05:00
Mahmood Ali	4397eda209	Merge pull request #9798 from hashicorp/e2e-terraform-tweaks-20200113 This PR makes two ergonomics changes, meant to get e2e builds more reproducible and ease changes. ### AMI Management First, we pin the server AMIs to the commits associated with the build. No more using the latest AMI a developer build in a test branch, or accidentally using a stale AMI because we forgot to build one! Packer is to tag the AMI images with the commit sha used to generate the image, and then Terraform would look up only the AMIs associated with that sha. To minimize churn, we use the SHA associated with the latest Packer configurations, rather than SHA of all. This has few benefits: reproducibility and avoiding accidental AMI changes and contamination of changes across branches. Also, the change is a stepping stone to an e2e pipeline that builds new AMIs automatically if Packer files changed. The downside is that new AMIs will be generated even for irrelevant changes (e.g. spelling, commits), but I suspect that's OK. Also, an engineer will be forced to build the AMI whenever they change Packer files while iterating on e2e scripts; this hasn't been an issue for me yet, and I'll be open for iterating on that later if it proves to be an issue. ### Config Files and Packer Second, this PR moves e2e config hcl management to Terraform instead of Packer. Currently, the config files live in `./terraform/config`, but they are baked into the servers by Packer and changes are ignored. This current behavior surprised me, as I spent a bit of time debugging why my config changes weren't applied. Having Terraform manage them would ease engineer's iteration. Also, make Packer management more consistent (Packer only works `e2e/terraform/packer`), and easing the logic for AMI change detection. The config directory is very small (100KB), and having it as an upload step adds negligible time to `terraform apply`.	2021-01-25 13:20:28 -05:00
Mahmood Ali	39da228964	update readme about profiles and packer build	2021-01-25 11:40:26 -05:00
Drew Bailey	630babb886	prevent double job status update (#9768 ) * Prevent Job Statuses from being calculated twice https://github.com/hashicorp/nomad/pull/8435 introduced atomic eval insertion iwth job (de-)registration. This change removes a now obsolete guard which checked if the index was equal to the job.CreateIndex, which would empty the status. Now that the job regisration eval insetion is atomic with the registration this check is no longer necessary to set the job statuses correctly. * test to ensure only single job event for job register * periodic e2e * separate job update summary step * fix updatejobstability to use copy instead of modified reference of job * update envoygatewaybindaddresses copy to prevent job diff on null vs empty * set ConsulGatewayBindAddress to empty map instead of nil fix nil assertions for empty map rm unnecessary guard	2021-01-22 09:18:17 -05:00
Mahmood Ali	76ce6306a4	add helper for building ami	2021-01-15 10:49:13 -05:00
Mahmood Ali	e51651c34a	set sha	2021-01-15 10:49:13 -05:00
Mahmood Ali	82637715cf	change ami naming	2021-01-15 10:49:12 -05:00
Mahmood Ali	0af1509a77	move config files to terraform	2021-01-15 10:49:12 -05:00
Seth Hoenig	64a8b795f2	Merge pull request #9766 from hashicorp/f-bump-cni-plugins-version cni: bump CNI plugins version to v0.9.0	2021-01-11 09:59:43 -06:00
Tim Gross	f97505e384	e2e: remove deprecated terraform syntax Also bumps patch versions of some TF modules	2021-01-11 08:25:22 -05:00
Seth Hoenig	fc5f48d936	cni: bump CNI version to v0.9.0 https://github.com/containernetworking/plugins/releases/tag/v0.9.0 Also make the copy-paste install instructions work with arm64 for a better OOTB experience (AWS Graviton, Pi 4's).	2021-01-10 18:03:27 -06:00
Seth Hoenig	7da808b43a	e2e: add terraform lockfile Terraform v0.14 is producing a lockfile after running `terraform init`. The docs suggest we should include this file in the git repository: > You should include this file in your version control repository so > that you can discuss potential changes to your external dependencies > via code review, just as you would discuss potential changes to your > configuration itself. Sounds similar to go.sum https://www.terraform.io/docs/configuration/dependency-lock.html#lock-file-location	2021-01-05 08:55:37 -06:00
Tim Gross	26f4ee7fb1	e2e: dnsmasq configuration fixes * systemd units require absolute paths * ensure directory exists for dnsmasq	2021-01-04 15:40:57 -05:00
Tim Gross	c4e57fb813	e2e: document some design goals	2020-12-17 10:33:33 -05:00
Tim Gross	88fc79c35e	e2e: bump default version of dev cluster	2020-12-17 10:33:33 -05:00
Tim Gross	00bc6a7d13	e2e: move dnsmasq config into dnsmasq service unit (#9660 ) Our dnsmasq configuration needs host-specific data that we can't configure in the AMI build. But configuring this in userdata leads to a race between userdata execution, docker.service startup, and dnsmasq.service startup. So rather than letting dnsmasq come up with incorrect configuration and then modifying it after the fact, do the configuration in the service's prestart, and have it kick off a Docker restart when we're done.	2020-12-17 10:33:19 -05:00
Seth Hoenig	ad5918f754	e2e: upgrade terraform consul to 1.9.0	2020-12-03 13:01:14 -06:00
Tim Gross	d686a51d60	e2e: prevent Ubuntu startup race conditions (#9428 ) The cloud-init configuration runs on boot, which can result in a race condition between that and service startup. This has caused provisioning failures because Nomad expects the userdata to have configured a host volume directory. Diagnosing this was also compounded by a warning being fired by systemd for the Nomad unit file. * Update the location of the `StartLimitIntervalSec` field to it's post-systemd-230 location. * Ensure that the weekly AMI build is up-to-date to reduce the risk of unexpected system software changes. * Move the host volume to a directory we can set up at AMI build time rather than in userdata.	2020-11-23 12:29:08 -05:00
Drew Bailey	9a1fc720c8	enables audit log on full-cluster (#9315 )	2020-11-11 08:33:01 -05:00
Tim Gross	08ae13d3b9	e2e: Windows provisioning improvements (#9246 ) Small changes to the Windows 2016 Packer build for debuggability of provisioning: * improve verbosity of powershell error handling * remove unused "tools" installation * use ssh communicator for Packer to improve Packer build times and eliminate deprecated winrm remote access (unavailable from current macOS)	2020-11-09 13:29:40 -05:00
Drew Bailey	c181973265	append custom path to custom_config_files (#9289 ) * append custom path to custom_config_files * remove config_path variable	2020-11-06 11:16:13 -05:00
Tim Gross	dc8e20206d	E2E: switch packer build files to HCL2 (#9219 ) Build configuration files need comments, and JSON is also just the worst, isn't it? Upgrade our E2E packer configs to use the new HCL2 syntax.	2020-10-29 10:03:39 -04:00
Tim Gross	06c75460f3	e2e: provide precedence for version variables (#9216 ) The `nomad_sha`, `nomad_version`, and `nomad_local_binary` variables for the Nomad provisioning module assumed that only one would be set. By having the override each other with an explicit precedence, it makes it easier to avoid problems with Terraform's implicit variables behavior. Set the expected default values in the `terraform.full.tfvars` to avoid shadowing by any future changes to the `terraform.tfvars` file. Update the Makefile to put the `-var` and `-var-file` in the correct order.	2020-10-29 09:15:22 -04:00
Tim Gross	57f694ff2e	E2E: AMI software version bumps and cleanup (#9213 ) * remove unused vault installation from Windows AMI * match Windows and Linux Consul versions * bump AMI base Nomad to current stable	2020-10-29 08:27:50 -04:00
Tim Gross	a2710c7a31	e2e: set default version for dev cluster (#9208 )	2020-10-28 16:50:20 -04:00
Tim Gross	99c2a2df00	e2e: reduce risk of flaky Ubuntu AMI build (#9207 ) The base Ubuntu AMI modifies apt sources during cloud-init. But the Packer build can potentially start the setup script before that work is done, resulting in errors trying to install base system dependencies like `dnsmasq`. Delay the setup long enough to lose the race with cloud-init.	2020-10-28 15:13:44 -04:00
Tim Gross	7e4a35ad7e	e2e: use more specific names for OS/distros (#9204 ) We intend to expand the nightly E2E test to cover multiple distros and platforms. Change the naming structure for "Linux client" to the more precise "Ubuntu Bionic", and "Windows" to "Windows 2016" to make it easier to add new targets without additional refactoring.	2020-10-28 12:58:00 -04:00
Tim Gross	be3f54d296	e2e: make dev cluster the default Terraform vars file (#9202 ) Most of the time that a human is running the TF provisioning, they want the "dev cluster" which is going to deploy an OSS sha, with fewer targets and configuration alternatives. But the default `terraform.tfvars` is the nightly E2E run. Because the nightly run is automated, there's no reason we can't have it pick a non-default `terraform.full.tfvars` file and have the default be the dev cluster.	2020-10-28 10:01:42 -04:00
Tim Gross	9fa38bac98	e2e: path fixes for local_binary uploads (#9137 ) When uploading a local binary for provisioning, the location that we pass into the provisioning script needs to be where we uploaded it to, not the source on our laptop. Also, the null_resource for uploading needs to read in the private key, not its path.	2020-10-21 10:20:22 -04:00
Tim Gross	76f1f5e5df	e2e: use AMI filter for Ubuntu packer image (#9086 ) Instead of hard-coding the base AMI for our Packer image for Ubuntu, use the latest from Canonical so that we always have their current kernel patches.	2020-10-14 11:22:33 -04:00
Tim Gross	115edb53a0	e2e: add flag to opt-in to creating EBS/EFS volumes (#9082 ) For everyday developer use, we don't need volumes for testing CSI. Providing a flag to opt-in speeds up deploying dev clusters and slightly reduces infra costs. Skip CSI test if missing volume specs.	2020-10-14 10:29:33 -04:00
Yoan Blanc	891accb89a	use allow/deny instead of the colored alternatives (#9019 ) Signed-off-by: Yoan Blanc <yoan@dosimple.ch>	2020-10-12 08:47:05 -04:00
Tim Gross	727277793b	e2e: bootstrap vault and provision Nomad with vault tokens (#9010 ) Provisions vault with the policies described in the Nomad Vault integration guide, and drops a configuration file for Nomad vault server configuration with its token. The vault root token is exposed to the E2E runner so that tests can write additional policies to vault.	2020-10-05 09:28:37 -04:00
Tim Gross	b6292528fe	e2e: tfvars.dev file must override default tfvars file (#9005 ) The `-var-file` flag for loading variables into Terraform overlays the default variables file if present. This means that variables that are set in the default variables file will take precedence if the overlay file does not have them set. Set `nomad_acls` and `nomad_enteprise` to `false` in the dev cluster.	2020-10-02 08:02:37 -04:00
Tim Gross	566dae7b19	e2e: add flag to bootstrap Nomad ACLs (#8961 ) Adds a `nomad_acls` flag to our Terraform stack that bootstraps Nomad ACLs via a `local-exec` provider. There's no way to set the `NOMAD_TOKEN` in the Nomad TF provider if we're bootstrapping in the same Terraform stack, so instead of using `resource.nomad_acl_token`, we also bootstrap a wide-open anonymous policy. The resulting management token is exported as an environment var with `$(terraform output environment)` and tests that want stricter ACLs will be able to write them using that token. This should also provide a basis to do similar work with Consul ACLs in the future.	2020-09-28 09:22:36 -04:00
Tim Gross	147b16243d	e2e: use more recent instance type (#8954 ) Newer EC2 instances are both cheaper and have generally better performance. The dnsmasq configuration had a hard-coded interface name, so in order to accomodate instances with more recent networking that result in so-called predictable interface names, the dnsmasq configuration needs to be replaced at runtime with userdata to select the default interface.	2020-09-23 14:27:52 -04:00
Tim Gross	1fc525ec1e	e2e: add flags for provisioning Nomad Enterprise (#8929 )	2020-09-23 10:39:04 -04:00
Tim Gross	3da61545d5	make sure dev-cluster has the option to run windows config (#8928 )	2020-09-18 16:41:35 -04:00
Tim Gross	ea1f6408bf	e2e: remove unused framework provisioning code (#8908 )	2020-09-18 11:46:47 -04:00

1 2 3

128 commits