Go to file
Tim Gross f6b3d38eb8
CSI: move node unmount to server-driven RPCs (#7596)
If a volume-claiming alloc stops and the CSI Node plugin that serves
that alloc's volumes is missing, there's no way for the allocrunner
hook to send the `NodeUnpublish` and `NodeUnstage` RPCs.

This changeset addresses this issue with a redesign of the client-side
for CSI. Rather than unmounting in the alloc runner hook, the alloc
runner hook will simply exit. When the server gets the
`Node.UpdateAlloc` for the terminal allocation that had a volume claim,
it creates a volume claim GC job. This job will made client RPCs to a
new node plugin RPC endpoint, and only once that succeeds, move on to
making the client RPCs to the controller plugin. If the node plugin is
unavailable, the GC job will fail and be requeued.
2020-04-02 16:04:56 -04:00
.circleci CI: add xUnit reporting for UI tests (#7585) 2020-04-01 11:19:44 -05:00
.github Revert "Add the digital marketing team as the code owners for the website dir" 2020-02-19 12:25:42 -06:00
.netlify Remove most Netlify configuration (#6194) 2019-08-22 15:54:23 -05:00
acl more testing for ScalingPolicy, mainly around parsing and canonicalization for Min/Max 2020-03-24 19:43:50 +00:00
api added indices to the job scaling events, so we could properly do 2020-04-01 17:28:19 +00:00
client CSI: move node unmount to server-driven RPCs (#7596) 2020-04-02 16:04:56 -04:00
command Merge pull request #7572 from hashicorp/f-7422-scaling-events 2020-04-01 13:49:22 -05:00
contributing doc: add a reminder about field order for diff comparisons 2020-03-25 09:33:27 -06:00
demo ci: bump consul and vault 2020-03-15 11:01:55 +01:00
dev dev: Tweaks to cluster dev scripts 2020-02-03 11:50:43 -05:00
devices/gpu/nvidia Update devices/gpu/nvidia/README.md 2019-01-23 17:44:24 -08:00
dist Set OOMScoreAdjust within systemd dist example (#6679) 2019-11-12 08:30:54 -05:00
drivers Merge pull request #7554 from benbuzbee/benbuz/fix-seccomp-file 2020-03-31 11:54:17 -04:00
e2e e2e: have TF write-out HCL for CSI volume registration (#7599) 2020-04-02 12:16:43 -04:00
helper Merge pull request #7558 from hashicorp/b-ensure-correct-plugin-version-mapping 2020-04-01 12:34:24 +02:00
integrations spelling: registrations 2018-03-11 18:40:53 +00:00
internal/testing/apitests update rest of consul packages 2020-02-16 16:25:04 -06:00
jobspec hclfmt test fixtures (#7584) 2020-04-01 10:48:28 -04:00
lib circbufwritter: add defer to stop ticker in flush loop 2019-01-28 14:33:20 -05:00
nomad CSI: move node unmount to server-driven RPCs (#7596) 2020-04-02 16:04:56 -04:00
plugins set safe default for CSI plugin MaxVolumes (#7583) 2020-04-01 11:08:55 -04:00
scheduler csi: use node MaxVolumes during scheduling (#7565) 2020-03-31 17:16:47 -04:00
scripts dev: use go mod to install dev dependencies 2020-03-30 21:32:45 -04:00
terraform separate vars and outputs into their own files and update default link in nomad binary variable to 0.10.0 release (#6550) 2019-10-25 14:15:30 -04:00
testutil tests: wait until leadership loop finishes 2020-03-06 14:41:59 -05:00
ui UI: Fix confirmation position for right-aligned TSBs (#7590) 2020-04-02 11:58:29 -05:00
vendor Merge pull request #7560 from hashicorp/vendor-go-msgpack-v1.1.5 2020-03-31 10:09:05 -04:00
version release: prep for next dev cycle 2020-02-19 14:31:31 -05:00
website connect: set task lifecycle config for injected sidecar task 2020-04-01 21:19:41 -04:00
.gitattributes Remove invalid gitattributes 2018-02-14 14:47:43 -08:00
.gitignore gitignore: only ignore toplevel tags and bin 2019-12-03 13:36:54 -05:00
.golangci.yml chore: Switch from gometalinter to golangci-lint 2019-12-05 18:58:13 -06:00
CHANGELOG.md batch updates of changelog items (#7570) 2020-03-31 16:28:11 -07:00
GNUmakefile Use latest go-bindata and go-bindata-assetfs 2020-03-31 20:28:25 -04:00
LICENSE
README.md use golang 14.1 2020-03-23 13:02:14 -04:00
Vagrantfile Remove rkt as a built-in driver 2020-02-26 22:16:41 -05:00
appveyor.yml use golang 14.1 2020-03-23 13:02:14 -04:00
build_linux_arm.go
main.go fix comment typo 2019-09-18 09:11:08 -04:00
main_test.go

README.md

Nomad Build Status Discuss

Overview

Nomad is an easy-to-use, flexible, and performant workload orchestrator that deploys:

Nomad enables developers to use declarative infrastructure-as-code for deploying their applications (jobs). Nomad uses bin packing to efficiently schedule jobs and optimize for resource utilization. Nomad is supported on macOS, Windows, and Linux.

Nomad is widely adopted and used in production by PagerDuty, Target, Citadel, Trivago, SAP, Pandora, Roblox, eBay, Deluxe Entertainment, and more.

  • Deploy Containers and Legacy Applications: Nomads flexibility as an orchestrator enables an organization to run containers, legacy, and batch applications together on the same infrastructure. Nomad brings core orchestration benefits to legacy applications without needing to containerize via pluggable task drivers.

  • Simple & Reliable: Nomad runs as a single 75MB binary and is entirely self contained - combining resource management and scheduling into a single system. Nomad does not require any external services for storage or coordination. Nomad automatically handles application, node, and driver failures. Nomad is distributed and resilient, using leader election and state replication to provide high availability in the event of failures.

  • Device Plugins & GPU Support: Nomad offers built-in support for GPU workloads such as machine learning (ML) and artificial intelligence (AI). Nomad uses device plugins to automatically detect and utilize resources from hardware devices such as GPU, FPGAs, and TPUs.

  • Federation for Multi-Region, Multi-Cloud: Nomad was designed to support infrastructure at a global scale. Nomad supports federation out-of-the-box and can deploy jobs across multiple regions and clouds.

  • Proven Scalability: Nomad is optimistically concurrent, which increases throughput and reduces latency for workloads. Nomad has been proven to scale to clusters of 10K+ nodes in real-world production environments.

  • HashiCorp Ecosystem: Nomad integrates seamlessly with Terraform, Consul, Vault for provisioning, service discovery, and secrets management.

Getting Started

Get started with Nomad quickly in a sandbox environment on the public cloud or on your computer.

These methods are not meant for production.

Documentation & Guides

Documentation is available on the Nomad website here.

Resources

Who Uses Nomad

...and more!

Contributing to Nomad

If you wish to contribute to Nomad, you will need Go installed on your machine (version 1.14.1+ is required, and gcc-go is not supported).

See the contributing directory for more developer documentation.

Developing with Vagrant There is an included Vagrantfile that can help bootstrap the process. The created virtual machine is based off of Ubuntu 16, and installs several of the base libraries that can be used by Nomad.

To use this virtual machine, checkout Nomad and run vagrant up from the root of the repository:

$ git clone https://github.com/hashicorp/nomad.git
$ cd nomad
$ vagrant up

The virtual machine will launch, and a provisioning script will install the needed dependencies.

Developing locally For local dev first make sure Go is properly installed, including setting up a GOPATH. After setting up Go, clone this repository into $GOPATH/src/github.com/hashicorp/nomad. Then you can download the required build tools such as vet, cover, godep etc by bootstrapping your environment.

$ make bootstrap
...

Nomad creates many file handles for communicating with tasks, log handlers, etc. In some development environments, particularly macOS, the default number of file descriptors is too small to run Nomad's test suite. You should set ulimit -n 1024 or higher in your shell. This setting is scoped to your current shell and doesn't affect other running shells or future shells.

Afterwards type make test. This will run the tests. If this exits with exit status 0, then everything is working!

$ make test
...

To compile a development version of Nomad, run make dev. This will put the Nomad binary in the bin and $GOPATH/bin folders:

$ make dev

Optionally run Consul to enable service discovery and health checks:

$ sudo consul agent -dev

And finally start the nomad agent:

$ sudo bin/nomad agent -dev

If the Nomad UI is desired in the development version, run make dev-ui. This will build the UI from source and compile it into the dev binary.

$ make dev-ui
...
$ bin/nomad
...

To compile protobuf files, installing protoc is required: See
https://github.com/google/protobuf for more information.

Note: Building the Nomad UI from source requires Node, Yarn, and Ember CLI. These tools are already in the Vagrant VM. Read the UI README for more info.

To cross-compile Nomad, run make prerelease and make release. This will generate all the static assets, compile Nomad for multiple platforms and place the resulting binaries into the ./pkg directory:

$ make prerelease
$ make release
...
$ ls ./pkg
...