open-nomad/CHANGELOG.md

837 lines
42 KiB
Markdown

## 0.6.1 (Unreleased)
IMPROVEMENTS:
* core: `distinct_property` constraint can set the number of allocations that
are allowed to share a property value [GH-2942]
* driver/rkt: support read-only volume mounts [GH-2883]
BUG FIXES:
* core: Fix incorrect destructive update with `distinct_property` constraint
[GH-2939]
* cli: Fix autocmpleting global flags [GH-2928]
* cli: Fix panic when using 0.6.0 cli with an older cluster [GH-2929]
* driver/docker: Fix leaking plugin file used by syslog server [GH-2937]
## 0.6.0 (July 26, 2017)
__BACKWARDS INCOMPATIBILITIES:__
* cli: When given a prefix that does not resolve to a particular object,
commands now return exit code 1 rather than 0.
IMPROVEMENTS:
* core: Rolling updates based on allocation health [GH-2621, GH-2634, GH-2799]
* core: New deployment object to track job updates [GH-2621, GH-2634, GH-2799]
* core: Default advertise to private IP address if bind is 0.0.0.0 [GH-2399]
* core: Track multiple job versions and add a stopped state for jobs [GH-2566]
* core: Job updates can create canaries before beginning rolling update
[GH-2621, GH-2634, GH-2799]
* core: Back-pressure when evaluations are nacked and ensure scheduling
progress on evaluation failures [GH-2555]
* agent/config: Late binding to IP addresses using go-sockaddr/template syntax
[GH-2399]
* api: Add `verify_https_client` to require certificates from HTTP clients
[GH-2587]
* api/job: Ability to revert job to older versions [GH-2575]
* cli: Autocomplete for CLI commands [GH-2848]
* client: Use a random host UUID by default [GH-2735]
* client: Add `NOMAD_GROUP_NAME` environment variable [GH-2877]
* client: Environment variables for client DC and Region [GH-2507]
* client: Hash host ID so its stable and well distributed [GH-2541]
* client: GC dead allocs if total allocs > `gc_max_allocs` tunable [GH-2636]
* client: Persist state using bolt-db and more efficient write patterns
[GH-2610]
* client: Fingerprint all routable addresses on an interface including IPv6
addresses [GH-2536]
* client/artifact: Support .xz archives [GH-2836]
* client/artifact: Allow specifying a go-getter mode [GH-2781]
* client/artifact: Support non-Amazon S3-compatible sources [GH-2781]
* client/template: Support reading env vars from templates [GH-2654]
* config: Support Unix socket addresses for Consul [GH-2622]
* discovery: Advertise driver-specified IP address and port [GH-2709]
* discovery: Support `tls_skip_verify` for Consul HTTPS checks [GH-2467]
* driver/docker: Allow specifying extra hosts [GH-2547]
* driver/docker: Allow setting seccomp profiles [GH-2658]
* driver/docker: Support Docker credential helpers [GH-2651]
* driver/docker: Auth failures can optionally be ignored [GH-2786]
* driver/docker: Add `driver.docker.bridge_ip` node attribute [GH-2797]
* driver/docker: Allow setting container IP with user defined networks
[GH-2535]
* driver/rkt: Support `no_overlay` [GH-2702]
* driver/rkt: Support `insecure_options` list [GH-2695]
* server: Allow tuning of node heartbeat TTLs [GH-2859]
* server/networking: Shrink dynamic port range to not overlap with majority of
operating system's ephemeral port ranges to avoid port conflicts [GH-2856]
BUG FIXES:
* core: Protect against nil job in new allocation, avoiding panic [GH-2592]
* core: System jobs should be running until explicitly stopped [GH-2750]
* core: Prevent invalid job updates (eg service -> batch) [GH-2746]
* client: Lookup `ip` utility on `$PATH` [GH-2729]
* client: Add sticky bit to temp directory [GH-2519]
* client: Shutdown task group leader before other tasks [GH-2753]
* client: Include symlinks in snapshots when migrating disks [GH-2687]
* client: Regression for allocation directory unix perms introduced in v0.5.6
fixed [GH-2675]
* client: Client syncs allocation state with server before waiting for
allocation destroy fixing a corner case in which an allocation may be blocked
till destroy [GH-2563]
* client: Improved state file handling and reduced write volume [GH-2878]
* client/artifact: Honor netrc [GH-2524]
* client/artifact: Handle tars where file in directory is listed before
directory [GH-2524]
* client/config: Use `cpu_total_compute` whenever it is set [GH-2745]
* client/config: Respect `vault.tls_server_name` setting in consul-template
[GH-2793]
* driver/exec: Properly set file/dir ownership in chroots [GH-2552]
* driver/docker: Fix panic in Docker driver on Windows [GH-2614]
* driver/rkt: Fix env var interpolation [GH-2777]
* jobspec/validation: Prevent static port conflicts [GH-2807]
* server: Reject non-TLS clients when TLS enabled [GH-2525]
* server: Fix a panic in plan evaluation with partial failures and all_at_once
set [GH-2544]
* server/periodic: Restoring periodic jobs takes launch time zone into
consideration [GH-2808]
* server/vault: Fix Vault Client panic when given nonexistant role [GH-2648]
* telemetry: Fix merging of use node name [GH-2762]
## 0.5.6 (March 31, 2017)
IMPROVEMENTS:
* api: Improve log API error when task doesn't exist or hasn't started
[GH-2512]
* client: Improve error message when artifact downloading fails [GH-2289]
* client: Track task start/finish time [GH-2512]
* client/template: Access Node meta and attributes in template [GH-2488]
BUG FIXES:
* core: Fix periodic job state switching to dead incorrectly [GH-2486]
* core: Fix dispatch of periodic job launching allocations immediately
[GH-2489]
* api: Fix TLS in logs and fs commands/APIs [GH-2290]
* cli/plan: Fix diff alignment and remove no change DC output [GH-2465]
* client: Fix panic when restarting non-running tasks [GH-2480]
* client: Fix env vars when multiple tasks and ports present [GH-2491]
* client: Fix `user` attribute disregarding membership of non-main group
[GH-2461]
* client/vault: Stop Vault token renewal on task exit [GH-2495]
* driver/docker: Proper reference counting through task restarts [GH-2484]
## 0.5.5 (March 14, 2017)
__BACKWARDS INCOMPATIBILITIES:__
* api: The api package definition of a Job has changed from exposing
primitives to pointers to primitives to allow defaulting of unset fields.
* driver/docker: The `load` configuration took an array of paths to images
prior to this release. A single image is expected by the driver so this
behavior has been changed to take a single path as a string. Jobs using the
`load` command should update the syntax to a single string. [GH-2361]
IMPROVEMENTS:
* core: Handle Serf Reap event [GH-2310]
* core: Update Serf and Memberlist for more reliable gossip [GH-2255]
* api: API defaults missing values [GH-2300]
* api: Validate the restart policy interval [GH-2311]
* api: New task event for task environment setup [GH-2302]
* api/cli: Add nomad operator command and API for interacting with Raft
configuration [GH-2305]
* cli: node-status displays enabled drivers on the node [GH-2349]
* client: Apply GC related configurations properly [GH-2273]
* client: Don't force uppercase meta keys in env vars [GH-2338]
* client: Limit parallelism during garbage collection [GH-2427]
* client: Don't exec `uname -r` for node attribute kernel.version [GH-2380]
* client: Artifact support for git and hg as well as netrc support [GH-2386]
* client: Add metrics to show number of allocations on in each state [GH-2425]
* client: Add `NOMAD_{IP,PORT}_<task>_<label>` environment variables [GH-2426]
* client: Allow specification of `cpu_total_compute` to override fingerprinter
[GH-2447]
* client: Reproducible Node ID on OSes that provide system-level UUID
[GH-2277]
* driver/docker: Add support for volume drivers [GH-2351]
* driver/docker: Docker image coordinator and caching [GH-2361]
* jobspec: Add leader task to allow graceful shutdown of other tasks within
the task group [GH-2308]
* periodic: Allow specification of timezones in Periodic Jobs [GH-2321]
* scheduler: New `distinct_property` constraint [GH-2418]
* server: Allow specification of eval/job gc threshold [GH-2370]
* server/vault: Vault Client on Server handles SIGHUP to reload configs
[GH-2270]
* telemetry: Clients report allocated/unallocated resources [GH-2327]
* template: Allow specification of template delimiters [GH-2315]
* template: Permissions can be set on template destination file [GH-2262]
* vault: Server side Vault telemetry [GH-2318]
* vault: Disallow root policy from being specified [GH-2309]
BUG FIXES:
* core: Handle periodic paramaterized jobs [GH-2385]
* core: Improve garbage collection of stopped batch jobs [GH-2432]
* api: Fix escaping of HTML characters [GH-2322]
* cli: Display disk resources in alloc-status [GH-2404]
* client: Drivers log during fingerprinting [GH-2337]
* client: Fix race condition with deriving vault tokens [GH-2275]
* client: Fix remounting alloc dirs after reboots [GH-2391] [GH-2394]
* client: Replace `-` with `_` in environment variable names [GH-2406]
* client: Fix panic and deadlock during client restore state when prestart
fails [GH-2376]
* config: Fix Consul Config Merging/Copying [GH-2278]
* config: Fix Client reserved resource merging panic [GH-2281]
* server: Fix panic when forwarding Vault derivation requests from non-leader
servers [GH-2267]
## 0.5.4 (January 31, 2017)
IMPROVEMENTS:
* client: Made the GC related tunables configurable via client configuration
[GH-2261]
BUG FIXES:
* client: Fix panic when upgrading to 0.5.3 [GH-2256]
## 0.5.3 (January 30, 2017)
IMPROVEMENTS:
* core: Introduce parameterized jobs and dispatch command/API [GH-2128]
* core: Cancel blocked evals upon successful one for job [GH-2155]
* api: Added APIs for requesting GC of allocations [GH-2192]
* api: Job summary endpoint includes summary status for child jobs [GH-2128]
* api/client: Plain text log streaming suitable for viewing logs in a browser
[GH-2235]
* cli: Defaulting to showing allocations which belong to currently registered
job [GH-2032]
* client: Garbage collect Allocation Runners to free up disk resouces
[GH-2081]
* client: Don't retrieve Driver Stats if unsupported [GH-2173]
* client: Filter log lines in the executor based on client's log level
[GH-2172]
* client: Added environment variables to discover addresses of sibling tasks
in an allocation [GH-2223]
* discovery: Register service with duplicate names on different ports [GH-2208]
* driver/docker: Add support for network aliases [GH-1980]
* driver/docker: Add `force_pull` option to force downloading an image [GH-2147]
* driver/docker: Retry when image is not found while creating a container
[GH-2222]
* driver/java: Support setting class_path and class name. [GH-2199]
* telemetry: Prefix gauge values with node name instead of hostname [GH-2098]
* template: The template block supports keyOrDefault [GH-2209]
* template: The template block can now interpolate Nomad environment variables
[GH-2209]
* vault: Improve validation of the Vault token given to Nomad servers
[GH-2226]
* vault: Support setting the Vault role to derive tokens from with
`create_from_role` setting [GH-2226]
BUG FIXES:
* client: Fixed namespacing for the cpu arch attribute [GH-2161]
* client: Fix issue where allocations weren't pulled for several minutes. This
manifested as slow starts, delayed kills, etc [GH-2177]
* client: Fix a panic that would occur with a racy alloc migration
cancellation [GH-2231]
* config: Fix merging of Consul options which caused auto_adverise to be
ignored [GH-2159]
* driver: Fix image based drivers (eg docker) having host env vars set [GH-2211]
* driver/docker: Fix Docker auth/logging interprelation [GH-2063, GH-2130]
* driver/docker: Fix parsing of Docker Auth Configurations. New parsing is
in-line with Docker itself. Also log debug message if auth lookup failed
[GH-2190]
* template: Fix splay being used as a wait and instead randomize the delay
from 0 seconds to splay duration [GH-2227]
## 0.5.2 (December 23, 2016)
BUG FIXES:
* client: Fixed a race condition and remove panic when handling duplicate
allocations [GH-2096]
* client: Cancel wait for remote allocation if migration is no longer required
[GH-2097]
* client: Failure to stat a single mountpoint does not cause all of host
resource usage collection to fail [GH-2090]
## 0.5.1 (December 12, 2016)
IMPROVEMENTS:
* driver/rkt: Support rkt's `--dns=host` and `--dns=none` options [GH-2028]
BUG FIXES:
* agent/config: Fix use of IPv6 addresses [GH-2036]
* api: Fix file descriptor leak and high CPU usage when using the logs
endpoint [GH-2079]
* cli: Improve parsing error when a job without a name is specified [GH-2030]
* client: Fixed permissions of migrated allocation directory [GH-2061]
* client: Ensuring allocations are not blocked more than once [GH-2040]
* client: Fix race on StreamFramer Destroy which would cause a panic [GH-2007]
* client: Not migrating allocation directories on the same client if sticky is
turned off [GH-2017]
* client/vault: Fix issue in which deriving a Vault token would fail with
allocation does not exist due to stale queries [GH-2050]
* driver/docker: Make container exist errors non-retriable by task runner
[GH-2033]
* driver/docker: Fixed an issue related to purging containers with same name
as Nomad is trying to start [GH-2037]
* driver/rkt: Fix validation of rkt volumes [GH-2027]
## 0.5.0 (November 16, 2016)
__BACKWARDS INCOMPATIBILITIES:__
* jobspec: Extracted the disk resources from the task to the task group. The
new block is name `ephemeral_disk`. Nomad will automatically convert
existing jobs but newly submitted jobs should refactor the disk resource
[GH-1710, GH-1679]
* agent/config: `network_speed` is now an override and not a default value. If
the network link speed is not detected a default value is applied.
IMPROVEMENTS:
* core: Support for gossip encryption [GH-1791]
* core: Vault integration to handle secure introduction of tasks [GH-1583,
GH-1713]
* core: New `set_contains` constraint to determine if a set contains all
specified values [GH-1839]
* core: Scheduler version enforcement disallows different scheduler version
from making decisions simultaneously [GH-1872]
* core: Introduce node SecretID which can be used to minimize the available
surface area of RPCs to malicious Nomad Clients [GH-1597]
* core: Add `sticky` volumes which inform the scheduler to prefer placing
updated allocations on the same node and to reuse the `local/` and
`alloc/data` directory from previous allocation allowing semi-persistent
data and allow those folders to be synced from a remote node [GH-1654,
GH-1741]
* agent: Add DataDog telemetry sync [GH-1816]
* agent: Allow Consul health checks to use bind address rather than advertise
[GH-1866]
* agent/config: Advertise addresses do not need to specify a port [GH-1902]
* agent/config: Bind address defaults to 0.0.0.0 and Advertise defaults to
hostname [GH-1955]
* api: Support TLS for encrypting Raft, RPC and HTTP APIs [GH-1853]
* api: Implement blocking queries for querying a job's evaluations [GH-1892]
* cli: `nomad alloc-status` shows allocation creation time [GH-1623]
* cli: `nomad node-status` shows node metadata in verbose mode [GH-1841]
* client: Failed RPCs are retried on all servers [GH-1735]
* client: Fingerprint and driver blacklist support [GH-1949]
* client: Introduce a `secrets/` directory to tasks where sensitive data can
be written [GH-1681]
* client/jobspec: Add support for templates that can render static files,
dynamic content from Consul and secrets from Vault [GH-1783]
* driver: Export `NOMAD_JOB_NAME` environment variable [GH-1804]
* driver/docker: Docker For Mac support [GH-1806]
* driver/docker: Support Docker volumes [GH-1767]
* driver/docker: Allow Docker logging to be configured [GH-1767]
* driver/docker: Add `userns_mode` (`--userns`) support [GH-1940]
* driver/lxc: Support for LXC containers [GH-1699]
* driver/rkt: Support network configurations [GH-1862]
* driver/rkt: Support rkt volumes (rkt >= 1.0.0 required) [GH-1812]
* server/rpc: Added an RPC endpoint for retreiving server members [GH-1947]
BUG FIXES:
* core: Fix case where dead nodes were not properly handled by System
scheduler [GH-1715]
* agent: Handle the SIGPIPE signal preventing panics on journalctl restarts
[GH-1802]
* api: Disallow filesystem APIs to read paths that escape the allocation
directory [GH-1786]
* cli: `nomad run` failed to run on Windows [GH-1690]
* cli: `alloc-status` and `node-status` work without access to task stats
[GH-1660]
* cli: `alloc-status` does not query for allocation statistics if node is down
[GH-1844]
* client: Prevent race when persisting state file [GH-1682]
* client: Retry recoverable errors when starting a driver [GH-1891]
* client: Do not validate the command does not contain spaces [GH-1974]
* client: Fix old services not getting removed from consul on update [GH-1668]
* client: Preserve permissions of nested directories while chrooting [GH-1960]
* client: Folder permissions are dropped even when not running as root [GH-1888]
* client: Artifact download failures will be retried before failing tasks
[GH-1558]
* client: Fix a memory leak in the executor that caused failed allocations
[GH-1762]
* client: Fix a crash related to stats publishing when driver hasn't started
yet [GH-1723]
* client: Chroot environment is only created once, avoid potential filesystem
errors [GH-1753]
* client: Failures to download an artifact are retried according to restart
policy before failing the allocation [GH-1653]
* client/executor: Prevent race when updating a job configuration with the
logger [GH-1886]
* client/fingerprint: Fix inconsistent CPU MHz fingerprinting [GH-1366]
* env/aws: Fix an issue with reserved ports causing placement failures
[GH-1617]
* discovery: Interpolate all service and check fields [GH-1966]
* discovery: Fix old services not getting removed from Consul on update
[GH-1668]
* discovery: Fix HTTP timeout with Server HTTP health check when there is no
leader [GH-1656]
* discovery: Fix client flapping when server is in a different datacenter as
the client [GH-1641]
* discovery/jobspec: Validate service name after interpolation [GH-1852]
* driver/docker: Fix `local/` directory mount into container [GH-1830]
* driver/docker: Interpolate all string configuration variables [GH-1965]
* jobspec: Tasks without a resource block no longer fail to validate [GH-1864]
* jobspec: Update HCL to fix panic in JSON parsing [GH-1754]
## 0.4.1 (August 18, 2016)
__BACKWARDS INCOMPATIBILITIES:__
* telemetry: Operators will have to explicitly opt-in for Nomad client to
publish allocation and node metrics
IMPROVEMENTS:
* core: Allow count 0 on system jobs [GH-1421]
* core: Summarize the current status of registered jobs. [GH-1383, GH-1517]
* core: Gracefully handle short lived outages by holding RPC calls [GH-1403]
* core: Introduce a lost state for allocations that were on Nodes that died
[GH-1516]
* api: client Logs endpoint for streaming task logs [GH-1444]
* api/cli: Support for tailing/streaming files [GH-1404, GH-1420]
* api/server: Support for querying job summaries [GH-1455]
* cli: `nomad logs` command for streaming task logs [GH-1444]
* cli: `nomad status` shows the create time of allocations [GH-1540]
* cli: `nomad plan` exit code indicates if changes will occur [GH-1502]
* cli: status commands support JSON output and go template formating [GH-1503]
* cli: Validate and plan command supports reading from stdin [GH-1460,
GH-1458]
* cli: Allow basic authentication through address and environment variable
[GH-1610]
* cli: `nomad node-status` shows volume name for non-physical volumes instead
of showing 0B used [GH-1538]
* cli: Support retrieving job files using go-getter in the `run`, `plan` and
`validate` command [GH-1511]
* client: Add killing event to task state [GH-1457]
* client: Fingerprint network speed on Windows [GH-1443]
* discovery: Support for initial check status [GH-1599]
* discovery: Support for query params in health check urls [GH-1562]
* driver/docker: Allow working directory to be configured [GH-1513]
* driver/docker: Remove docker volumes when removing container [GH-1519]
* driver/docker: Set windows containers network mode to nat by default
[GH-1521]
* driver/exec: Allow chroot environment to be configurable [GH-1518]
* driver/qemu: Allows users to pass extra args to the qemu driver [GH-1596]
* telemetry: Circonus integration for telemetry metrics [GH-1459]
* telemetry: Allow operators to opt-in for publishing metrics [GH-1501]
BUG FIXES:
* agent: Reload agent configuration on SIGHUP [GH-1566]
* core: Sanitize empty slices/maps in jobs to avoid incorrect create/destroy
updates [GH-1434]
* core: Fix race in which a Node registers and doesn't receive system jobs
[GH-1456]
* core: Fix issue in which Nodes with large amount of reserved ports would
casue dynamic port allocations to fail [GH-1526]
* core: Fix a condition in which old batch allocations could get updated even
after terminal. In a rare case this could cause a server panic [GH-1471]
* core: Do not update the Job attached to Allocations that have been marked
terminal [GH-1508]
* agent: Fix advertise address when using IPv6 [GH-1465]
* cli: Fix node-status when using IPv6 advertise address [GH-1465]
* client: Merging telemetry configuration properly [GH-1670]
* client: Task start errors adhere to restart policy mode [GH-1405]
* client: Reregister with servers if node is unregistered [GH-1593]
* client: Killing an allocation doesn't cause allocation stats to block
[GH-1454]
* driver/docker: Disable swap on docker driver [GH-1480]
* driver/docker: Fix improper gating on priviledged mode [GH-1506]
* driver/docker: Default network type is "nat" on Windows [GH-1521]
* driver/docker: Cleanup created volume when destroying container [GH-1519]
* driver/rkt: Set host environment variables [GH-1581]
* driver/rkt: Validate the command and trust_prefix configs [GH-1493]
* plan: Plan on system jobs discounts nodes that do not meet required
constraints [GH-1568]
## 0.4.0 (June 28, 2016)
__BACKWARDS INCOMPATIBILITIES:__
* api: Tasks are no longer allowed to have slashes in their name [GH-1210]
* cli: Remove the eval-monitor command. Users should switch to `nomad
eval-status -monitor`.
* config: Consul configuration has been moved from client options map to
consul block under client configuration
* driver/docker: Enabled SSL by default for pulling images from docker
registries. [GH-1336]
IMPROVEMENTS:
* core: Scheduler reuses blocked evaluations to avoid unbounded creation of
evaluations under high contention [GH-1199]
* core: Scheduler stores placement failures in evaluations, no longer
generating failed allocations for debug information [GH-1188]
* api: Faster JSON response encoding [GH-1182]
* api: Gzip compress HTTP API requests [GH-1203]
* api: Plan api introduced for the Job endpoint [GH-1168]
* api: Job endpoint can enforce Job Modify Index to ensure job is being
modified from a known state [GH-1243]
* api/client: Add resource usage APIs for retrieving tasks/allocations/host
resource usage [GH-1189]
* cli: Faster when displaying large amounts ouptuts [GH-1362]
* cli: Deprecate `eval-monitor` and introduce `eval-status` [GH-1206]
* cli: Unify the `fs` family of commands to be a single command [GH-1150]
* cli: Introduce `nomad plan` to dry-run a job through the scheduler and
determine its effects [GH-1181]
* cli: node-status command displays host resource usage and allocation
resources [GH-1261]
* cli: Region flag and environment variable introduced to set region
forwarding. Automatic region forwarding for run and plan [GH-1237]
* client: If Consul is available, automatically bootstrap Nomad Client
using the `_nomad` service in Consul. Nomad Servers now register
themselves with Consul to make this possible. [GH-1201]
* drivers: Qemu and Java can be run without an artifact being download. Useful
if the artifact exists inside a chrooted directory [GH-1262]
* driver/docker: Added a client options to set SELinux labels for container
bind mounts. [GH-788]
* driver/docker: Enabled SSL by default for pulling images from docker
registries. [GH-1336]
* server: If Consul is available, automatically bootstrap Nomad Servers
using the `_nomad` service in Consul. [GH-1276]
BUG FIXES:
* core: Improve garbage collection of allocations and nodes [GH-1256]
* core: Fix a potential deadlock if establishing leadership fails and is
retried [GH-1231]
* core: Do not restart successful batch jobs when the node is removed/drained
[GH-1205]
* core: Fix an issue in which the scheduler could be invoked with insufficient
state [GH-1339]
* core: Updated User, Meta or Resources in a task cause create/destroy updates
[GH-1128, GH-1153]
* core: Fix blocked evaluations being run without properly accounting for
priority [GH-1183]
* api: Tasks are no longer allowed to have slashes in their name [GH-1210]
* client: Delete tmp files used to communicate with execcutor [GH-1241]
* client: Prevent the client from restoring with incorrect task state [GH-1294]
* discovery: Ensure service and check names are unique [GH-1143, GH-1144]
* driver/docker: Ensure docker client doesn't time out after a minute.
[GH-1184]
* driver/java: Fix issue in which Java on darwin attempted to chroot [GH-1262]
* driver/docker: Fix issue in which logs could be spliced [GH-1322]
## 0.3.2 (April 22, 2016)
IMPROVEMENTS:
* core: Garbage collection partitioned to avoid system delays [GH-1012]
* core: Allow count zero task groups to enable blue/green deploys [GH-931]
* core: Validate driver configurations when submitting jobs [GH-1062, GH-1089]
* core: Job Deregister forces an evaluation for the job even if it doesn't
exist [GH-981]
* core: Rename successfully finished allocations to "Complete" rather than
"Dead" for clarity [GH-975]
* cli: `alloc-status` explains restart decisions [GH-984]
* cli: `node-drain -self` drains the local node [GH-1068]
* cli: `node-status -self` queries the local node [GH-1004]
* cli: Destructive commands now require confirmation [GH-983]
* cli: `alloc-status` display is less verbose by default [GH-946]
* cli: `server-members` displays the current leader in each region [GH-935]
* cli: `run` has an `-output` flag to emit a JSON version of the job [GH-990]
* cli: New `inspect` command to display a submitted job's specification
[GH-952]
* cli: `node-status` display is less verbose by default and shows a node's
total resources [GH-946]
* client: `artifact` source can be interpreted [GH-1070]
* client: Add IP and Port environment variables [GH-1099]
* client: Nomad fingerprinter to detect client's version [GH-965]
* client: Tasks can interpret Meta set in the task group and job [GH-985]
* client: All tasks in a task group are killed when a task fails [GH-962]
* client: Pass environment variables from host to exec based tasks [GH-970]
* client: Allow task's to be run as particular user [GH-950, GH-978]
* client: `artifact` block now supports downloading paths relative to the
task's directory [GH-944]
* docker: Timeout communications with Docker Daemon to avoid deadlocks with
misbehaving Docker Daemon [GH-1117]
* discovery: Support script based health checks [GH-986]
* discovery: Allowing registration of services which don't expose ports
[GH-1092]
* driver/docker: Support for `tty` and `interactive` options [GH-1059]
* jobspec: Improved validation of services referencing port labels [GH-1097]
* periodic: Periodic jobs are always evaluated in UTC timezone [GH-1074]
BUG FIXES:
* core: Prevent garbage collection of running batch jobs [GH-989]
* core: Trigger System scheduler when Node drain is disabled [GH-1106]
* core: Fix issue where in-place updated allocation double counted resources
[GH-957]
* core: Fix drained, batched allocations from being migrated indefinitely
[GH-1086]
* client: Garbage collect Docker containers on exit [GH-1071]
* client: Fix common exec failures on CentOS and Amazon Linux [GH-1009]
* client: Fix S3 artifact downloading with IAM credentials [GH-1113]
* client: Fix handling of environment variables containing multiple equal
signs [GH-1115]
## 0.3.1 (March 16, 2016)
__BACKWARDS INCOMPATIBILITIES:__
* Service names that dont conform to RFC-1123 and RFC-2782 will fail
validation. To fix, change service name to conform to the RFCs before
running the job [GH-915]
* Jobs that downloaded artifacts will have to be updated to the new syntax and
be resubmitted. The new syntax consolidates artifacts to the `task` rather
than being duplicated inside each driver config [GH-921]
IMPROVEMENTS:
* cli: Validate job file schemas [GH-900]
* client: Add environment variables for task name, allocation ID/Name/Index
[GH-869, GH-896]
* client: Starting task is retried under the restart policy if the error is
recoverable [GH-859]
* client: Allow tasks to download artifacts, which can be archives, prior to
starting [GH-921]
* config: Validate Nomad configuration files [GH-910]
* config: Client config allows reserving resources [GH-910]
* driver/docker: Support for ECR [GH-858]
* driver/docker: Periodic Fingerprinting [GH-893]
* driver/docker: Preventing port reservation for log collection on Unix platforms [GH-897]
* driver/rkt: Pass DNS information to rkt driver [GH-892]
* jobspec: Require RFC-1123 and RFC-2782 valid service names [GH-915]
BUG FIXES:
* core: No longer cancel evaluations that are delayed in the plan queue
[GH-884]
* api: Guard client/fs/ APIs from being accessed on a non-client node [GH-890]
* client: Allow dashes in variable names during interprelation [GH-857]
* client: Updating kill timeout adheres to operator specified maximum value [GH-878]
* client: Fix a case in which clients would pull but not run allocations
[GH-906]
* consul: Remove concurrent map access [GH-874]
* driver/exec: Stopping tasks with more than one pid in a cgroup [GH-855]
* client/executor/linux: Add /run/resolvconf/ to chroot so DNS works [GH-905]
## 0.3.0 (February 25, 2016)
__BACKWARDS INCOMPATIBILITIES:__
* Stdout and Stderr log files of tasks have moved from task/local to
alloc/logs [GH-851]
* Any users of the runtime environment variable `$NOMAD_PORT_` will need to
update to the new `${NOMAD_ADDR_}` varriable [GH-704]
* Service names that include periods will fail validation. To fix, remove any
periods from the service name before running the job [GH-770]
* Task resources are now validated and enforce minimum resources. If a job
specifies resources below the minimum they will need to be updated [GH-739]
* Node ID is no longer specifiable. For users who have set a custom Node
ID, the node should be drained before Nomad is updated and the data_dir
should be deleted before starting for the first time [GH-675]
* Users of custom restart policies should update to the new syntax which adds
a `mode` field. The `mode` can be either `fail` or `delay`. The default for
`batch` and `service` jobs is `fail` and `delay` respectively [GH-594]
* All jobs that interpret variables in constraints or driver configurations
will need to be updated to the new syntax which wraps the interpreted
variable in curly braces. (`$node.class` becomes `${node.class}`) [GH-760]
IMPROVEMENTS:
* core: Populate job status [GH-663]
* core: Cgroup fingerprinter [GH-712]
* core: Node class constraint [GH-618]
* core: User specifiable kill timeout [GH-624]
* core: Job queueing via blocked evaluations [GH-726]
* core: Only reschedule failed batch allocations [GH-746]
* core: Add available nodes by DC to AllocMetrics [GH-619]
* core: Improve scheduler retry logic under contention [GH-787]
* core: Computed node class and stack optimization [GH-691, GH-708]
* core: Improved restart policy with more user configuration [GH-594]
* core: Periodic specification for jobs [GH-540, GH-657, GH-659, GH-668]
* core: Batch jobs are garbage collected from the Nomad Servers [GH-586]
* core: Free half the CPUs on leader node for use in plan queue and evaluation
broker [GH-812]
* core: Seed random number generator used to randomize node traversal order
during scheduling [GH-808]
* core: Performance improvements [GH-823, GH-825, GH-827, GH-830, GH-832,
GH-833, GH-834, GH-839]
* core/api: System garbage collection endpoint [GH-828]
* core/api: Allow users to set arbitrary headers via agent config [GH-699]
* core/cli: Prefix based lookups of allocs/nodes/evals/jobs [GH-575]
* core/cli: Print short identifiers and UX cleanup [GH-675, GH-693, GH-692]
* core/client: Client pulls minimum set of required allocations [GH-731]
* cli: Output of agent-info is sorted [GH-617]
* cli: Eval monitor detects zero wait condition [GH-776]
* cli: Ability to navigate allocation directories [GH-709, GH-798]
* client: Batch allocation updates to the server [GH-835]
* client: Log rotation for all drivers [GH-685, GH-763, GH-819]
* client: Only download artifacts from http, https, and S3 [GH-841]
* client: Create a tmp/ directory inside each task directory [GH-757]
* client: Store when an allocation was received by the client [GH-821]
* client: Heartbeating and saving state resilient under high load [GH-811]
* client: Handle updates to tasks Restart Policy and KillTimeout [GH-751]
* client: Killing a driver handle is retried with an exponential backoff
[GH-809]
* client: Send Node to server when periodic fingerprinters change Node
attributes/metadata [GH-749]
* client/api: File-system access to allocation directories [GH-669]
* drivers: Validate the "command" field contains a single value [GH-842]
* drivers: Interpret Nomad variables in environment variables/args [GH-653]
* driver/rkt: Add support for CPU/Memory isolation [GH-610]
* driver/rkt: Add support for mounting alloc/task directory [GH-645]
* driver/docker: Support for .dockercfg based auth for private registries
[GH-773]
BUG FIXES:
* core: Node drain could only be partially applied [GH-750]
* core: Fix panic when eval Ack occurs at delivery limit [GH-790]
* cli: Handle parsing of un-named ports [GH-604]
* cli: Enforce absolute paths for data directories [GH-622]
* client: Cleanup of the allocation directory [GH-755]
* client: Improved stability under high contention [GH-789]
* client: Handle non-200 codes when parsing AWS metadata [GH-614]
* client: Unmounted of shared alloc dir when client is rebooted [GH-755]
* client/consul: Service name changes handled properly [GH-766]
* driver/rkt: handle broader format of rkt version outputs [GH-745]
* driver/qemu: failed to load image and kvm accelerator fixes [GH-656]
## 0.2.3 (December 17, 2015)
BUG FIXES:
* core: Task States not being properly updated [GH-600]
* client: Fixes for user lookup to support CoreOS [GH-591]
* discovery: Using a random prefix for nomad managed services [GH-579]
* discovery: De-Registering Tasks while Nomad sleeps before failed tasks are
restarted.
* discovery: Fixes for service registration when multiple allocations are bin
packed on a node [GH-583]
* configuration: Sort configuration files [GH-588]
* cli: RetryInterval was not being applied properly [GH-601]
## 0.2.2 (December 11, 2015)
IMPROVEMENTS:
* core: Enable `raw_exec` driver in dev mode [GH-558]
* cli: Server join/retry-join command line and config options [GH-527]
* cli: Nomad reports which config files are loaded at start time, or if none
are loaded [GH-536], [GH-553]
BUG FIXES:
* core: Send syslog to `LOCAL0` by default as previously documented [GH-547]
* client: remove all calls to default logger [GH-570]
* consul: Nomad is less noisy when Consul is not running [GH-567]
* consul: Nomad only deregisters services that it created [GH-568]
* driver/exec: Shutdown a task now sends the interrupt signal first to the
process before forcefully killing it. [GH-543]
* driver/docker: Docker driver no longer leaks unix domain socket connections
[GH-556]
* fingerprint/network: Now correctly detects interfaces on Windows [GH-382]
## 0.2.1 (November 28, 2015)
IMPROVEMENTS:
* core: Can specify a whitelist for activating drivers [GH-467]
* core: Can specify a whitelist for activating fingerprinters [GH-488]
* core/api: Can list all known regions in the cluster [GH-495]
* client/spawn: spawn package tests made portable (work on Windows) [GH-442]
* client/executor: executor package tests made portable (work on Windows) [GH-497]
* client/driver: driver package tests made portable (work on windows) [GH-502]
* client/discovery: Added more consul client api configuration options [GH-503]
* driver/docker: Added TLS client options to the config file [GH-480]
* jobspec: More flexibility in naming Services [GH-509]
BUG FIXES:
* core: Shared reference to DynamicPorts caused port conflicts when scheduling
count > 1 [GH-494]
* client/restart policy: Not restarting Batch Jobs if the exit code is 0 [GH-491]
* client/service discovery: Make Service IDs unique [GH-479]
* client/service: Fixes update to check definitions and services which are already registered [GH-498]
* driver/docker: Expose the container port instead of the host port [GH-466]
* driver/docker: Support `port_map` for static ports [GH-476]
* driver/docker: Pass 0.2.0-style port environment variables to the docker container [GH-476]
* jobspec: distinct_hosts constraint can be specified as a boolean (previously panicked) [GH-501]
## 0.2.0 (November 18, 2015)
__BACKWARDS INCOMPATIBILITIES:__
* core: HTTP API `/v1/node/<id>/allocations` returns full Allocation and not
stub [GH-402]
* core: Removed weight and hard/soft fields in constraints [GH-351]
* drivers: Qemu and Java driver configurations have been updated to both use
`artifact_source` as the source for external images/jars to be ran
* jobspec: New reserved and dynamic port specification [GH-415]
* jobspec/drivers: Driver configuration supports arbitrary struct to be
passed in jobspec [GH-415]
FEATURES:
* core: Blocking queries supported in API [GH-366]
* core: System Scheduler that runs tasks on every node [GH-287]
* core: Regexp, version and lexical ordering constraints [GH-271]
* core: distinctHost constraint ensures Task Groups are running on distinct
clients [GH-321]
* core: Service block definition with Consul registration [GH-463, GH-460,
GH-458, GH-455, GH-446, GH-425]
* client: GCE Fingerprinting [GH-215]
* client: Restart policy for task groups enforced by the client [GH-369,
GH-393]
* driver/rawexec: Raw Fork/Exec Driver [GH-237]
* driver/rkt: Experimental Rkt Driver [GH-165, GH-247]
* drivers: Add support for downloading external artifacts to execute for
Exec, Raw exec drivers [GH-381]
IMPROVEMENTS:
* core: Configurable Node GC threshold [GH-362]
* core: Overlap plan verification and plan application for increased
throughput [GH-272]
* cli: Output of `alloc-status` also displays task state [GH-424]
* cli: Output of `server-members` is sorted [GH-323]
* cli: Show node attributes in `node-status` [GH-313]
* client/fingerprint: Network fingerprinter detects interface suitable for
use, rather than defaulting to eth0 [GH-334, GH-356]
* client: Client Restore State properly reattaches to tasks and recreates
them as needed [GH-364, GH-380, GH-388, GH-392, GH-394, GH-397, GH-408]
* client: Periodic Fingerprinting [GH-391]
* client: Precise snapshotting of TaskRunner and AllocRunner [GH-403, GH-411]
* client: Task State is tracked by client [GH-416]
* client: Test Skip Detection [GH-221]
* driver/docker: Can now specify auth for docker pull [GH-390]
* driver/docker: Can now specify DNS and DNSSearch options [GH-390]
* driver/docker: Can now specify the container's hostname [GH-426]
* driver/docker: Containers now have names based on the task name. [GH-389]
* driver/docker: Mount task local and alloc directory to docker containers [GH-290]
* driver/docker: Now accepts any value for `network_mode` to support userspace networking plugins in docker 1.9
* driver/java: Pass JVM options in java driver [GH-293, GH-297]
* drivers: Use BlkioWeight rather than BlkioThrottleReadIopsDevice [GH-222]
* jobspec and drivers: Driver configuration supports arbitrary struct to be passed in jobspec [GH-415]
BUG FIXES:
* core: Nomad Client/Server RPC codec encodes strings properly [GH-420]
* core: Reset Nack timer in response to scheduler operations [GH-325]
* core: Scheduler checks for updates to environment variables [GH-327]
* cli: Fix crash when -config was given a directory or empty path [GH-119]
* client/fingerprint: Use correct local interface on OS X [GH-361, GH-365]
* client: Nomad Client doesn't restart failed containers [GH-198]
* client: Reap spawn-daemon process, avoiding a zombie process [GH-240]
* client: Resource exhausted errors because of link-speed zero [GH-146,
GH-205]
* client: Restarting Nomad Client leads to orphaned containers [GH-159]
* driver/docker: Apply SELinux label for mounting directories in docker
[GH-377]
* driver/docker: Docker driver exposes ports when creating container [GH-212,
GH-412]
* driver/docker: Docker driver uses docker environment variables correctly
[GH-407]
* driver/qemu: Qemu fingerprint and tests work on both windows/linux [GH-352]
## 0.1.2 (October 6, 2015)
IMPROVEMENTS:
* client: Nomad client cleans allocations on exit when in dev mode [GH-214]
* drivers: Use go-getter for artifact retrieval, add artifact support to
Exec, Raw Exec drivers [GH-288]
## 0.1.1 (October 5, 2015)
IMPROVEMENTS:
* cli: Nomad Client configurable from command-line [GH-191]
* client/fingerprint: Native IP detection and user specifiable network
interface for fingerprinting [GH-189]
* driver/docker: Docker networking mode is configurable [GH-184]
* drivers: Set task environment variables [GH-206]
BUG FIXES:
* client/fingerprint: Network fingerprinting failed if default network
interface did not exist [GH-189]
* client: Fixed issue where network resources throughput would be set to 0
MBits if the link speed could not be determined [GH-205]
* client: Improved detection of Nomad binary [GH-181]
* driver/docker: Docker dynamic port mapping were not being set properly
[GH-199]
## 0.1.0 (September 28, 2015)
* Initial release