open-nomad

Commit Graph

Author	SHA1	Message	Date
Diptanu Choudhury	7c61e115bd	Moved tlsutil into helpers	2016-10-25 16:05:37 -07:00
Diptanu Choudhury	353e7fc7f1	Moving the certs into tlsutil package	2016-10-25 16:01:53 -07:00
Diptanu Choudhury	cf35aeac84	Moving the TLSConfig to structs	2016-10-25 15:57:38 -07:00
Alex Dadgar	03eba049ed	Merge pull request #1848 from hashicorp/f-vault-error Thread through whether DeriveToken error is recoverable or not	2016-10-24 15:01:18 -07:00
Alex Dadgar	692a809919	Merge pull request #1842 from hashicorp/f-version-and-id Print the version and client node ID	2016-10-24 10:13:33 -07:00
Diptanu Choudhury	2e3118e69c	Implemented TLS support for http and rpc	2016-10-23 22:22:00 -07:00
Alex Dadgar	ede3a814ba	Small fixes	2016-10-22 18:20:50 -07:00
Alex Dadgar	0070178741	Thread through whether DeriveToken error is recoverable or not	2016-10-22 18:08:30 -07:00
Michael Schurter	285e80ac0f	Remove disk usage enforcement Many thanks to @iverberk for the original PR (#1609), but we ended up not wanting to ship this implementation with 0.5. We'll come back to it after 0.5 and hopefully find a way to leverage filesystem accounting and quotas, so we can skip the expensive polling.	2016-10-21 13:55:51 -07:00
Alex Dadgar	aa0d8d0d8d	Print the version and client node ID	2016-10-20 17:46:04 -07:00
Evan Phoenix	e7a98d5500	Make EvalSymlink errors more verbose	2016-10-12 17:07:21 -07:00
Evan Phoenix	f8a65a3b9d	Resolve alloc/state directories to make Docker For Mac happy * In -dev mode, `ioutil.TempDir` is used for the alloc and state directories. * `TempDir` uses `$TMPDIR`, which on OS X contains a per user directory which is under `/var/folder`. * `/var` is actually a symlink to `/private/var` * Docker For Mac validates the directories that are passed to bind and on OS X. That whitelist contains `/private`, but not `/var`. It does not expand the path, and so any paths in `$TMPDIR` fail the whitelist check. And thusly, by expanding the alloc/state directories the value passed for binding does contain `/private` and Docker For Mac is happy.	2016-10-12 17:06:25 -07:00
Michael Schurter	6dea6df919	Restore lost chan inits	2016-10-03 14:56:50 -07:00
Diptanu Choudhury	d50c395421	Getting snapshot of allocation from remote node (#1741) * Added the alloc dir move * Moving allocdirs when starting allocations * Added the migrate flag to ephemeral disk * Stopping migration if the allocation doesn't need migration any more * Added the GetAllocDir method * refactored code * Added a test for alloc runner * Incorporated review comments	2016-10-03 09:59:57 -07:00
Michael Schurter	b117725dc9	Only log consul errors once since last successful run	2016-09-28 17:18:45 -07:00
Michael Schurter	d486de3804	Remove unused const	2016-09-27 16:04:01 -07:00
Michael Schurter	2e696c5e61	Fix lies found in comments by fact checkers	2016-09-26 16:51:53 -07:00
Michael Schurter	11cf9686a6	No need to put reaper ticker on the struct	2016-09-26 16:15:19 -07:00
Michael Schurter	2eb0062959	Drop clumsy timeout on discovery notifications It's better to just let goroutines fall back to their longer retry intervals than try to be clever here.	2016-09-26 16:05:21 -07:00
Michael Schurter	307e674eca	Flip disco chan; clarify method names/comments	2016-09-26 15:52:40 -07:00
Michael Schurter	888ee21270	Return csv of servers from Stats, not just count	2016-09-26 15:40:26 -07:00
Michael Schurter	7dc0079dd2	doDisco -> triggerDiscoveryCh; discovered -> serversDiscoveredCh Also fix log line formatting	2016-09-26 15:21:28 -07:00
Michael Schurter	434e4be97c	noServers -> noServersErr	2016-09-26 15:12:35 -07:00
Michael Schurter	b2ddb85a78	consul -> Consul	2016-09-26 15:06:57 -07:00
Michael Schurter	37cfb2769c	Replace periodic handlers with event driven disco Remove use of periodic consul handlers in the client and just use goroutines. Consul Discovery is now triggered with a chan instead of using a timer and deadline to trigger. Once discovery is complete a chan is ticked so all goroutines waiting for servers will run. Should speed up bootstrapping and recovery while decreasing spinning on timers.	2016-09-23 17:02:48 -07:00
Michael Schurter	2ab5264595	Retry all servers on RPC call failure rpcproxy is refactored into serverlist which prioritizes good servers over servers in a remote DC or who have had a failure. Registration, heartbeating, and alloc status updating will retry faster when new servers are discovered. Consul discovery will be retried more quickly when no servers are available (eg on startup or an outage).	2016-09-23 11:44:48 -07:00
Alex Dadgar	50efdb00e9	Merge pull request #1713 from hashicorp/f-alloc-runner-vault Vault integration in client	2016-09-20 16:15:55 -07:00
Alex Dadgar	64de46432a	Merge pull request #1677 from hashicorp/f-vault-implicit-constraint Vault implicit Task Group constraint + allow root tokens	2016-09-20 16:15:32 -07:00
Alex Dadgar	ec152a6d12	Clean up vault client	2016-09-14 18:10:56 -07:00
Alex Dadgar	6702a29071	Vault token threaded	2016-09-14 13:30:01 -07:00
Robert Neumayer	8dc19dbd10	Log adding of servers at INFO level	2016-09-14 22:24:17 +02:00
Alex Dadgar	2c8dd8bbd3	Revert "Introduce a Secret/ directory"	2016-09-01 17:23:15 -07:00
Alex Dadgar	b0adaa5301	Allow root token	2016-09-01 12:05:08 -07:00
Alex Dadgar	1ed454dd60	Merge pull request #1671 from hashicorp/f-secret-dir2 Introduce a Secret/ directory	2016-09-01 09:56:17 -07:00
Alex Dadgar	9fa23e3536	Symlink on windows	2016-08-31 21:41:44 -07:00
Alex Dadgar	5d3b47e648	Address comments and reserve	2016-08-31 18:11:02 -07:00
vishalnayak	55a6f06e15	Addressed review feedback	2016-08-30 13:08:13 -04:00
vishalnayak	3808dd0ff8	Return only fatal error to renewal error channel	2016-08-30 12:46:59 -04:00
vishalnayak	a0dbfe25b3	Fix tests	2016-08-29 21:30:06 -04:00
vishalnayak	82f6209e97	tokenDeriver function pointer to derive tokens. Remove rpc*, connPool, node and region from vaultclient.	2016-08-29 20:32:05 -04:00
Alex Dadgar	14b7126511	Secret dir, hello world	2016-08-29 15:41:52 -07:00
vishalnayak	56e42cf03d	Employ DeriveVaultToken API and flesh-up DeriveToken	2016-08-24 12:29:59 -04:00
vishalnayak	6002e596c4	VaultClient for Nomad Client	2016-08-24 09:43:45 -04:00
Diptanu Choudhury	1e1eef56a1	Putting the mock driver behind a build flag	2016-08-22 15:02:28 -05:00
Diptanu Choudhury	4ca623bcfe	blocking chained allocations until previous allocation hasn't terminated	2016-08-22 11:34:24 -05:00
Alex Dadgar	a90dafe9ab	handle the upgrade case	2016-08-18 19:01:24 -07:00
Alex Dadgar	895c31f605	Nodes generate Secret ID and used for retrieving allocations and registering	2016-08-17 16:31:47 -07:00
Alex Dadgar	84820db86f	If the client detects that a heartbeat has failed because it is not registered, reregister	2016-08-15 17:24:09 -07:00
Diptanu Choudhury	28b3f511e0	Fixed some error messages	2016-08-10 15:17:32 -07:00
Kenjiro Nakayama	6a810e6f1e	Update after review	2016-08-09 08:57:26 +09:00
Kenjiro Nakayama	5c621b74e5	tiny: Return fmt.Errorf instead of duplicated error messages	2016-08-09 08:57:26 +09:00
Diptanu Choudhury	41b540fbc8	Allow operators to opt into publishing node and alloc metrics	2016-08-01 19:52:20 -07:00
Cameron Davison	777bdf4a1e	fix setup consul syncer error message	2016-07-28 22:14:52 -05:00
Alex Dadgar	ebac5cb283	Node.Register handles the case of transitioning to ready and creating evals	2016-07-21 15:22:02 -07:00
Diptanu Choudhury	5b39a5db40	Fixed a debug message	2016-07-09 00:12:53 -07:00
Sean Chittenden	03c571c61b	Consolidate fingerprinters into a single `map`.	2016-07-08 23:37:14 -07:00
Sean Chittenden	8bdb38d016	Code golf Pointed out by: @dadgar	2016-06-21 14:26:01 -07:00
Sean Chittenden	df4fe2e502	Fix the shuffling of remote datacenters. Pointed out by: @ryanuber	2016-06-21 13:37:22 -07:00
Sean Chittenden	9a60999100	Pass a logger arg to `NewClient` and `NewServer`	2016-06-16 23:29:23 -07:00
Sean Chittenden	fd18eb7fdb	Only register the Client services reaper when `consul.auto_advertise` is enabled	2016-06-16 18:24:58 -07:00
Sean Chittenden	952b6ce7b5	Only auto-join clients if `client_auto_join` is true	2016-06-16 14:47:21 -07:00
Sean Chittenden	af55b74114	Merge pull request #1276 from hashicorp/f-consul-server-autojoin Teach Nomad servers how to fall back to Consul.	2016-06-16 14:40:45 -07:00
Sean Chittenden	008d75184b	Use the `%+q` verb in log messages (vs `%q`).	2016-06-16 11:03:51 -07:00
Alex Dadgar	7375d828e1	remove trace	2016-06-15 15:47:59 -07:00
Sean Chittenden	5e0ced2ae7	Shuffle all datacenters vs only the nearest N datacenters. Per discussion, we want to be aggressive about fanning out vs possibly fixating on only local DCs. With RPC forwarding in place, a random walk may be less optimal from a network latency perspective, but it is guaranteed to eventually result in a converged state because all DCs are candidates during the bootstrapping process.	2016-06-15 12:40:51 -07:00
Sean Chittenden	2123460cf0	Bump various Consul search limits Client: Search limit increased from 4 random DCs to 8 random DCs, plus nearest. Server: Search factor increased from 3 to 5 times the bootstrap_expect. This should allow for faster convergence in large environments (e.g. sub-5min for 10K Consul DCs).	2016-06-15 12:40:51 -07:00
Alex Dadgar	cf99fc3173	Use Status.Peers instead of Status.Ping	2016-06-15 12:00:20 -07:00
Alex Dadgar	4b04e503f3	address comments	2016-06-13 17:32:18 -07:00
Alex Dadgar	8bbf4a55e5	Fix IDs and domain scoping	2016-06-13 16:30:58 -07:00
Diptanu Choudhury	d019d8ef8e	implemented reconciliation of unwanted services	2016-06-13 14:52:26 +02:00
Alex Dadgar	a82c2bb058	Do not reconcile in client and cleanup executor a bit	2016-06-12 18:22:07 -07:00
Alex Dadgar	8e231fa382	Rename ConsulService back to Service	2016-06-12 16:36:49 -07:00
Alex Dadgar	fdda90229f	only support latest and remove ring buffer	2016-06-12 09:32:38 -07:00
Alex Dadgar	e952540f6f	Allocation resources returned in a struct	2016-06-11 21:04:10 -07:00
Sean Chittenden	2f036231e5	Merge pull request #1201 from hashicorp/f-dyn-server-list Dynamic Server Lists/Client Bootstrapping via consul.	2016-06-11 18:58:25 -04:00
Sean Chittenden	92e2cfb0ad	Walk the DCs from nearest to most remote.	2016-06-11 18:52:21 -04:00
Sean Chittenden	2968545201	Walk the DCs from nearest to most remote, no limit on the search.	2016-06-11 18:23:06 -04:00
Sean Chittenden	917766a3df	Prefer `%+q` over `%q` in log messages.	2016-06-11 18:17:20 -04:00
Diptanu Choudhury	fd60cfd585	Emitting client resource usage metrics as gauges instead of k/v pairs	2016-06-11 22:17:32 +02:00
Sean Chittenden	bbd8dfa798	golint(1) compliance pass (e.g. Rpc* -> RPC)	2016-06-10 23:38:28 -04:00
Sean Chittenden	bc771d35df	Query for the Nomad service across multiple Consul datacenters.	2016-06-10 23:05:14 -04:00
Sean Chittenden	26b1e826d7	golint(1) police	2016-06-10 15:54:39 -04:00
Sean Chittenden	f139d0c68b	Properly guard consulPullHeartbeatDeadline behind heartbeatLock	2016-06-10 15:54:39 -04:00
Sean Chittenden	ed29946f5e	Populate the RPC Proxy's server list if heartbeat did not include a leader. It's possible that a Nomad Client is heartbeating with a Nomad server that has become isolated from the quorum of Nomad Servers. When 3x the heartbeatTTL has been exceeded, append the Consul server list to the primary server list. When the next RPCProxy rebalance occurs, there is a chance one of the servers discovered from Consul will be in the majority. When client reattaches to a Nomad Server in the majority, it will include a heartbeat and will reset the TTLs AND will clear the primary server list to include only values from the heartbeat.	2016-06-10 15:54:39 -04:00
Sean Chittenden	9a223936bb	Generate and sync Consul ServiceIDs consistently	2016-06-10 15:54:39 -04:00
Sean Chittenden	7956eb0c80	Rename structs.Task's `Service` attribute to `ConsulService`	2016-06-10 15:54:39 -04:00
Sean Chittenden	8c813630e6	Move package client/consul/sync to command/agent/consul. This has been done to allow the Server and Client to reuse the same Syncer because the Agent may be running Client, Server, or both simultaneously and we only want one Syncer object alive in the agent.	2016-06-10 15:54:39 -04:00
Sean Chittenden	fda03c5c9e	Change the signature of the PeriodicCallback to return an error I KNEW I should have done this when I wrote it, but didn't want to go back and audit the handlers to include the appropriate return handling, but now that the code is taking shape, make this change.	2016-06-10 15:54:39 -04:00
Sean Chittenden	555f4fe135	Change client/consul.NewSyncer() to accept a shutdown channel In addition to the API changing, consul.Syncer can now be signaled to shutdown via the Shutdown() method, which will call the Run()'ing sync task to exit gracefully.	2016-06-10 15:54:39 -04:00
Sean Chittenden	484816f5e0	Ensure that all accesses to Client.alloc are wrapped by allocLock.	2016-06-10 15:50:11 -04:00
Sean Chittenden	08cab4fdfa	Use client.getAllocRunners() where appropriate.	2016-06-10 15:50:11 -04:00
Sean Chittenden	f9d0b9da32	Line wrap long line.	2016-06-10 15:50:11 -04:00
Sean Chittenden	0d201631a3	Rename rpcproxy.UpdateFromNodeUpdateResponse to RefreshServerLists While breaking the API within this PR, break out the individual arguments to RefreshServerLists. The servers parameter is reusing `structs.NodeServerInfo` for the time being, but this can be revisited if the needs of the structure diverge in the future.	2016-06-10 15:50:11 -04:00
Sean Chittenden	0997fb1669	Fix up the comments Pointed out by: @dadgar	2016-06-10 15:50:11 -04:00
Sean Chittenden	aaa7d6bf40	Make the locking protocol more explicit in client.NewClient With an overabundance of caution, prevent future copy/pasta by using the right locks when bootstrapping a Client. Strictly speaking this is not necessary, but it makes explicit the locking semantics and guards against future concurrent or parallel initialization.	2016-06-10 15:50:11 -04:00
Sean Chittenden	525554c008	Use the client configCopy and lock appropriately.	2016-06-10 15:50:11 -04:00
Sean Chittenden	3060d6b33c	Flesh out the comment re: the client.rpcproxy.Run() task. Requested by: Alex	2016-06-10 15:50:11 -04:00
Sean Chittenden	b1ee131db8	Rename `backupServerDeadline` to `consulPullHeartbeatDeadline` Suggested by: @alex	2016-06-10 15:50:11 -04:00
Sean Chittenden	b9adfcecf5	Remove unused variable	2016-06-10 15:50:11 -04:00
Sean Chittenden	f15eeb8f27	Clean up some docs and comments to be more accurate	2016-06-10 15:50:11 -04:00


317 Commits