open-consul

Commit Graph

Author	SHA1	Message	Date
Paul Banks	ae5c0aad39	cache: Fix bug where connection errors can cause early cache expiry (#9979 ) Fixes a cache bug where TTL is not updated while a value isn't changing or cache entry is returning fetch errors.	2021-04-08 11:11:15 +01:00
Paul Banks	b61e00b772	cache: fix bug where TTLs were ignored leading to leaked memory in client agents (#9978) * Fix bug in cache where TTLs are effectively ignored This mostly affects streaming since streaming will immediately return from Fetch calls when the state is Closed on eviction which causes the race condition every time. However this also affects all other cache types if the fetch call happens to return between the eviction and the next time around the Get loop by any client. There is a separate bug that allows cache items to be evicted even when there are active clients which is the trigger here. * Add changelog entry * Update .changelog/9978.txt	2021-04-08 11:08:56 +01:00
Daniel Nephin	e47131bfe6	cache: log a warning when Cache.Notify handles an error Without these warnings, errors are silently ignored, which can make debugging problems more challenging.	2021-02-12 13:02:23 -05:00
Matt Keeler	19c99dc104	Stop background refresh of cached data for requests that result in ACL not found errors (#9738 )	2021-02-09 10:15:53 -05:00
Daniel Nephin	ef0999547a	testing: skip slow tests with -short Add a skip condition to all tests slower than 100ms. This change was made using `gotestsum tool slowest` with data from the last 3 CI runs of master. See https://github.com/gotestyourself/gotestsum#finding-and-skipping-slow-tests With this change: ``` $ time go test -count=1 -short ./agent ok github.com/hashicorp/consul/agent 0.743s real 0m4.791s $ time go test -count=1 -short ./agent/consul ok github.com/hashicorp/consul/agent/consul 4.229s real 0m8.769s ```	2020-12-07 13:42:55 -05:00
Hans Hasselberg	25f9e232af	add missing descriptions for metrics	2020-11-23 22:06:30 +01:00
Kit Patella	af719981f3	finish adding static server metrics	2020-11-13 16:26:08 -08:00
Daniel Nephin	09d62f1df0	lib/ttlcache: unexport key and additional godoc	2020-10-20 19:16:03 -04:00
Daniel Nephin	2601998766	lib/ttlcache: add a constant for NotIndexed	2020-10-20 19:10:20 -04:00
Daniel Nephin	0beaced90f	cache: fix a bug with Prepopulate Prepopulate was setting entry.Expiry.HeapIndex to 0. Previously this would result in a call to heap.Fix(0) which wasn't correct, but was also not really a problem because at worst it would re-notify. With the recent change to extract cachettl it was changed to call Update(idx), which would have updated the wrong entry. A previous commit removed the setting of entry.Expiry so that the HeapIndex would be reported as -1, and this commit adds a test and handles the -1 heap index.	2020-10-20 19:10:20 -04:00
Daniel Nephin	9d5b738cdb	lib/ttlcache: extract package from agent/cache	2020-10-20 19:10:20 -04:00
Daniel Nephin	909b8e674e	cache: export ExpiryHeap and hide internal methods on an unexported type, so that when it is extracted those methods are not exported.	2020-10-20 19:10:20 -04:00
Daniel Nephin	d3742a1d0e	cache: Refactor heap.notify to make it more explicit. And remove duplicate notifications. Instead of performing the check in the heap implementation, check the index in the higher level interface (Add,Remove,Update) and notify if one of the relevant indexes is 0.	2020-10-20 19:10:20 -04:00
Daniel Nephin	a96646c562	cache: Move more of the expiryLoop into the Heap	2020-10-20 19:10:20 -04:00
Daniel Nephin	b6f24c6554	cache: extract cache eviction heap Start creating an interface that doesn't require using heap and hides more of the entry internals.	2020-10-20 19:10:19 -04:00
Daniel Nephin	f857aef4a8	submatview: add a test for handling of NewSnapshotToFollow Also add some godoc Rename some vars and functions Fix a data race in the new cache test for entry closing.	2020-10-06 13:22:02 -04:00
Daniel Nephin	50846a96ff	cache-types: Update Streaming health cache-type To use latest protobuf types	2020-10-06 13:22:02 -04:00
Daniel Nephin	e5d37bdf23	agent/cache: Add cache-type and materialized view for streaming health Extracted from d97412ce4c399a35b41bbdae2716f0e32dce80bf Co-authored-by: Paul Banks <banks@banksco.de>	2020-10-06 13:21:57 -04:00
Daniel Nephin	845661c8af	Merge pull request #8548 from edevil/fix_flake Fix flaky TestACLResolver_Client/Concurrent-Token-Resolve	2020-08-28 15:10:55 -04:00
Pierre Souchay	ee50b55163	Added Unit test for cache reloading	2020-08-28 13:03:58 +02:00
Pierre Souchay	084d0e8015	Added `options.Equals()` and minor indentation fixes	2020-08-27 13:44:45 +02:00
Pierre Souchay	dd385f05e6	Ensure that Cache options are reloaded when `consul reload` is performed. This ensures that cache throttling parameters are properly applied: * cache.EntryFetchMaxBurst * cache.EntryFetchRate When values are updated, a log is displayed in info.	2020-08-24 23:33:10 +02:00
André Cruz	673bd69f36	Decrease test flakiness Fix flaky TestACLResolver_Client/Concurrent-Token-Resolve and TestCacheNotifyPolling	2020-08-24 20:30:02 +01:00
Hans Hasselberg	ffdc3057fe	agent/cache test for cache throttling. (#8396 )	2020-07-30 14:41:13 +02:00
Matt Keeler	2ec4e46eb2	Default Cache rate limiting options in New Also get rid of the TestCache helper which was where these defaults were happening previously.	2020-07-28 12:34:35 -04:00
Pierre Souchay	947d8eb039	Added ratelimit to handle throttling cache (#8226) This implements a solution for #7863 It does: Add a new config cache.entry_fetch_rate to limit the number of calls/s for a given cache entry, default value = rate.Inf Add cache.entry_fetch_max_burst size of rate limit (default value = 2) The new configuration now supports the following syntax for instance to allow 1 query every 3s: command line HCL: -hcl 'cache = { entry_fetch_rate = 0.333}' in JSON { "cache": { "entry_fetch_rate": 0.333 } }	2020-07-27 23:11:11 +02:00
Matt Keeler	6d94900cd7	Disable background cache refresh for Connect Leaf Certs The rationale behind removing them is that all of our own code (xDS, builtin connect proxy) use the cache notification mechanism. This ensures that the blocking fetch behind the scenes is always executing. Therefore the only way you might go to get a certificate and have to wait is when 1) the request has never been made for that cert before or 2) you are using the v1/agent/connect/ca/leaf API for retrieving the cert yourself. In the first case, the refresh change doesn’t alter the behavior. In the second case, it can be mitigated by using blocking queries with that API which just like normal cache notification mechanism will cause the blocking fetch to be initiated and to get leaf certs as soon as needed. If you are not using blocking queries, or Envoy/xDS, or the builtin connect proxy but are retrieving the certs yourself then the HTTP endpoint might take a little longer to respond. This also renames the RefreshTimeout field on the register options to QueryTimeout to more accurately reflect that it is used for any type that supports blocking queries.	2020-07-21 12:19:25 -04:00
Daniel Nephin	797abe1f00	agent/cache: Use AllowNotModifiedResponse in CatalogListServices Co-authored-by: Pierre Souchay <pierresouchay@users.noreply.github.com>	2020-07-14 18:58:20 -04:00
Daniel Nephin	8aa3335b22	agent/cache: Update some docstrings	2020-07-14 18:58:20 -04:00
Matt Keeler	976f922abf	Make the Agent Cache more Context aware (#8092) Blocking queries issued will still be uncancellable (that cannot be helped until we get rid of net/rpc). However this makes it so that if calling getWithIndex (like during a cache Notify goroutine) we can cancel the outer routine. Previously it would keep issuing more blocking queries until the result state actually changed.	2020-06-15 11:01:25 -04:00
Daniel Nephin	3114943f8d	agent/cache: remove error return from fetch A previous change removed the only error, so the return value can be removed now.	2020-04-17 11:55:01 -04:00
Daniel Nephin	d015d3c563	agent/cache: reduce function arguments by removing duplicates A few of the unexported functions in agent/cache took a large number of arguments. These arguments were effectively overrides for values that were provided in RequestInfo. By using a struct we can not only reduce the number of arguments, but also simplify the logic by removing the need for overrides.	2020-04-17 11:35:07 -04:00
Daniel Nephin	1251c01b73	agent/cache: Make all cache options RegisterOptions Previously the SupportsBlocking option was specified by a method on the type, and all the other options were specified from RegisterOptions. This change moves RegisterOptions to a method on the type, and moves SupportsBlocking into the options struct. Currently there are only 2 cache-types. So all cache-types can implement this method by embedding a struct with those predefined values. In the future if a cache type needs to be registered more than once with different options it can remove the embedded type and implement the method in a way that allows for parameterization.	2020-04-16 18:56:34 -04:00
Daniel Nephin	fb31212de7	Remove TTL from cacheEntryExpiry This should very slightly reduce the amount of memory required to store each item in the cache. It will also enable setting different TTLs based on the type of result. For example we may want to use a shorter TTL when the result indicates the resource does not exist, as storing these types of records could easily lead to a DOS caused by OOM.	2020-04-13 13:10:38 -04:00
Daniel Nephin	371cf05340	agent/cache: Reduce differences between notify implementations These two notify functions are very similar. There appear to be just enough differences that trying to parameterize the differences may not improve things. For now, reduce some of the cosmetic differences so that the material differences are more obvious.	2020-04-13 13:10:38 -04:00
Daniel Nephin	4d398d26ae	agent/cache: Inline the refresh function to make recursion more obvious fetch is already an exceptionally long function, but hiding the recursion in a function call likely does not help.	2020-04-13 13:10:38 -04:00
Daniel Nephin	98ef66e70a	agent/cache: Make the return values of getEntryLocked more obvious Use named returns so that the caller has a better idea of what these bools mean. Return early to reduce the scope, and make it more obvious what values are returned in which cases. Also reduces the number of conditional expressions in each case.	2020-04-13 13:10:38 -04:00
Daniel Nephin	cef60d1547	agent/cache: Small formatting improvements to improve readability Remove Cache.entryKey which called a single function. Format multiline struct creation one field per line.	2020-04-13 12:34:11 -04:00
Daniel Nephin	bf2a6452f1	Merge pull request #7596 from hashicorp/dnephin/agent-cache-type-entry agent/cache: move typeEntry lookup to the edge	2020-04-13 12:24:07 -04:00
Daniel Nephin	ab068325da	agent/cache: move typeEntry lookup to the edge This change moves all the typeEntry lookups to the first step in the exported methods, and makes unexported internals accept the typeEntry struct. This change is primarily intended to make it easier to extract the container of caches from the Cache type. It may incidentally reduce locking in fetch, but that was not a goal.	2020-04-03 16:01:56 -04:00
Pierre Souchay	984583d980	tests: more tolerance to latency for unstable test `TestCacheNotifyPolling()`. (#7574 )	2020-04-03 10:29:38 +02:00
R.B. Boyer	0e152672a1	avoid 'panic: Log in goroutine after TestCacheGet_refreshAge has completed' (#7276 )	2020-02-12 10:01:51 -06:00
Anthony Scalisi	4b92c2deee	fix spelling errors (#7135 )	2020-01-27 07:00:33 -06:00
R.B. Boyer	55fdae203f	agent: cache notifications work after error if the underlying RPC returns index=1 (#6547 ) Fixes #6521 Ensure that initial failures to fetch an agent cache entry using the notify API where the underlying RPC returns a synthetic index of 1 correctly recovers when those RPCs resume working. The bug in the Cache.notifyBlockingQuery used to incorrectly "fix" the index for the next query from 0 to 1 for all queries, when it should have not done so for queries that errored. Also fixed some things that made debugging difficult: - config entry read/list endpoints send back QueryMeta headers - xds event loops don't swallow the cache notification errors	2019-09-26 10:42:17 -05:00
Christian Muehlhaeuser	2602f6907e	Simplified code in various places (#6176 ) All these changes should have no side-effects or change behavior: - Use bytes.Buffer's String() instead of a conversion - Use time.Since and time.Until where fitting - Drop unnecessary returns and assignment	2019-07-20 09:37:19 -04:00
Freddy	a295d9e5db	Flaky test overhaul (#6100 )	2019-07-12 09:52:26 -06:00
Hans Hasselberg	73c4e9f07c	tls: auto_encrypt enables automatic RPC cert provisioning for consul clients (#5597 )	2019-06-27 22:22:07 +02:00
Matt Keeler	07f2854683	Fixes race condition in Agent Cache (#5796) * Fix race condition during a cache get Check the entry we pulled out of the cache while holding the lock had Fetching set. If it did then we should use the existing Waiter instead of calling fetch. The reason this is better than just calling fetch is that fetch re-gets the entry out of the entries map and the previous fetch may have finished. Therefore this prevents erroneously starting a new fetch because we just missed the last update. * Fix race condition fully The first commit still allowed for the following scenario: • No entry existing when checked in getWithIndex while holding the read lock • Then by the time we had reached fetch it had been created and finished. * always use ok when returning * comment mentioning the reading from entries. * use cacheHit consistently	2019-05-07 11:15:49 +01:00
Kyle Havlovitz	a113d8ca1f	Test an index=0 value in cache.Notify	2019-04-25 02:11:07 -07:00
Kyle Havlovitz	1fc96c770b	Make central service config opt-in and rework the initial registration	2019-04-24 06:11:08 -07:00


94 Commits