rocksdb

Go to file

Baptiste Lemaire 837705ad80 Make mempurge a background process (equivalent to in-memory compaction). (#8505 ) Summary: In https://github.com/facebook/rocksdb/issues/8454, I introduced a new process baptized `MemPurge` (memtable garbage collection). This new PR is built upon this past mempurge prototype. In this PR, I made the `mempurge` process a background task, which provides superior performance since the mempurge process does not cling on the db_mutex anymore, and addresses severe restrictions from the past iteration (including a scenario where the past mempurge was failling, when a memtable was mempurged but was still referred to by an iterator/snapshot/...). Now the mempurge process ressembles an in-memory compaction process: the stack of immutable memtables is filtered out, and the useful payload is used to populate an output memtable. If the output memtable is filled at more than 60% capacity (arbitrary heuristic) the mempurge process is aborted and a regular flush process takes place, else the output memtable is kept in the immutable memtable stack. Note that adding this output memtable to the `imm()` memtable stack does not trigger another flush process, so that the flush thread can go to sleep at the end of a successful mempurge. MemPurge is activated by making the `experimental_allow_mempurge` flag `true`. When activated, the `MemPurge` process will always happen when the flush reason is `kWriteBufferFull`. The 3 unit tests confirm that this process supports `Put`, `Get`, `Delete`, `DeleteRange` operators and is compatible with `Iterators` and `CompactionFilters`. Pull Request resolved: https://github.com/facebook/rocksdb/pull/8505 Reviewed By: pdillinger Differential Revision: D29619283 Pulled By: bjlemaire fbshipit-source-id: 8a99bee76b63a8211bff1a00e0ae32360aaece95		2021-07-09 17:23:59 -07:00
.circleci	Add micro-benchmark support (#8493 )	2021-07-08 18:22:45 -07:00
.github/workflows	Update clang-format-diff.py path (#7944 )	2021-02-09 12:49:38 -08:00
buckifier	Modify script which generates TARGETS (#8366 )	2021-06-04 16:28:59 -07:00
build_tools	Add micro-benchmark support (#8493 )	2021-07-08 18:22:45 -07:00
cache	Make SecondaryCache Customizable (#8480 )	2021-07-06 09:18:08 -07:00
cmake	Add `find_dependency()` in cmake config file. (#6791 )	2020-05-12 21:18:29 -07:00
coverage	Find the correct gcov (#6904 )	2020-06-01 16:33:05 -07:00
db	Make mempurge a background process (equivalent to in-memory compaction). (#8505 )	2021-07-09 17:23:59 -07:00
db_stress_tool	FaultInjectionTestFS::DeleteFilesCreatedAfterLastDirSync() to recover… (#8501 )	2021-07-07 16:23:23 -07:00
docs	Preset dictionary compression blog post (#8342 )	2021-05-31 21:31:13 -07:00
env	Add CreateFrom methods to Env/FileSystem (#8174 )	2021-06-15 03:43:48 -07:00
examples	make:Fix c header prototypes (#7994 )	2021-03-09 20:44:23 -08:00
file	Using existing crc32c checksum in checksum handoff for Manifest and WAL (#8412 )	2021-06-25 00:47:17 -07:00
fuzz	Remove Legacy and Custom FileWrapper classes from header files (#7851 )	2021-01-28 22:10:32 -08:00
hdfs	fix build with 'USE_HDFS' on windows (#6950 )	2020-06-12 16:21:50 -07:00
include/rocksdb	Add ribbon filter to C API (#8486 )	2021-07-09 16:22:48 -07:00
java	Added memtable garbage statistics (#8411 )	2021-06-18 04:57:27 -07:00
logging	Use SystemClock* instead of std::shared_ptr<SystemClock> in lower level routines (#8033 )	2021-03-15 04:34:11 -07:00
memory	Use thread-safe `strerror_r()` to get error message (#8087 )	2021-03-24 23:07:27 -07:00
memtable	Move slow valgrind tests behind -DROCKSDB_FULL_VALGRIND_RUN (#8475 )	2021-07-07 11:14:05 -07:00
microbench	Add micro-benchmark support (#8493 )	2021-07-08 18:22:45 -07:00
monitoring	Added memtable garbage statistics (#8411 )	2021-06-18 04:57:27 -07:00
options	Make SecondaryCache Customizable (#8480 )	2021-07-06 09:18:08 -07:00
plugin	Makefile support to statically link external plugin code (#7918 )	2021-02-10 08:35:34 -08:00
port	jemalloc_helper: Limit the mm_malloc.h hack to glibc on linux (#8425 )	2021-06-29 08:40:02 -07:00
table	Move slow valgrind tests behind -DROCKSDB_FULL_VALGRIND_RUN (#8475 )	2021-07-07 11:14:05 -07:00
test_util	Add CreateFrom methods to Env/FileSystem (#8174 )	2021-06-15 03:43:48 -07:00
third-party	Fix a compilation error in CircleCI vs2019 CXX20 (#8090 )	2021-03-23 10:28:04 -07:00
tools	Stress test to inject read failures in DB reopen (#8476 )	2021-07-06 11:05:27 -07:00
trace_replay	Trace MultiGet Keys and CF_IDs to the trace file (#8421 )	2021-06-18 15:04:05 -07:00
util	Add customizable_util.h to the public API (#8301 )	2021-06-29 09:08:57 -07:00
utilities	FaultInjectionTestFS::DeleteFilesCreatedAfterLastDirSync() to recover… (#8501 )	2021-07-07 16:23:23 -07:00
.clang-format	A script that automatically reformat affected lines	2014-01-14 12:21:24 -08:00
.gitignore	gitignore cmake-build-* for CLion integration (#7933 )	2021-02-19 13:43:15 -08:00
.lgtm.yml	Create lgtm.yml for LGTM.com C/C++ analysis (#4058 )	2018-06-26 12:43:04 -07:00
.travis.yml	Move arm build from travis to circleci (#8203 )	2021-04-19 20:07:02 -07:00
.watchmanconfig	Added .watchmanconfig file to rocksdb repo (#5593 )	2019-07-19 15:00:33 -07:00
AUTHORS	Update RocksDB Authors File	2017-10-18 14:42:10 -07:00
CMakeLists.txt	Add micro-benchmark support (#8493 )	2021-07-08 18:22:45 -07:00
CODE_OF_CONDUCT.md	Adopt Contributor Covenant	2019-08-29 23:21:01 -07:00
CONTRIBUTING.md	Add Code of Conduct	2017-12-05 18:42:35 -08:00
COPYING	Add GPLv2 as an alternative license.	2017-04-27 18:06:12 -07:00
DEFAULT_OPTIONS_HISTORY.md	options.delayed_write_rate use the rate of rate_limiter by default.	2017-05-24 09:58:24 -07:00
DUMP_FORMAT.md	First version of rocksdb_dump and rocksdb_undump.	2015-06-19 16:24:36 -07:00
HISTORY.md	Make SecondaryCache Customizable (#8480 )	2021-07-06 09:18:08 -07:00
INSTALL.md	Update installation instructions (#8158 )	2021-04-06 16:02:04 -07:00
LANGUAGE-BINDINGS.md	Add RestoreDBFromLatestBackup to C API, add new C# package (#7092 )	2020-07-08 11:56:41 -07:00
LICENSE.Apache	Change RocksDB License	2017-07-15 16:11:23 -07:00
LICENSE.leveldb	Add back the LevelDB license file	2017-07-16 18:42:18 -07:00
Makefile	Add micro-benchmark support (#8493 )	2021-07-08 18:22:45 -07:00
PLUGINS.md	Add ZenFS to plugin list (#8218 )	2021-04-22 11:12:40 -07:00
README.md	Fix the CI badge for ppc64le Jenkins (#7561 )	2020-10-16 09:00:56 -07:00
ROCKSDB_LITE.md	Fix some typos in comments and docs.	2018-03-08 10:27:25 -08:00
TARGETS	Add an internal iterator that can measure the inflow of blobs (#8443 )	2021-06-23 10:25:47 -07:00
USERS.md	Add Apache Doris to USERS (#7865 )	2021-01-19 15:31:56 -08:00
Vagrantfile	Adding CentOS 7 Vagrantfile & build script	2018-02-26 15:27:17 -08:00
WINDOWS_PORT.md	#5145 , rename port/dirent.h to port/port_dirent.h to avoid compile err when use port dir as header dir output (#5152 )	2019-04-04 11:38:19 -07:00
appveyor.yml	Remove 2019 from appveyor (#7038 )	2020-06-29 14:31:41 -07:00
defs.bzl	Make testpilot recognize that these tests have coverage instrumentation	2020-03-20 11:23:23 -07:00
issue_template.md	Add Google Group to Issue Template	2020-01-28 14:40:37 -08:00
src.mk	Add micro-benchmark support (#8493 )	2021-07-08 18:22:45 -07:00
thirdparty.inc	Fix build jemalloc api (#5470 )	2019-06-24 17:40:32 -07:00

README.md

RocksDB: A Persistent Key-Value Store for Flash and RAM Storage

RocksDB is developed and maintained by Facebook Database Engineering Team. It is built on earlier work on LevelDB by Sanjay Ghemawat (sanjay@google.com) and Jeff Dean (jeff@google.com)

This code is a library that forms the core building block for a fast key-value server, especially suited for storing data on flash drives. It has a Log-Structured-Merge-Database (LSM) design with flexible tradeoffs between Write-Amplification-Factor (WAF), Read-Amplification-Factor (RAF) and Space-Amplification-Factor (SAF). It has multi-threaded compactions, making it especially suitable for storing multiple terabytes of data in a single database.

Start with example usage here: https://github.com/facebook/rocksdb/tree/master/examples

See the github wiki for more explanation.

The public interface is in include/. Callers should not include or rely on the details of any other header files in this package. Those internal APIs may be changed without warning.

Design discussions are conducted in https://www.facebook.com/groups/rocksdb.dev/ and https://rocksdb.slack.com/

License

RocksDB is dual-licensed under both the GPLv2 (found in the COPYING file in the root directory) and Apache 2.0 License (found in the LICENSE.Apache file in the root directory). You may select, at your option, one of the above-listed licenses.