krad de85e4cadf Introduce WAL recovery consistency levels
Summary:
A "one size fits all" approach to WAL recovery will increasingly inconvenience our varied clients. The current recovery logic is somewhat heuristic. This change introduces the following consistency levels for replaying the WAL.

1. RecoverAfterRestart (kTolerateCorruptedTailRecords)

This mimics the current recovery mode.

2. RecoverAfterCleanShutdown (kAbsoluteConsistency)

This is ideal for unit tests and for cases where the store is shut down cleanly. We tolerate no corruption or incomplete writes.

3. RecoverPointInTime (kPointInTimeRecovery)

This is ideal when using devices with a controller cache or file systems that can lose data on restart. We recover up to the last point before any corruption or incomplete write.

4. RecoverAfterDisaster (kSkipAnyCorruptRecord)

This is the ideal mode for salvaging data. We tolerate corruption and incomplete writes, and we skip over the sections we cannot make sense of, salvaging as many records as possible.
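
To make the difference between the four levels concrete, here is a minimal, self-contained sketch of how each one might react to a damaged record during replay. It is a toy model, not the code from this change: `RecoveryLevel`, `LogRecord`, and `Replay` are hypothetical names, the `corrupt` flag stands in for a checksum failure or incomplete write, and the tail-tolerant case is simplified to "damage is accepted only if no valid record follows it", which is an assumption rather than something stated in the summary.

```cpp
#include <cstddef>
#include <stdexcept>
#include <string>
#include <vector>

// Hypothetical enum mirroring the four level names in the summary above.
enum class RecoveryLevel {
  kTolerateCorruptedTailRecords,
  kAbsoluteConsistency,
  kPointInTimeRecovery,
  kSkipAnyCorruptRecord,
};

// Toy log record: `corrupt` stands in for a failed checksum or torn write.
struct LogRecord {
  std::string payload;
  bool corrupt;
};

// Returns the payloads recovered from `log` under `level`.
// Throws std::runtime_error when the level does not tolerate the damage.
std::vector<std::string> Replay(const std::vector<LogRecord>& log,
                                RecoveryLevel level) {
  std::vector<std::string> recovered;
  for (std::size_t i = 0; i < log.size(); ++i) {
    if (!log[i].corrupt) {
      recovered.push_back(log[i].payload);
      continue;
    }
    switch (level) {
      case RecoveryLevel::kAbsoluteConsistency:
        // No corruption is tolerated anywhere in the log.
        throw std::runtime_error("corrupt record in WAL");
      case RecoveryLevel::kPointInTimeRecovery:
        // Stop at the first damaged record; keep everything before it.
        return recovered;
      case RecoveryLevel::kSkipAnyCorruptRecord:
        // Hop over the damaged record and keep salvaging.
        break;
      case RecoveryLevel::kTolerateCorruptedTailRecords:
        // Simplified assumption: damage is fine only at the tail,
        // i.e. when every remaining record is also damaged.
        for (std::size_t j = i + 1; j < log.size(); ++j) {
          if (!log[j].corrupt) {
            throw std::runtime_error("corruption before the tail");
          }
        }
        return recovered;
    }
  }
  return recovered;
}
```

For a log with records `a`, `b`, a corrupt `c`, and `d`, this sketch recovers `a, b` under point-in-time recovery and `a, b, d` under the skip level, and it reports an error under the other two levels because valid data follows the damage.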

Test Plan:
(1) Run the added unit tests, which cover all levels.
(2) Run make check.

Reviewers: leveldb, sdong, igor

Subscribers: yoshinorim, dhruba

Differential Revision: https://reviews.facebook.net/D38487
2015-06-22 15:28:12 -07:00
arcanist_util Integrate Jenkins with Phabricator 2015-04-07 11:56:29 -07:00
build_tools Move dockerbuild.sh to build_tools/ 2015-06-17 14:09:12 -07:00
coverage Fix coverage script 2014-11-03 14:53:00 -08:00
db Introduce WAL recovery consistency levels 2015-06-22 15:28:12 -07:00
doc Remove seek compaction 2014-06-20 10:23:02 +02:00
examples [API Change] Improve EventListener::OnFlushCompleted interface 2015-06-05 12:28:51 -07:00
hdfs Add Env::GetThreadID(), which returns the ID of the current thread. 2015-06-11 14:18:02 -07:00
include Introduce WAL recovery consistency levels 2015-06-22 15:28:12 -07:00
java Use CompactRangeOptions for CompactRange 2015-06-17 14:36:14 -07:00
port Build for CYGWIN 2015-04-23 21:33:44 -07:00
table Add TablePropertiesCollector::NeedCompact() to suggest DB to further compact output files 2015-06-05 20:18:21 -07:00
third-party Update COMMIT.md 2015-03-30 17:48:16 -07:00
tools First version of rocksdb_dump and rocksdb_undump. 2015-06-19 16:24:36 -07:00
util Introduce WAL recovery consistency levels 2015-06-22 15:28:12 -07:00
utilities Fixing valgrind error in checkpoint_test 2015-06-19 20:21:23 -07:00
.arcconfig Integrate Jenkins with Phabricator 2015-04-07 11:56:29 -07:00
.clang-format A script that automatically reformat affected lines 2014-01-14 12:21:24 -08:00
.gitignore First version of rocksdb_dump and rocksdb_undump. 2015-06-19 16:24:36 -07:00
.travis.yml Don't preinstall jemalloc in Travis 2015-04-24 18:43:07 -07:00
AUTHORS Add AUTHORS file. Fix #203 2014-09-29 10:52:18 -07:00
CONTRIBUTING.md facebook accounts are not required for CLA signers 2014-07-08 05:57:54 -04:00
DUMP_FORMAT.md First version of rocksdb_dump and rocksdb_undump. 2015-06-19 16:24:36 -07:00
HISTORY.md Fail DB::Open() when the requested compression is not available 2015-06-18 14:55:05 -07:00
INSTALL.md Fix broken gflags link 2015-06-22 09:31:52 -07:00
LICENSE Fix copyright year 2014-03-12 12:06:58 -07:00
Makefile Remove ldb_tests.py from make check until it is working again. 2015-06-19 17:41:49 -07:00
PATENTS Update Patent Grant. 2015-04-13 10:33:43 +01:00
README.md Replaced "built on on earlier work" by "built on earlier work" in README.md 2014-09-17 01:16:17 -07:00
ROCKSDB_LITE.md Optimistic Transactions 2015-05-29 14:36:35 -07:00
USERS.md Add Yahoo's blog post about Sherpa to USERS.md 2015-06-09 12:55:58 -07:00
Vagrantfile RocksDB on FreeBSD support 2015-02-26 15:19:17 -08:00
src.mk Add wal files to Checkpoint for multiple column families. 2015-06-19 16:08:31 -07:00

README.md

RocksDB: A Persistent Key-Value Store for Flash and RAM Storage


RocksDB is developed and maintained by the Facebook Database Engineering Team. It is built on earlier work on LevelDB by Sanjay Ghemawat (sanjay@google.com) and Jeff Dean (jeff@google.com).

This code is a library that forms the core building block for a fast key-value server, especially suited for storing data on flash drives. It has a Log-Structured-Merge-Database (LSM) design with flexible tradeoffs between Write-Amplification-Factor (WAF), Read-Amplification-Factor (RAF) and Space-Amplification-Factor (SAF). It has multi-threaded compactions, making it especially suitable for storing multiple terabytes of data in a single database.

Start with example usage here: https://github.com/facebook/rocksdb/tree/master/examples

See the GitHub wiki for more explanation.

The public interface is in include/. Callers should not include or rely on the details of any other header files in this package. Those internal APIs may be changed without warning.

Design discussions are conducted in https://www.facebook.com/groups/rocksdb.dev/