Mirror of https://github.com/facebook/rocksdb.git, synced 2024-11-30 13:41:46 +00:00
62fc15f009
Summary: add option `block_protection_bytes_per_key` and an implementation of block per key-value checksums. The main changes are:

1. checksum construction and verification in block.cc/h
2. passing the option `block_protection_bytes_per_key` around (mainly for methods defined in table_cache.h)
3. unit test and crash test updates

Tests:
* Added unit tests
* Crash test: `python3 tools/db_crashtest.py blackbox --simple --block_protection_bytes_per_key=1 --write_buffer_size=1048576`

Follow up (maybe as a separate PR): make sure corruption statuses returned from BlockIters are correctly handled.

Performance: Turning on block per KV protection has a non-trivial negative impact on read performance and costs additional memory. For memory, each block includes an additional 24 bytes of checksum-related state besides the checksum itself. For CPU, I set up a DB of size ~1.2GB with 5M keys (32-byte keys and 200-byte values) that compacts to ~5 SST files (target file size 256 MB) in L6 without compression. I tested readrandom performance with various block cache sizes (to mimic various cache hit rates):

```
SETUP
make OPTIMIZE_LEVEL="-O3" USE_LTO=1 DEBUG_LEVEL=0 -j32 db_bench
./db_bench -benchmarks=fillseq,compact0,waitforcompaction,compact,waitforcompaction -write_buffer_size=33554432 -level_compaction_dynamic_level_bytes=true -max_background_jobs=8 -target_file_size_base=268435456 --num=5000000 --key_size=32 --value_size=200 --compression_type=none

BENCHMARK
./db_bench --use_existing_db -benchmarks=readtocache,readrandom[-X10] --num=5000000 --key_size=32 --disable_auto_compactions --reads=1000000 --block_protection_bytes_per_key=[0|1] --cache_size=$CACHESIZE

The readrandom ops/sec looks like the following:

Block cache size:             2GB       1.2GB * 0.9   1.2GB * 0.8   1.2GB * 0.5   8MB
Main                          240805    223604        198176        161653        139040
PR prot_bytes=0               238691    226693        200127        161082        141153
PR prot_bytes=1               214983    193199        178532        137013        108211
prot_bytes=1 vs prot_bytes=0  -10%      -15%          -10.8%        -15%          -23%
```

The benchmark has a lot of variance, but there was a 5% to 25% regression in this benchmark with different cache hit rates.

Pull Request resolved: https://github.com/facebook/rocksdb/pull/11287

Reviewed By: ajkr

Differential Revision: D43970708

Pulled By: cbi42

fbshipit-source-id: ef98d898b71779846fa74212b9ec9e08b7183940
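For context, here is a minimal application-side sketch (not part of this PR's diff) of how the new protection might be enabled, assuming the option is exposed on `Options`/`ColumnFamilyOptions` under the same name as the `db_bench` flag `--block_protection_bytes_per_key`; the database path and chosen value are illustrative only.

```cpp
#include <cassert>
#include <iostream>
#include <string>

#include "rocksdb/db.h"
#include "rocksdb/options.h"

int main() {
  rocksdb::Options options;
  options.create_if_missing = true;

  // Enable block per key-value checksums: each key-value entry of a block
  // cached in the block cache is protected by a checksum of the given size.
  // 0 disables the feature; the crash test above exercises 1 byte per key.
  options.block_protection_bytes_per_key = 1;

  rocksdb::DB* db = nullptr;
  rocksdb::Status s =
      rocksdb::DB::Open(options, "/tmp/block_protection_demo", &db);
  assert(s.ok());

  s = db->Put(rocksdb::WriteOptions(), "key", "value");
  assert(s.ok());

  // Reads of cached blocks now go through the per key-value checksum
  // verification added in block.cc/h before the value is returned.
  std::string value;
  s = db->Get(rocksdb::ReadOptions(), "key", &value);
  assert(s.ok());
  std::cout << "read back: " << value << std::endl;

  delete db;
  return 0;
}
```

As the benchmark above shows, the extra verification trades roughly 5% to 25% of readrandom throughput (depending on cache hit rate) plus 24 bytes of per-block state for detection of in-memory corruption of cached blocks.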
advisor
block_cache_analyzer
dump
analyze_txn_stress_test.sh
auto_sanity_test.sh
backup_db.sh
benchmark.sh
benchmark_ci.py
benchmark_compare.sh
benchmark_leveldb.sh
blob_dump.cc
check_all_python.py
check_format_compatible.sh
CMakeLists.txt
db_bench.cc
db_bench_tool.cc
db_bench_tool_test.cc
db_crashtest.py
db_repl_stress.cc
db_sanity_test.cc
dbench_monitor
Dockerfile
generate_random_db.sh
ingest_external_sst.sh
io_tracer_parser.cc
io_tracer_parser_test.cc
io_tracer_parser_tool.cc
io_tracer_parser_tool.h
ldb.cc
ldb_cmd.cc
ldb_cmd_impl.h
ldb_cmd_test.cc
ldb_test.py
ldb_tool.cc
pflag
reduce_levels_test.cc
regression_test.sh
restore_db.sh
rocksdb_dump_test.sh
run_blob_bench.sh
run_flash_bench.sh
run_leveldb.sh
sample-dump.dmp
simulated_hybrid_file_system.cc
simulated_hybrid_file_system.h
sst_dump.cc
sst_dump_test.cc
sst_dump_tool.cc
trace_analyzer.cc
trace_analyzer_test.cc
trace_analyzer_tool.cc
trace_analyzer_tool.h
verify_random_db.sh
write_external_sst.sh
write_stress.cc
write_stress_runner.py