Merge pull request #196 from google/iterationdoc

Add section on iterations.
2016-04-20 08:31:33 -07:00 · 2016-04-20 08:31:33 -07:00 · 27f8a62313
parent 075a8a6fbf d6f96ed639
commit 27f8a62313
1 changed files with 45 additions and 41 deletions
--- a/README.md
+++ b/README.md
@ -1,5 +1,4 @@
-benchmark
+# benchmark
 =========
 [![Build Status](https://travis-ci.org/google/benchmark.svg?branch=master)](https://travis-ci.org/google/benchmark)
 [![Build status](https://ci.appveyor.com/api/projects/status/u0qsyp7t1tk7cpxs/branch/master?svg=true)](https://ci.appveyor.com/project/google/benchmark/branch/master)
 [![Coverage Status](https://coveralls.io/repos/google/benchmark/badge.svg)](https://coveralls.io/r/google/benchmark)
@ -10,10 +9,9 @@ Discussion group: https://groups.google.com/d/forum/benchmark-discuss
 IRC channel: https://freenode.net #googlebenchmark
-Example usage
+## Example usage
-------------
+### Basic usage
-Define a function that executes the code to be measured a
+Define a function that executes the code to be measured.
 specified number of times:
 ```c++
 static void BM_StringCreation(benchmark::State& state) {
@ -34,15 +32,16 @@ BENCHMARK(BM_StringCopy);
 BENCHMARK_MAIN();
 ```
-Sometimes a family of microbenchmarks can be implemented with
+### Passing arguments
-just one routine that takes an extra argument to specify which
+Sometimes a family of benchmarks can be implemented with just one routine that
-one of the family of benchmarks to run.  For example, the following
+takes an extra argument to specify which one of the family of benchmarks to
-code defines a family of microbenchmarks for measuring the speed
+run. For example, the following code defines a family of benchmarks for
-of `memcpy()` calls of different lengths:
+measuring the speed of `memcpy()` calls of different lengths:
 ```c++
 static void BM_memcpy(benchmark::State& state) {
-  char* src = new char[state.range_x()]; char* dst = new char[state.range_x()];
+  char* src = new char[state.range_x()];
  char* dst = new char[state.range_x()];
  memset(src, 'x', state.range_x());
  while (state.KeepRunning())
    memcpy(dst, src, state.range_x());
@ -54,18 +53,17 @@ static void BM_memcpy(benchmark::State& state) {
 BENCHMARK(BM_memcpy)->Arg(8)->Arg(64)->Arg(512)->Arg(1<<10)->Arg(8<<10);
 ```
-The preceding code is quite repetitive, and can be replaced with the
+The preceding code is quite repetitive, and can be replaced with the following
-following short-hand.  The following invocation will pick a few
+short-hand. The following invocation will pick a few appropriate arguments in
-appropriate arguments in the specified range and will generate a
+the specified range and will generate a benchmark for each such argument.
 microbenchmark for each such argument.
 ```c++
 BENCHMARK(BM_memcpy)->Range(8, 8<<10);
 ```
-You might have a microbenchmark that depends on two inputs.  For
+You might have a benchmark that depends on two inputs. For example, the
-example, the following code defines a family of microbenchmarks for
+following code defines a family of benchmarks for measuring the speed of set
-measuring the speed of set insertion.
+insertion.
 ```c++
 static void BM_SetInsert(benchmark::State& state) {
@ -88,19 +86,18 @@ BENCHMARK(BM_SetInsert)
    ->ArgPair(8<<10, 512);
 ```
-The preceding code is quite repetitive, and can be replaced with
+The preceding code is quite repetitive, and can be replaced with the following
-the following short-hand.  The following macro will pick a few
+short-hand. The following macro will pick a few appropriate arguments in the
-appropriate arguments in the product of the two specified ranges
+product of the two specified ranges and will generate a benchmark for each such
-and will generate a microbenchmark for each such pair.
+pair.
 ```c++
 BENCHMARK(BM_SetInsert)->RangePair(1<<10, 8<<10, 1, 512);
 ```
-For more complex patterns of inputs, passing a custom function
+For more complex patterns of inputs, passing a custom function to `Apply` allows
-to Apply allows programmatic specification of an
+programmatic specification of an arbitrary set of arguments on which to run the
-arbitrary set of arguments to run the microbenchmark on.
+benchmark. The following example enumerates a dense range on one parameter,
 The following example enumerates a dense range on one parameter,
 and a sparse range on the second.
 ```c++
@ -112,9 +109,10 @@ static void CustomArguments(benchmark::internal::Benchmark* b) {
 BENCHMARK(BM_SetInsert)->Apply(CustomArguments);
 ```
-Templated microbenchmarks work the same way:
+### Templated benchmarks
-Produce then consume 'size' messages 'iters' times
+Templated benchmarks work the same way: This example produces and consumes
-Measures throughput in the absence of multiprogramming.
+messages of size `sizeof(v)` `range_x` times. It also outputs throughput in the
 absence of multiprogramming.
 ```c++
 template <class Q> int BM_Sequential(benchmark::State& state) {
@ -145,11 +143,12 @@ Three macros are provided for adding benchmark templates.
 #define BENCHMARK_TEMPLATE2(func, arg1, arg2)
 ```
 ### Multithreaded benchmarks
 In a multithreaded test (benchmark invoked by multiple threads simultaneously),
 it is guaranteed that none of the threads will start until all have called
-KeepRunning, and all will have finished before KeepRunning returns false. As
+`KeepRunning`, and all will have finished before KeepRunning returns false. As
-such, any global setup or teardown you want to do can be
+such, any global setup or teardown can be wrapped in a check against the thread
-wrapped in a check against the thread index:
+index:
 ```c++
 static void BM_MultiThreaded(benchmark::State& state) {
@ -176,6 +175,7 @@ BENCHMARK(BM_test)->Range(8, 8<<10)->UseRealTime();
 Without `UseRealTime`, CPU time is used by default.
 ### Preventing optimisation
 To prevent a value or expression from being optimized away by the compiler
 the `benchmark::DoNotOptimize(...)` function can be used.
@ -190,8 +190,15 @@ static void BM_test(benchmark::State& state) {
 }
 ```
-Benchmark Fixtures
+## Controlling number of iterations
------------------
+In all cases, the number of iterations for which the benchmark is run is
 governed by the amount of time the benchmark takes. Concretely, the number of
 iterations is at least one, not more than 1e9, until CPU time is greater than
 the minimum time, or the wallclock time is 5x minimum time. The minimum time is
 set as a flag `--benchmark_min_time` or per benchmark by calling `MinTime` on
 the registered benchmark object.
 ## Fixtures
 Fixture tests are created by
 first defining a type that derives from ::benchmark::Fixture and then
 creating/registering the tests using the following macros:
@ -221,8 +228,7 @@ BENCHMARK_REGISTER_F(MyFixture, BarTest)->Threads(2);
 /* BarTest is now registered */
 ```
-Output Formats
+## Output Formats
 --------------
 The library supports multiple output formats. Use the
 `--benchmark_format=<tabular|json>` flag to set the format type. `tabular` is
 the default format.
@ -290,8 +296,7 @@ name,iterations,real_time,cpu_time,bytes_per_second,items_per_second,label
 "BM_SetInsert/1024/10",106365,17238.4,8421.53,4.74973e+06,1.18743e+06,
 ```
-Debug vs Release
+## Debug vs Release
 ----------------
 By default, benchmark builds as a debug library. You will see a warning in the output when this is the case. To build it as a release library instead, use:
 ```
@ -304,6 +309,5 @@ To enable link-time optimisation, use
 cmake -DCMAKE_BUILD_TYPE=Release -DBENCHMARK_ENABLE_LTO=true
 ```
-Linking against the library
+## Linking against the library
 ---------------------------
 When using gcc, it is necessary to link against pthread to avoid runtime exceptions. This is due to how gcc implements std::thread. See [issue #67](https://github.com/google/benchmark/issues/67) for more details.