benchmark

mirror of https://github.com/google/benchmark.git synced 2024-11-27 02:44:34 +00:00

Author	SHA1	Message	Date
Phoenix Meadowlark	a9b9471c02	Fix typo in invalid file name error message. (#1094 )	2021-02-22 09:55:07 +00:00
SSE4	d90321ff7a	- add support for Elbrus 2000 (e2k) (#1091 ) Signed-off-by: SSE4 <tomskside@gmail.com>	2021-02-14 17:45:57 +03:00
Michał Janiszewski	ea5a5bbff4	Add MSVC ARM64 support for reading clocks (#1052 ) Lacks CMake support, see https://github.com/google/benchmark/pull/1090	2021-02-12 13:30:26 +03:00
Scott K Logan	17a6b21ee1	Fix Range when starting at zero (#1073 ) The existing behavior results in the `0` value being added twice. Since `lo` is always added to `dst`, we never want to explicitly add `0` if `lo` is equal to `0`.	2020-11-26 11:12:45 +00:00
Steven Wan	d9abf01763	Rename 'mftbl' to 'mftb' (#1069 ) * Rename 'mftbl' to 'mftb' * Add my name to the contributor list	2020-11-03 09:08:46 +00:00
Abhina Sree	a9704c268d	Nanosleep workaround for z/OS in sleep.cc (#1067 ) * z/OS does not support nanosleep, add workaround to use sleep() and usleep() instead * change unsigned to int, and fix while loop	2020-10-29 08:49:02 +00:00
Fanbo Meng	dce3322a54	Add support for z/OS XL compiler inline asm syntax (#1063 ) On s390 architecture, z/OS XL compiler uses HLASM inline assembly, which has different syntax and needs to be distinguished to avoid compilation error.	2020-10-21 16:39:54 +01:00
Sergei Trofimovich	3d1c267768	src/benchmark_register.h: add missing <limits> inclusion (#1060 ) Noticed missing header when was building llvm with gcc-11: ``` llvm-project/llvm/utils/benchmark/src/benchmark_register.h:17:30: error: 'numeric_limits' is not a member of 'std' 17 \| static const T kmax = std::numeric_limits<T>::max(); \| ^~~~~~~~~~~~~~ ```	2020-10-15 11:12:40 +03:00
Michael Neumann	af72911f2f	Add support for DragonFly BSD (#1058 ) Without this commit, compilation fails on DragonFly with the following message: ``` /home/mneumann/Dev/benchmark.old/src/sysinfo.cc:446:2: error: #warning "HOST_NAME_MAX not defined. using 64" [-Werror=cpp] ^~~~~~~ ``` Also note that the sysctl is actually `hw.tsc_frequency` on DragonFly: ``` $ sysctl hw.tsc_frequency hw.tsc_frequency: 3498984022 ``` Tested on: ``` $ uname -a DragonFly box.localnet 5.9-DEVELOPMENT DragonFly v5.9.0.742.g4b29dd-DEVELOPMENT #5: Tue Aug 18 00:21:31 CEST 2020 ```	2020-10-12 23:41:49 +03:00
Min-Yih Hsu	ffe1342eb2	Add CycleTimer implementation for M68K architecture (#1050 ) As per discussions in here [1], LLVM is going to get backend support on Motorola 68000 series CPUs (a.k.a M68K or M680x0). So it's necessary to add CycleTimer implementation here, which is simply using `gettimeofday` same as MIPS. This fixes #1049 [1] https://reviews.llvm.org/D88389	2020-09-29 09:35:18 +03:00
Christian Wassermann	4857962394	Add CartesianProduct with associated test (#1029 ) * Add CartesianProduct with associated test * Use CartesianProduct in Ranges to avoid code duplication * Add new cartesian_product_test to CMakeLists.txt * Update AUTHORS & CONTRIBUTORS * Rename CartesianProduct to ArgsProduct * Rename test & fixture accordingly * Add example for ArgsProduct to README	2020-08-25 13:47:44 +01:00
Dominic Hamon	5b72b6c2da	Remove "BENCHMARK_" prefix from env var version of command line flags (#997 ) As noted in #995, this causes issues when the command line flag already starts with "benchmark_", which they all do. Not caught by tests as the test flags didn't start with "benchmark". Fixes #995	2020-08-18 10:02:20 +01:00
Dominic Hamon	1302d2ce09	Add missing breaks for QNX cache counting (#1012 )	2020-07-30 09:51:48 +01:00
Alexander Enaldiev	9901011880	JSONReporter: don't report on scaling if we didn't get it (#1005 ) (#1008 ) * JSONReporter: don't report on scaling if we didn't get it (#1005) * JSONReporter: fix due to review (std::pair<bool, bool> -> enum) * JSONReporter: scaling: fix the algo (due to review discussion) * benchmark.h: revert to old-fashioned enum's (C++03 compatibility); rreporter_output_test: let's skip scaling	2020-07-28 12:46:07 +01:00
Reid Paape	15e6dfd718	timers: silence strncat truncation warning (#984 )	2020-06-17 14:58:12 +03:00
Brian Wolfe	7cc06ef80c	timers: just make the buffers big enough	2020-06-15 16:16:19 -07:00
Brian Wolfe	f25ea40ae1	timers: use snprintf instead of sprintf	2020-06-15 14:16:20 -07:00
Brian Wolfe	f6ac240cd2	timers: silence format overflow warning	2020-06-15 14:02:15 -07:00
Brian Wolfe	99c52f1414	use rfc3339-formatted timestamps in output [output format change] (#965 ) * timestamp: use rfc3339-formatted timestamps in output Replace localized timestamps with machine-readable IETF RFC 3339 format timestamps. This is an attempt to make the output timestamps easily machine-readable. ISO8601 specifies standards for time interchange formats. IETF RFC 3339: https://tools.ietf.org/html/rfc3339 defines a subset of these for use in the internet. The general form for these timestamps is: YYYY-MM-DDTHH:mm:SS[+-]hhmm This replaces the localized time formats that are currently being used in the benchmark output to prioritize interchangeability and machine-readability. This might break existing programs that rely on the particular date-time format. This might also may make times less human readable. RFC3339 was intended to balance human readability and simplicity for machine readability, but it is primarily intended as an internal representation. * timers: remove utc string formatting We only ever need local time printing. Remove the UTC printing and cosnolidate the logic slightly. * timers: manually create rfc3339 string The C++ standard library does not output the time offset in RFC3339 format, it is missing the : between hours and minutes. VS does not appear to support timezone information by default. To avoid adding too much complexity to benchmark around timezone handling e.g. a full date library like https://github.com/HowardHinnant/date, we fall back to outputting GMT time with a -00:00 offset for those cases. * timers: use reentrant form for localtime_r & tmtime_r For non-windows, use the reentrant form for the time conversion functions. * timers: cleanup Use strtol instead of brittle moving characters around. * timers: only call strftime twice. Also size buffers to known maximum necessary size and name constants more appropriately. * timers: fix unused variable warning	2020-06-15 17:28:17 +01:00
Kai Wolf	56898e9a92	Add missing <cerrno> header include - fixes Android build (#960 ) * Add missing <cerrno> header This commit fixes a current build error on Android where 'errno' is an unidentified symbol due to a missing header * Update string_util.cc Conditionally adds <cerrno> if BENCHMARK_STL_ANDROID_GNUSTL is defined	2020-04-23 14:19:19 +03:00
Luís Marques	ecc1685340	Fix formatting issues introduced by `a77d5f7` (#959 )	2020-04-17 19:31:49 +03:00
Keith Moyer	8cead00783	Remove warnings for internal use of CSVReporter (#956 ) In a previous commit[1], diagnostic pragmas were used to avoid this warning. However, the incorrect warning flag was indicated, leaving the warning in place. -Wdeprecated is for deprecated features while -Wdeprecated-declarations for deprecated functions, variables, and types[2]. [1] `c408461983` [2] https://gcc.gnu.org/onlinedocs/gcc/Warning-Options.html	2020-04-14 10:20:22 +01:00
Luís Marques	a77d5f70ef	Fix cycleclock::Now for RISC-V and PPC (#955 ) Fixes the following issues with the implementation of `cycleclock::Now`: - The RISC-V implementation wouldn't compile due to a typo; - Both the PPC and RISC-V implementation's asm statements lacked the volatile keyword. This resulted in the repeated read of the counter's high part being optimized away, so overflow wasn't handled at all. Multiple counter reads could also be misoptimized, especially in LTO scenarios. - Relied on the zero/sign-extension of inline asm operands, which isn't guaranteed to occur and differs between compilers, namely GCC and Clang. The PowerPC64 implementation was improved to do a single 64-bit read of the time-base counter. The RISC-V implementation was improved to do the overflow handing in assembly, since Clang would generate a branch, defeating the purpose of the non-branching counter reading approach.	2020-04-10 17:02:13 +01:00
Dominic Hamon	0ab2c2906b	Fix type conversion warnings. (#951 ) * Fix type conversion warnings. Fixes #949 Tested locally (Linux/clang), but warnings are on MSVC so may differ. * Drop the ULP so the double test passes	2020-04-06 13:52:09 +01:00
Roman Lebedev	70d89ac519	Revert "Add d postfix to Debug libraries (#923 )" This reverts commit `5ce2429af7`. Reverts https://github.com/google/benchmark/pull/923 Reopens https://github.com/google/benchmark/issues/922 Fixes https://github.com/google/benchmark/issues/928 Closes https://github.com/google/benchmark/pull/930 See discussion in https://github.com/google/benchmark/issues/928 this broke pkg-config support, since there we don't account for the suffix, nor is it trivial to do so.	2020-03-16 14:29:27 +03:00
Paweł Bylica	c078337494	Relax CHECK condition in benchmark_runner.cc (#938 ) * Add State::error_occurred() * Relax CHECK condition in benchmark_runner.cc If the benchmark state contains an error, do not expect any iterations has been run. This allows using SkipWithError() and return early from the benchmark function. * README.md: document new possible usage of SkipWithError()	2020-02-21 17:53:25 +03:00
Ben Clayton	8982e1ee6a	Fix MSVC warning. (#935 ) This fixes the Visual Studio 2019 warning: `C4244: '=': conversion from 'int' to 'char', possible loss of data` When implicitly casting the return value of tolower() (int) to char. Fixes: #932	2020-02-08 02:48:01 +03:00
Jordan Williams	daff5fead3	Alias CMake Targets. Fixes #921 (#926 ) * add Jordan Williams to both CONTRIBUTORS and AUTHORS * alias benchmark libraries Provide aliased CMake targets for the benchmark and benchmark_main targets. The alias targets are namespaced under benchmark::, which is the namespace when they are exported. I chose not to use either the PROJECT_NAME or the namespace variable but to hard-code the namespace. This is because the benchmark and benchmark_main targets are hard-coded by name themselves. Hard-coding the namespace is also much cleaner and easier to read. * link to aliased benchmark targets It is safer to link against namespaced targets because of how CMake interprets the double colon. Typo's will be caught by CMake at configuration-time instead of during compile / link time. * document the provided alias targets * add "Usage with CMake" section in documentation This section covers linking against the alias/import CMake targets and including them using either find_package or add_subdirectory. * format the "Usage with CMake" README section Added a newline after the "Usage with CMake" section header. Dropped the header level of the section by one to make it a direct subsection of the "Usage" section. Wrapped lines to be no longer than 80 characters in length.	2020-01-14 23:21:24 +03:00
Szitár Gergő	5ce2429af7	Add d postfix to Debug libraries (#923 ) * Add DEBUG_POSTFIX to libraries. Makes it possible to install Debug and Release versions on the same system. Without this, there were only linker errors when using the wrong configuration. * Update CONTRIBUTORS and AUTHORS according to guide	2020-01-05 12:32:40 +00:00
Tetsuo Kiso	0811f1d782	Fix typo in mutex.h (#917 )	2019-12-15 13:38:54 +03:00
Roman Lebedev	367119482f	CPU caches are binary units, not SI. (#911 ) As disscussed in https://github.com/google/benchmark/issues/899, it is all but certain that the multiplier should be 1024, not 1000. Fixes https://github.com/google/benchmark/issues/899	2019-12-02 09:29:16 +00:00
Dominic Hamon	49aa79b635	update header guard to match style	2019-11-25 13:05:13 +00:00
Roman Lebedev	51d991f1d7	ParseCommandLineFlags(): do not dereference argc if it is null Higher up we dereference argc only if it is not null. But here we do no such check.	2019-11-23 00:23:11 +03:00
Roman Lebedev	c22c266eaf	JSONReporter: RoundDouble(): use std::lround() to round double to int From clang-tidy bugprone-incorrect-roundings check: > casting (double + 0.5) to integer leads to incorrect rounding; consider using lround (#include <cmath>) instead	2019-11-23 00:19:02 +03:00
Roman Lebedev	cc7f50e126	BenchmarkRunner: use std::lround() to round double to int From clang-tidy bugprone-incorrect-roundings check: > casting (double + 0.5) to integer leads to incorrect rounding; consider using lround (#include <cmath>) instead	2019-11-23 00:18:28 +03:00
Roman Lebedev	173aff82ee	src/counter.h: add header guard	2019-11-22 15:06:43 +03:00
András Leitereg	cf446a18bf	Remove superfluous cache line scaling in JSON reporter. (#896 ) Cache size is already stored in bytes.	2019-10-24 22:13:03 +03:00
Martin Blanchard	bc200ed8ee	Read options from environment (#881 ) (#883 ) Initialize option flags from environment variables values if they are defined, eg. `BENCHMARK_OUT=<filename>` for `--benchmark_out=<filename>`. Command line flag value always prevails. Fixes https://github.com/google/benchmark/issues/881.	2019-10-23 11:07:08 +03:00
Geoffrey Martin-Noble	b874e72208	Guard definition of __STDC_FORMAT_MACROS in ifndef (#875 ) This macro is sometimes already defined and redefining it results in build errors.	2019-09-23 10:53:09 +01:00
Geoffrey Martin-Noble	7411874d95	Define HOST_NAME_MAX for NaCl and RTEMS (#876 ) These OS's don't always have HOST_NAME_MAX defined, resulting in build errors. A few related changes as well: * Only define HOST_NAME_MAX if it's not already defined. There are some cases where this is already defined, e.g. with NaCl if __USE_POSIX is set. To avoid all of these, only define it if it's not already defined. * Default HOST_NAME_MAX to 64 and issue a #warning. Having the wrong max length is pretty harmless. The name just ends up getting truncated and this is only for printing debug info. Because we're constructing a std::string from a char[] (so defined length), we don't need to worry about gethostname's undefined behavior for whether the truncation is null-terminated when the hostname doesn't fit in HOST_NAME_MAX. Of course, this doesn't help people who have -Werror set, since they'll still get a warning.	2019-09-23 10:38:34 +01:00
Sayan Bhattacharjee	7ee72863fd	Remove unused `doc` argument from `DEFINE_` macros. (#857 ) - Adresses : #856 - The unused `doc` argument was removed from the `DEFINE_` macros in `commandlineflags.h` - Converted all the previous `doc` strings passed to the `DEFINE_` macros to multiline comments.	2019-08-21 14:12:03 -07:00
Roman Lebedev	7d97a057e1	Custom user counters: add invert modifier. (#850 ) While current counters can e.g. answer the question "how many items is processed per second", it is impossible to get it to tell "how many seconds it takes to process a single item". The solution is to add a yet another modifier `kInvert`, that is always considered last, which simply inverts the answer. Fixes #781, #830, #848.	2019-08-12 17:47:46 +03:00
Eric Fiselier	c408461983	Disable deprecated warnings when touching CSVReporter internally. The CSVReporter is deprecated, but we still need to reference it in a few places. To avoid breaking the build when warnings are errors, we need to disable the warning when we do so.	2019-08-07 15:55:40 -04:00
Roman Lebedev	8e48105d46	CMake; windows: link to lowercase 'shlwapi' - consistent with headers (#840 ) The filenames are consistently inconsistent in windows world, might have something to do with default file system being case-insensitive. While the native MinGW buils were fixed in `5261307982` that only addressed the headers, but not libraries. The problem remains when one tries to do a MinGW cross-build from case-sensitive filesystem.	2019-07-22 13:42:12 +01:00
Sam Elliott	4abdfbb802	Add RISC-V support in cycleclock::Now (#833 ) The RISC-V implementation of `cycleclock::Now` uses the user-space `rdcycle` instruction to query how many cycles have happened since the core started. The only complexity here is on 32-bit RISC-V, where `rdcycle` can only read the lower 32 bits of the 64-bit hardware counter. In this case, `rdcycleh` reads the higher 32 bits of the counter. We match the powerpc implementation to detect and correct for overflow in the high bits.	2019-07-05 09:28:17 +01:00
Orgad Shaneh	04a9343fc9	Make some functions const (#832 ) and ThreadManager ctor explicit. Reported by CppCheck.	2019-06-26 09:06:24 +01:00
Roman Lebedev	090faecb45	Use IterationCount in one more place Found in -UNDEBUG build	2019-05-13 22:42:18 +03:00
Roman Lebedev	f92903cc53	Iteration counts should be `uint64_t` globally. (#817 ) This is a shameless rip-off of https://github.com/google/benchmark/pull/646 I did promise to look into why that proposed PR was producing so much worse assembly, and so i finally did. The reason is - that diff changes `size_t` (unsigned) to `int64_t` (signed). There is this nice little `assert`: `7a1c370283/include/benchmark/benchmark.h (L744)` It ensures that we didn't magically decide to advance our iterator when we should have finished benchmarking. When `cached_` was unsigned, the `assert` was `cached_ UGT 0`. But we only ever get to that `assert` if `cached_ NE 0`, and naturally if `cached_` is not `0`, then it is bigger than `0`, so the `assert` is tautological, and gets folded away. But now that `cached_` became signed, the assert became `cached_ SGT 0`. And we still only know that `cached_ NE 0`, so the assert can't be optimized out, or at least it doesn't currently. Regardless of whether or not that is a bug in itself, that particular diff would have regressed the normal 64-bit systems, by halving the maximal iteration space (since we go from unsigned counter to signed one, of the same bit-width), which seems like a bug. And just so it happens, fixing this bug, fixes the other bug. This produces fully (bit-by-bit) identical state_assembly_test.s The filecheck change is actually needed regardless of this patch, else this test does not pass for me even without this diff.	2019-05-13 12:33:11 +03:00
Michał Janiszewski	b988639f31	Fix compilation for Android (#816 ) Android doesn't support `getloadavg`	2019-05-09 15:22:13 -07:00
Roman Lebedev	33d4404650	Don't read CMAKE_BUILD_TYPE if it is not there (#811 ) Weird, but seems consistent with the rest of cmake here.	2019-05-07 16:06:50 -07:00
Lockywolf	823d24630d	Add support for GNU Install Dirs from GNU Coding Standards. Fixes #807 (#808 ) * Add support for GNU Install Dirs from GNU Coding Standards * src/CMakeLists.txt: Added support for setting the standard variables, such as CMAKE_INSTALL_BINDIR. * Replace install destinations by the ones from GNU Coding Standards. * Set the default .cmake and .pc default path.	2019-05-01 09:13:33 +01:00
Dominic Hamon	13b8bdc2b5	Bump required cmake version from 2.x to 3.x (#801 )	2019-05-01 09:06:12 +01:00
Michael Tesch	588be0446a	escape special chars in csv and json output. (#802 ) * escape special chars in csv and json output. - escape \b,\f,\n,\r,\t,\," from strings before dumping them to json or csv. - also faithfully reproduce the sign of nan in json. this fixes github issue #745. * functionalize. * split string escape functions between csv and json * Update src/csv_reporter.cc Co-Authored-By: tesch1 <tesch1@gmail.com> * Update src/json_reporter.cc Co-Authored-By: tesch1 <tesch1@gmail.com>	2019-04-19 18:47:25 +01:00
Dominic Hamon	1d41de8463	Add command line flags tests (#793 ) Increase coverage	2019-04-17 17:08:52 +01:00
Hannes Hauswedell	415835e03e	fix master branch on BSD (#792 ) fix master branch on BSD add name to CONTRIBUTORS	2019-04-11 16:36:11 +01:00
Bryan Lunt	7a1c370283	Add process_time for better OpenMP and user-managed thread timing * Google Benchmark now works with OpenMP and other user-managed threading.	2019-04-09 13:01:33 +01:00
Daniel Harvey	e3666568a9	Negative ranges #762 (#787 ) * Add FIXME in multiple_ranges_test.cc * Improve handling of large bounds in AddRange. Due to breaking the loop too early, AddRange would miss a final multplier of 'mult' that was within the numeric range of T. * Enable negative values for Range argument Fixes #762. * Try to fix build of benchmark_gtest * Try some more to fix build * Attempt to fix format macros * Attempt to resolve format errors for mingw32 * Review feedback Put unit tests in benchmark::internal namespace Fix error reporting in multiple_ranges_test.cc	2019-03-26 10:50:53 +00:00
BaaMeow	478eafa36b	[JSON] add threads and repetitions to the json output (#748 ) * [JSON] add threads and repetitions to the json output, for better ide… [Tests] explicitly check for thread == 1 [Tests] specifically mark all repetition checks [JSON] add repetition_index reporting, but only for non-aggregates (i… * [Formatting] Be very, very explicit about pointer alignment so clang-format can not put pointers/references on the wrong side of arguments. [Benchmark::Run] Make sure to use explanatory sentinel variable rather than a magic number. * Do not pass redundant information	2019-03-26 09:53:07 +00:00
Michael Tesch	fae8726690	Replace JSON inf and nan with JS compliant Infinity and NaN	2019-03-19 10:12:54 +00:00
Daniel Harvey	f6e96861a3	BENCHMARK_CAPTURE() and Complexity() - naming problem (#761 ) Created BenchmarkName class which holds the full benchmark name and allows specifying and retrieving different components of the name (e.g. ARGS, THREADS etc.) Fixes #730.	2019-03-17 16:38:51 +03:00
Jilin Zhou	d205ead299	[#774 ] implement GetNumCPUs(), GetCPUCyclesPerSecond(), and GetCacheSizes() (#775 ) - On qnx platform, cpu and cache info is stored in a syspage struct which is different from other OS platform. - The fix has been verified on an aarch64 target running qnx 7.0. Fixes #774	2019-02-28 10:42:44 +00:00
Jilin Zhou	0ae233ab23	[#766 ] add x-compile support for QNX SDP7 (#770 ) Since googletest already supports x-compilation for QNX, it is nice to have google benchmark support it too. Fixes #766	2019-02-19 13:05:55 +00:00
Andriy Berestovskyy	4b9f43e2c4	Fix header lines length (#752 ) Commit `17a012d7` added a newline to the str, so the line built from str.length() is one character longer than it should be.	2019-01-13 17:26:49 +03:00
Eric	4528c76b71	Print at least three significant digits for times. (#701 ) Some benchmarks are particularly sensitive and they run in less than a nanosecond. In order for the console reporter to provide meaningful output for such benchmarks it needs to be able to display the times using more resolution than a single nanosecond. This patch changes the console reporter to print at least three significant digits for all results. Unlike the initial attempt, this patch does not align the decimal point.	2018-12-13 22:49:21 -05:00
Dominic Hamon	0ed529a7e3	Update documentation of benchmark_filter (#744 ) It should now match reality.	2018-12-13 11:14:50 +00:00
Jatin Chaudhary	47a5f77d75	#722 Adding Host Name in Reporting (#733 ) * Adding Host Name and test * Addressing Review Comments * Adding Test for JSON Reporter * Adding HOST_NAME_MAX for MacOS systems * Adding Explaination for MacOS HOST_NAME_MAX Addition * Addressing Peer Review Comments * Adding codecvt in windows header guard * Changing name SystemInfo and adding empty message incase host name fetch fails * Adding Comment on Struct SystemInfo	2018-12-11 11:23:02 +00:00
Tobias Ulvgård	1f3cba06e4	Update reference to complexity calculation (#723 )	2018-12-10 15:15:34 +00:00
Roman Lebedev	c9f2693ea9	StrFormat() is a printf-like function, mark it as such, fix fallout. (#727 ) Fixes #714.	2018-11-26 19:55:05 -05:00
Dominic Hamon	b5082bbd65	Merge branch 'report_loadavg' of https://github.com/atdt/benchmark into atdt-report_loadavg	2018-11-13 10:13:58 +00:00
Anton Gladky	c6193afe7e	Fix parsing of cpuinfo for s390 platform. (#712 ) s390 has another line structure for processor-field. It should be differently parsed.	2018-10-21 11:01:42 +03:00
Roman Lebedev	507c06e636	Aggregates: use non-aggregate count as iteration count. (#706 ) It is incorrect to say that an aggregate is computed over run's iterations, because those iterations already got averaged. Similarly, if there are N repetitions with 1 iterations each, an aggregate will be computed over N measurements, not 1. Thus it is best to simply use the count of separate reports. Fixes #586.	2018-10-18 17:17:14 +03:00
Roman Lebedev	99d1356c04	[NFC] BenchmarkRunner: always populate _report_aggregates_only bools. (#708 ) It is better to let the RunBenchmarks(), report() decide whether to actually only* output aggregates or not, depending on whether there are actually aggregates. It's subtle indeed. Previously, `BenchmarkRunner()` always said that "if there are no repetitions, then you should never output only the repetitions". And the `report()` simply assumed that the `report_aggregates_only` bool it received makes sense, and simply used it. Now, the logic is the same, but the blame has shifted. `BenchmarkRunner()` always propagates what those benchmarks would have wanted to happen wrt the aggregates. And the `report()` lambda has to actually consider both the `report_aggregates_only` bool, and it's meaningfulness. To put it in the context of the patch series - if the repetition count was `1`, but `_report_aggregates_only` was set to `true`, and we capture each iteration separately, then we will compute the aggregates, but then output everything, both the iteration, and aggregates, despite `_report_aggregates_only` being set to `true`.	2018-10-18 15:08:59 +03:00
Roman Lebedev	9cacec8e78	[NFC] RunBenchmarks(): s/has_repetitions/might_have_aggregates/ (#707 ) That is the real purpose of that bool. A follow-up change will make it consider something else other than repetitions.	2018-10-18 15:03:17 +03:00
Ilya A. Kriveshko	8503dfe537	benchmark_color: fix auto option (#559 ) (#699 ) As prevously written, "--benchmark_color=auto" was treated as true, because IsTruthyFlagValue("auto") returned true. The fix is to rely on IsColorTerminal test only if the flag value is "auto", and fall back to IsTruthyFlagValue otherwise. I also integrated force_no_color check into the same block.	2018-10-08 09:33:21 +01:00
Roman Lebedev	a8082de5df	[NFC] Refactor RunBenchmark() (#690 ) Ok, so, i'm still trying to get to the state when it will be a trivial change to report all the separate iterations. The old code (LHS of the diff) was rather convoluted i'd say. I have tried to refactor it a bit into small logical chunks, with proper comments. As far as i can tell, i preserved the intent of the code, what it was doing before. The road forward still isn't clear, but i'm quite sure it's not with the old code :)	2018-10-01 17:51:08 +03:00
Dominic Hamon	edc77a3669	Make State constructor private. (#650 ) The State constructor should not be part of the public API. Adding a utility method to BenchmarkInstance allows us to avoid leaking the RunInThread method into the public API.	2018-09-28 12:28:43 +01:00
Martin Storsjö	439d6b1c2a	Include sys/time.h for cycleclock.h when building on MinGW (#680 ) When building for ARM, there is a fallback codepath that uses gettimeofday, which requires sys/time.h. The Windows SDK doesn't have this header, but MinGW does have it. Thus, this fixes building for Windows on ARM with MinGW headers/libraries, while Windows on ARM with the Windows SDK still is broken.	2018-09-19 11:52:05 +01:00
Martin Storsjö	5261307982	[benchmark] Lowercase windows specific includes (#679 ) The windows SDK headers don't have self-consistent casing anyway, and many projects consistently use lowercase for them, in order to fix crosscompilation with mingw headers.	2018-09-18 09:42:20 +01:00
Roman Lebedev	1b44120cd1	Un-deprecate [SG]et{Item,Byte}sProcessed, re-implement as custom counters. (#676 ) As discussed with @dominichamon and @dbabokin, sugar is nice. Well, maybe not for the health, but it's sweet. Alright, enough puns. A special care needs to be applied not to break csv reporter. UGH. We end up shedding some code over this. We no longer specially pretty-print them, they are printed just like the rest of custom counters. Fixes #627.	2018-09-13 22:03:47 +03:00
Roman Lebedev	58588476ce	Track two more details about runs - the aggregate name, and run name. (#675 ) This is related to @BaaMeow's work in https://github.com/google/benchmark/pull/616 but is not based on it. Two new fields are tracked, and dumped into JSON: * If the run is an aggregate, the aggregate's name is stored. It can be RMS, BigO, mean, median, stddev, or any custom stat name. * The aggregate-name-less run name is additionally stored. I.e. not some name of the benchmark function, but the actual name, but without the 'aggregate name' suffix. This way one can group/filter all the runs, and filter by the particular aggregate type. I might need this for further tooling improvement. Or maybe not. But this is certainly worthwhile for custom tooling.	2018-09-13 15:08:15 +03:00
Roman Lebedev	c614dfc0d4	Display aggregates only. (#665 ) There is a flag `d9cab612e4/src/benchmark.cc (L75-L78)` and a call `d9cab612e4/include/benchmark/benchmark.h (L837-L840)` But that affects everything, every reporter, destination: `d9cab612e4/src/benchmark.cc (L316)` It would be quite useful to have an ability to be more picky. More specifically, i would like to be able to only see the aggregates in the on-screen output, but for the file output to still contain everything. The former is useful in case of a lot of repetition (or even more so if every iteration is reported separately), while the former is great for tooling. Fixes https://github.com/google/benchmark/issues/664	2018-09-12 16:26:17 +03:00
Roman Lebedev	f0901417c8	GetCacheSizesMacOSX(): use consistent types. (#667 ) I have absolutely no way to test this, but this looks obviously-good. This was reported by Tim Northover @TNorthover in http://lists.llvm.org/pipermail/llvm-commits/Week-of-Mon-20180903/584223.html > I think this breaks some 32-bit configurations (well, mine at least). > I was using Clang (from Xcode 10 beta) on macOS and got a bunch of > errors referencing sysinfo.cc:292 and onwards: > /Users/tim/llvm/llvm-project/llvm/utils/benchmark/src/sysinfo.cc:292:47: > error: non-constant-expression cannot be narrowed from type > 'std::__1::array<unsigned long long, 4>::value_type' (aka 'unsigned > long long') to 'size_t' (aka 'unsigned long') in initializer list > [-Wc++11-narrowing] > } Cases[] = {{"hw.l1dcachesize", "Data", 1, CacheCounts[1]}, > ^~~~~~~~~~~~~~ > > The same happens when self-hosting ToT. Unfortunately I couldn't > reproduce the issue on Debian (Clang 6.0.1) even with libc++; I'm not > sure what the difference is.	2018-09-05 12:20:18 +01:00
pseyfert	fbfc495d7f	add missing closing bracket in --help message (#666 )	2018-09-03 19:45:09 +03:00
Roman Lebedev	caa2fcb19c	Counter(): add 'one thousand' param. (#657 ) * Counter(): add 'one thousand' param. Needed for https://github.com/google/benchmark/pull/654 Custom user counters are quite custom. It is not guaranteed that the user always expects for these to have 1k == 1000. If the counter represents bytes/memory/etc, 1k should be 1024. Some bikeshedding points: 1. Is this sufficient, or do we really want to go full on into custom types with names? I think just the '1000' is sufficient for now. 2. Should there be a helper benchmark::Counter::Counter{1000,1024}() static 'constructor' functions, since these two, by far, will be the most used? 3. In the future, we should be somehow encoding this info into JSON. * Counter(): use std::pair<> to represent 'one thousand' * Counter(): just use a new enum with two values 1000 vs 1024. Simpler is better. If someone comes up with a real reason to need something more advanced, it can be added later on. * Counter: just store the 1000 or 1024 in the One_K values directly * Counter: s/One_K/OneK/	2018-08-29 21:11:06 +03:00
Roman Lebedev	d9cab612e4	[NFC] s/console_reporter/display_reporter/ (#663 ) There are two destinations: * display (console, terminal) and * file. And each of the destinations can be poplulated with one of the reporters: * console - human-friendly table-like display * json * csv (deprecated) So using the name console_reporter is confusing. Is it talking about the console reporter in the sense of table-like reporter, or in the sense of display destination?	2018-08-29 14:58:54 +03:00
Roman Lebedev	8688c5c4cf	Track 'type' of the run - is it an actual measurement, or an aggregate. (#658 ) This is only exposed in the JSON. Not in CSV, which is deprecated. This only supposed to track these two states. An additional field could later track which aggregate this is, specifically (statistic name, rms, bigo, ...) The motivation is that we already have ReportAggregatesOnly, but it affects the entire reports, both the display, and the reporters (json files), which isn't ideal. It would be very useful to have a 'display aggregates only' option, both in the library's console reporter, and the python tooling, This will be especially needed for the 'store separate iterations'.	2018-08-28 18:11:36 +03:00
Roman Lebedev	9a179cb93f	[NFC] Prefix "report(_)?mode" with Aggregation. (#656 ) This only specifically represents handling of reporting of aggregates. Not of anything else. Making it more specific makes the name less generic. This is an issue because i want to add "iteration report mode", so the naming would be conflicting.	2018-08-28 17:19:25 +03:00
BaaMeow	af441fc114	properly escape json names (#652 )	2018-08-16 09:47:09 -07:00
Kirill Bobyrev	f85304e4e3	Remove redundant default which causes failures (#649 ) * Remove redundant default which causes failures * Fix old GCC warnings caused by poor analysis * Use __builtin_unreachable * Use BENCHMARK_UNREACHABLE() * Pull __has_builtin to benchmark.h too * Also move compiler identification macro to main header * Move custom compiler identification macro back	2018-08-08 14:39:57 +01:00
Dominic Hamon	f965eab508	Memory management and reporting hooks (#625 ) * Introduce memory manager interface * Add memory stats to JSON reporter and a test * Add comments and switch json output test to int	2018-07-24 15:57:15 +01:00
Ori Livneh	da9ec3dfca	Include system load average in console and JSON reports High system load can skew benchmark results. By including system load averages in the library's output, we help users identify a potential issue in the quality of their measurements, and thus assist them in producing better (more reproducible) results. I got the idea for this from Brendan Gregg's checklist for benchmark accuracy (http://www.brendangregg.com/blog/2018-06-30/benchmarking-checklist.html).	2018-07-09 10:51:08 -04:00
Federico Ficarelli	0c21bc369a	Fix build with Intel compiler (#631 ) * Set -Wno-deprecated-declarations for Intel Intel compiler silently ignores -Wno-deprecated-declarations so warning no. 1786 must be explicitly suppressed. * Make std::int64_t → double casts explicit While std::int64_t → double is a perfectly conformant implicit conversion, Intel compiler warns about it. Make them explicit via static_cast<double>. * Make std::int64_t → int casts explicit Intel compiler warns about emplacing an std::int64_t into an int container. Just make the conversion explicit via static_cast<int>. * Cleanup Intel -Wno-deprecated-declarations workaround logic	2018-07-09 11:45:10 +01:00
Federico Ficarelli	5946795e82	Disable Intel invalid offsetof warning (#629 )	2018-07-03 10:13:22 +01:00
Roman Lebedev	b123abdcf4	Add Iteration-related Counter::Flags. Fixes #618 (#621 ) Inspired by these [two](`a1ebe07bea`) [bugs](`0891555be5`) in my code due to the lack of those i have found fixed in my code: * `kIsIterationInvariant` - `* state.iterations()` The value is constant for every iteration, and needs to be multiplied by the iteration count. * `kAvgIterations` - `/ state.iterations()` The is global over all the iterations, and needs to be divided by the iteration count. They play nice with `kIsRate`: * `kIsIterationInvariantRate` * `kAvgIterationsRate`. I'm not sure how meaningful they are when combined with `kAvgThreads`. I guess the `kIsThreadInvariant` can be added, too, for symmetry with `kAvgThreads`.	2018-06-27 15:45:30 +01:00
Marat Dukhan	505be96ab2	Avoid using CMake 3.6 feature list(FILTER ...) (#612 ) list(FILTER ...) is a CMake 3.6 feature, but benchmark targets CMake 2.8.12	2018-06-06 12:32:42 +01:00
Sergiu Deitsch	1301f53e31	cmake: use numeric version in package config (#611 )	2018-06-05 15:01:44 +01:00
Marat Dukhan	7fb3c564e5	Fix compilation on Android with GNU STL (#596 ) * Fix compilation on Android with GNU STL GNU STL in Android NDK lacks string conversion functions from C++11, including std::stoul, std::stoi, and std::stod. This patch reimplements these functions in benchmark:: namespace using C-style equivalents from C++03. * Avoid use of log2 which doesn't exist in Android GNU STL GNU STL in Android NDK lacks log2 function from C99/C++11. This patch replaces their use in the code with double log(double) function.	2018-06-05 11:36:26 +01:00
BaaMeow	4c2af07889	(clang-)format all the things (#610 ) * format all documents according to contributor guidelines and specifications use clang-format on/off to stop formatting when it makes excessively poor decisions * format all tests as well, and mark blocks which change too much	2018-06-01 11:14:19 +01:00
Dominic Hamon	4fbfa2f336	Some platforms and environments don't pass a valid argc/argv. (#607 ) Specifically some iOS targets.	2018-05-30 13:17:41 +01:00
Alex Strelnikov	e776aa0275	Add benchmark_main target. (#601 ) * Add benchmark_main library with support for Bazel. * fix newline at end of file * Add CMake support for benchmark_main. * Mention optionally using benchmark_main in README.	2018-05-25 11:18:58 +01:00
Nan Xiao	e90801ae47	Remove unnecessary memset functions. (#591 )	2018-05-09 10:31:24 +01:00
Sam Clegg	8986839e4a	Use __EMSCRIPTEN__ (rather then EMSCRIPTEN) to check for emscripten (#583 ) The old EMSCRIPTEN macro is deprecated and not enabled when EMCC_STRICT is set. Also fix a typo in EMSCRIPTN (not sure how this ever worked).	2018-05-03 09:34:26 +01:00
Nan Xiao	ea5551e7b3	Porting into OpenBSD (#582 )	2018-05-02 11:26:43 +01:00
Tim Bradgate	ed1bac8434	Issue 571: Allow support for negative regex filtering (#576 ) * Allow support for negative regex filtering This patch allows one to apply a negation to the entire regex filter by appending it with a '-' character, much in the same style as GoogleTest uses. * Address issues in PR * Add unit tests for negative filtering	2018-04-26 10:56:06 +01:00
Victor Costan	64d4805dd7	Fix precision loss warning in MSVC. (#574 )	2018-04-23 11:58:02 +01:00
Dominic Hamon	c4858d8012	Report the actual iterations run. (#572 ) Before this change, we would report the number of requested iterations passed to the state. After, we will report the actual number run. As a side-effect, instead of multiplying the expected iterations by the number of threads to get the total number, we can report the actual number of iterations across all threads, which takes into account the situation where some threads might run more iterations than others.	2018-04-19 18:40:08 +01:00
Dominic Hamon	64e5a13fa0	Ensure 64-bit truncation doesn't happen for complexity_n (#569 ) * Ensure 64-bit truncation doesn't happen for complexity results * One more complexity_n 64-bit fix * Missed another vector of int * Piping through the int64_t	2018-04-12 15:40:24 +01:00
Fred Tingaud	50ffc781b1	Optimize by using nth_element instead of partial_sort to find the median. (#565 )	2018-04-09 13:40:58 +01:00
Dominic Hamon	9913418d32	Allow AddRange to work with int64_t. (#548 ) * Allow AddRange to work with int64_t. Fixes #516 Also, tweak how we manage per-test build needs, and create a standard _gtest suffix for googletest to differentiate from non-googletest tests. I also ran clang-format on the files that I changed (but not the benchmark include or main src as they have too many clang-format issues). * Add benchmark_gtest to cmake * Set(Items\|Bytes)Processed now take int64_t	2018-04-03 23:12:47 +01:00
Dominic Hamon	df60aeb266	Rely on compiler intrinsics to identify regex engine. (#555 ) Having the copts set on a per-target level can lead to ODR violations in some cases. Avoid this by ensuring the regex engine is picked through compiler intrinsics in the header directly.	2018-03-23 11:45:15 +00:00
Eric Fiselier	e668e2a1ba	Fix #552 - GCC and Clang warn on possibly invalid offsetof usage. This patch disables the -Winvalid-offsetof warning for GCC and Clang when using it to check the cache lines of the State object. Technically this usage of offsetof is undefined behavior until C++17. However, all major compilers support this application as an extension, as demonstrated by the passing static assert (If a compiler encounters UB during evaluation of a constant expression, that UB must be diagnosed). Unfortunately, Clang and GCC also produce a warning about it. This patch temporarily suppresses the warning using #pragma's in the source file (instead of globally suppressing the warning in the build systems). This way the warning is ignored for both CMake and Bazel builds without having to modify either build system.	2018-03-21 13:47:25 -06:00
Dominic Hamon	674d0498b8	Move thread classes out to clean up monolithic code (#554 )	2018-03-16 10:14:38 +00:00
Wink Saville	61497236dd	Make string_util naming more consistent (#547 ) * Rename StringXxx to StrXxx in string_util.h and its users This makes the naming consistent within string_util and moves is the Abseil convention. * Style guide is 2 spaces before end of line "//" comments * Rename StrPrintF/StringPrintF to StrFormat for absl compatibility.	2018-03-07 11:20:06 +00:00
Wink Saville	f48a28d12a	Do not let StrCat be renamed to lstrcatA (#546 ) On Windows the Shlwapi.h file has a macro: #define StrCat lstrcatA And benchmark/src/string_util.h defines StrCat and it is renamed to lstrcatA if we don't undef the macro in Shlwapi.h. This is an innocuous bug if string_util.h is included after Shlwapi.h, but it is a compile error if string_util.h is included before Shlwapi.h. This fixes issue #545.	2018-03-06 18:15:03 +00:00
Wink Saville	69a52cff4f	Spelling fixes (#543 ) Upstream spelling fix changes from Pony, ec47ba8f565726414552f4bbf97d7, by ka7@la-evento.com that effected google/benchmark.	2018-03-06 11:44:25 +00:00
alekseyshl	47df49e573	Add Solaris support (#539 ) * Add Solaris support Define BENCHMARK_OS_SOLARIS for Solaris. Platform specific implementations added: * number of CPUs detection * CPU cycles per second detection * Thread CPU usage * Process CPU usage * Remove the special case for per process CPU time for Solaris, it's the same as the default.	2018-03-02 03:53:58 -08:00
Robert Guo	ff2c255af5	Use STCK to get the CPU clock on s390x (#540 )	2018-03-02 03:22:03 -08:00
Eric	56f52ee228	Print the executable name as part of the context. (#534 ) * Print the executable name as part of the context. A common use case of the library is to run two different versions of a benchmark to compare them. In my experience this often means compiling a benchmark twice, renaming one of the executables, and then running the executables back-to-back. In this case the name of the executable is important contextually information. Unfortunately the benchmark does not report this information. This patch adds the executable name to the context reported by the benchmark. * attempt to fix tests on Windows * attempt to fix tests on Windows	2018-02-21 08:43:57 -08:00
Ian McKellar	6ecf8a8e80	Don't include <sys/resource.h> on Fuchsia. (#531 ) * Don't include <sys/resource.h> on Fuchsia. It doesn't support POSIX resource measurement and timing APIs. Change-Id: Ifab4bac4296575f042c699db1ce5a4f7c2d82893 * Add BENCHMARK_OS_FUCHSIA for Fuchsia Change-Id: Ic536f9625e413270285fbfd08471dcb6753ddad1	2018-02-14 14:17:12 -07:00
Eric	207b9c7aec	Improve State packing: put important members on first cache line. (#527 ) * Improve State packing: put important members on first cache line. This patch does a few different things to ensure commonly accessed data is on the first cache line of the `State` object. First, it moves the `error_occurred_` member to reside after the `started_` and `finished_` bools, since there was internal padding there that was unused. Second, it moves `batch_leftover_` and `max_iterations` further up in the struct declaration. These variables are used in the calculation of `iterations()` which users might call within the loop. Therefore it's more important they exist on the first cache line. Finally, this patch turns the bool members into bitfields. Although this shouldn't have much of an effect currently, because padding is still needed between the last bool and the first size_t, it should help in future changes that require more "bool like" members. * Remove bitfield change for now * Move bools (and their padding) to end of "first cache line" vars. I think it makes the most sense to move the padding required following the group of bools to the end of the variables we want on the first cache line. This also means that the `total_iterations_` variable, which is the most accessed, has the same address as the State object. * Fix static assertion after moving bools	2018-02-14 13:44:41 -07:00
Samuel Panzer	296ec5693e	Support State::KeepRunningBatch(). (#521 ) * Support State::KeepRunningBatch(). State::KeepRunning() can take large amounts of time relative to quick operations (on the order of 1ns, depending on hardware). For such sensitive operations, it is recommended to run batches of repeated operations. This commit simplifies handling of total_iterations_. Rather than predecrementing such that total_iterations_ == 1 signals that KeepRunning() should exit, total_iterations_ == 0 now signals the intention for the benchmark to exit. * Create better fast path in State::KeepRunningBatch() * Replace int parameter with size_t to fix signed mismatch warnings * Ensure benchmark State has been started even on error. * Simplify KeepRunningBatch()	2018-02-09 21:57:04 -07:00
Dominic Hamon	df415adb2a	Some small clang-tidy fixes (#520 )	2018-01-29 08:38:47 -08:00
oskidan	4fe0206b65	Fixes compilation error caused by integer precision loss due to implicit (#518 ) conversion in sysinfo.cc	2018-01-19 09:17:01 -08:00
Dominic Hamon	9f5694ceb6	Wrap COMPILER macros. (#514 ) Some command line or build systems may already set these (eg, bazel) so make sure that takes priority. Fixes #513	2018-01-11 17:22:45 -08:00
Victor Costan	95a1435b81	Fix compilation error with GCC on OSX (issue #490 ). (#491 )	2017-11-30 08:05:38 -08:00
Roman Lebedev	ec5684ed75	Console reporter: properly account for the lenght of custom counter names (#484 ) Old output example: ``` Benchmark Time CPU Iterations CPUTime,s Pixels/s ThreadingFactor ------------------------------------------------------------------------------------------------------------------------------ 20170525_0036TEST.RAF/threads:8/real_time 45 ms 45 ms 16 0.718738 79.6277M/s 0.999978 2.41419GB/s 22.2613 items/s FileSize,MB=111.050781; MPix=57.231360 ``` New output example: ``` Benchmark Time CPU Iterations CPUTime,s Pixels/s ThreadingFactor ------------------------------------------------------------------------------------------------------------------------------ 20170525_0036TEST.RAF/threads:8/real_time 45 ms 45 ms 16 0.713575 80.1713M/s 0.999571 2.43067GB/s 22.4133 items/s FileSize,MB=111.050781; MPix=57.231360 ```	2017-11-27 09:01:01 -08:00
Eric Fiselier	2ec7399cf1	Improve BENCHMARK_UNREACHABLE() implementation. This patch primarily changes the BENCHMARK_UNREACHABLE() implementation under MSVC to use __assume(false) instead of being a NORETURN function, which ironically caused unreachable code warnings. Second, since the NOTHROW function attempt generated the warnings we meant to avoid, it has been replaced with a dummy null statement.	2017-11-26 13:58:24 -07:00
Eric	11dc36822b	Improve CPU Cache info reporting -- Add Windows support. (#486 ) * Improve CPU Cache info reporting -- Add Windows support. This patch does a couple of thing regarding CPU Cache reporting. First, it adds an implementation on Windows. Second it fixes the JSONReporter to correctly (and actually) output the CPU configuration information. And finally, third, it detects and reports the number of physical CPU's that share the same cache.	2017-11-26 13:33:01 -07:00
Eric	27e0b439cf	Refactor System information collection -- Add CPU Cache Info (#483 ) * Refactor System information collection. This patch refactors the system information collection, and in particular information about the target CPU. The motivation is to make it easier to access CPU information, and easier to add new information as need be. This patch additionally adds information about the cache sizes of the CPU. * Address review comments: Clean up integer types. This commit cleans up the integer types used in ValueUnion to follow the Google style guide. Additionally it adds a BENCHMARK_UNREACHABLE macro to assist in documenting/catching unreachable code paths. * Rename ValueUnion accessors.	2017-11-22 08:33:52 -08:00
Kamil Rytarowski	aad6a5fa76	Add NetBSD support (#482 ) Define BENCHMARK_OS_NETBSD for NetBSD. Add detection of cpuinfo_cycles_per_second and cpuinfo_num_cpus. This code shared detection of these properties with FreeBSD.	2017-11-17 08:46:08 -08:00
Steinar H. Gunderson	0c3ec998c4	Add a pkg-config file, for the benefit of projects not using CMake. (#480 )	2017-11-15 11:51:22 -08:00
Eric	72a4581caf	Fix #382 - MinGW often reports negative CPU times. (#475 ) When stopping a timer, the current time is subtracted from the start time. However, when the times are identical, or sufficiently close together, the subtraction can result in a negative number. For some reason MinGW is the only platform where this problem manifests. I suspect it's due to MinGW specific behavior in either the CPU timing code, floating point model, or printf formatting. Either way, the fix for MinGW should be correct across all platforms.	2017-11-07 09:44:39 -08:00
Dominic Hamon	f65c6d9a2c	Remove deprecated headers (#473 )	2017-11-06 08:53:23 -08:00
Yangqing Jia	491360b833	Add option to install benchmark (#463 ) * Add option to install benchmark * Change to BENCHMARK_ENABLE_INSTALL per @dominichamon	2017-10-20 13:49:37 -07:00
Eric	a37fc0c48a	Improve KeepRunning loop performance to be similar to the range-based for. (#460 ) This patch improves the performance of the KeepRunning loop in two ways: (A) it removes the dependency on the max_iterations variable, preventing it from being loaded every iteration. (B) it loops to zero, instead of to an upper bound. This allows a single decrement instruction to be used instead of a arithmetic op followed by a comparison.	2017-10-17 08:40:44 -07:00
Raúl Marín	cacd321808	Avoid implicit float to double conversion (#457 ) Triggered by -Werror=double-promotion	2017-10-13 09:17:02 -07:00
Dominic Hamon	2409cb2eb1	Minor move of code to cleanup up namespace spaghetti a bit	2017-10-09 12:01:30 -07:00
Roman Lebedev	a271c36af9	Drop Stat1, refactor statistics to be user-providable, add median. (#428 ) * Drop Stat1, refactor statistics to be user-providable, add median. My main goal was to add median statistic. Since Stat1 calculated the stats incrementally, and did not store the values themselves, it is was not possible. Thus, i have replaced Stat1 with simple std::vector<double>, containing all the values. Then, i have refactored current mean/stdev to be a function that is provided with values vector, and returns the statistic. While there, it seemed to make sense to deduplicate the code by storing all the statistics functions in a map, and then simply iterate over it. And the interface to add new statistics is intentionally exposed, so they may be added easily. The notable change is that Iterations are no longer displayed as 0 for stdev. Is could be changed, but i'm not sure how to nicely fit that into the API. Similarly, this dance about sometimes (for some fields, for some statistics) dividing by run.iterations, and then multiplying the calculated stastic back is also dropped, and if you do the math, i fail to see why it was needed there in the first place. Since that was the only use of stat.h, it is removed. * complexity.h: attempt to fix MSVC build * Update README.md * Store statistics to compute in a vector, ensures ordering. * Add a bit more tests for repetitions. * Partially address review notes. * Fix gcc build: drop extra ';' clang, why didn't you warn me? * Address review comments. * double() -> 0.0 * early return	2017-08-23 16:44:29 -07:00
Dominic Hamon	d70417994a	Allow the definition of 1k to be flexible. (#438 ) When generating a human-readable number for user counters, we don't generally expect 1k to be 1024. This is the default due to the more general purpose string utility. Fixes #437	2017-08-21 16:05:24 -07:00
Roman Lebedev	b9be142d1e	Json reporter: don't cast floating-point to int; adjust tooling (#426 ) * Json reporter: passthrough fp, don't cast it to int; adjust tooling Json output format is generally meant for further processing using some automated tools. Thus, it makes sense not to intentionally limit the precision of the values contained in the report. As it can be seen, FormatKV() for doubles, used %.2f format, which was meant to preserve at least some of the precision. However, before that function is ever called, the doubles were already cast to the integer via RoundDouble()... This is also the case for console reporter, where it makes sense because the screen space is limited, and this reporter, however the CSV reporter does output some( decimal digits. Thus i can only conclude that the loss of the precision was not really considered, so i have decided to adjust the code of the json reporter to output the full fp precision. There can be several reasons why that is the right thing to do, the bigger the time_unit used, the greater the precision loss, so i'd say any sort of further processing (like e.g. tools/compare_bench.py does) is best done on the values with most precision. Also, that cast skewed the data away from zero, which i think may or may not result in false- positives/negatives in the output of tools/compare_bench.py * Json reporter: FormatKV(double): address review note * tools/gbench/report.py: skip benchmarks with different time units While it may be useful to teach it to operate on the measurements with different time units, which is now possible since floats are stored, and not the integers, but for now at least doing such a sanity-checking is better than providing misinformation.	2017-07-24 16:13:55 -07:00
Dominic Hamon	5b7683f49e	more clang tidy cleanups (#417 )	2017-07-15 00:21:20 +02:00
Dominic Hamon	e8fc2a2b8c	Google-style cleanups (#416 )	2017-07-13 18:33:43 +02:00
Tom Madams	ee3cfca651	Fix ThreadCPUUsage when running on RTEMS. (#414 ) Change ThreadCPUUsage to call ProcessCPUUsage if __rtems__ is defined. RTEMS real time OS doesn't support CLOCK_THREAD_CPUTIME_ID. See https://github.com/RTEMS/rtems/blob/master/cpukit/posix/src/clockgettime.c#L58-L59 Prior to this change, ThreadCPUUsage would fail when running on RTEMS with: ERROR: clock_gettime(CLOCK_THREAD_CPUTIME_ID, ...) failed	2017-07-06 15:59:13 -07:00
Eric	9d4b719dae	Make Benchmark a single header library (but not header-only) (#407 ) * Make Benchmark a single header library (but not header-only) This patch refactors benchmark into a single header, to allow for slightly easier usage. The initial reason for the header split was to keep C++ library components from being included by benchmark_api.h, making that part of the library STL agnostic. However this has since changed and there seems to be little reason to separate the reporters from the rest of the library. * Fix internal_macros.h * Remove more references to macros.h	2017-07-04 16:31:47 -06:00
Eric	b8a2206fb2	Add ClearRegisteredBenchmark() function. (#402 ) * Add ClearRegisteredBenchmark() function. Since benchmarks can be registered at runtime using the RegisterBenchmark(...) functions, it makes sense to have a ClearRegisteredBenchmarks() function too, that can be used at runtime to clear the currently registered benchmark and re-register an entirely new set. This allows users to run a set of registered benchmarks, get the output using a custom reporter, and then clear and re-register new benchmarks based on the previous results. This fixes issue #400, at least partially. * Remove unused change	2017-06-14 09:16:53 -07:00
Yixuan Qiu	f3b3dd99be	Use the sample version of standard deviation (#383 ) * remove unnecessary weights * use sample standard deviation * add contributor information * remove redundant code * initialize variable to eliminate compiler warning	2017-06-05 10:32:15 -07:00
David Kruger	15e9ebaf83	Associate the required include directory with the benchmark library (#393 ) Using target_include_directories CMake will implicitly add the the necessary include paths to targets which link against the benchmark library. This is useful when the benchmark repo is included as a subdirectory in another CMake build.	2017-05-23 08:40:31 -07:00
Joao Paulo Magalhaes	ec6f03579e	Trying again to fix error caused by -Wunused-function. This thing with the pragma ignore was getting out of hand: now MinGW (and probably GCC) was erroring too. So I chose to move the definition of IsZero() out of the anonymous namespace into benchmark.cc.	2017-05-03 00:05:15 +01:00
Joao Paulo Magalhaes	160770fd08	Fix dropped-style elses.	2017-05-02 23:30:36 +01:00
Joao Paulo Magalhaes	ea019f3cd8	Allow different counter sets in CSV reporting.	2017-05-02 22:10:08 +01:00

1 2 3 4 5 ...

574 commits