benchmark

Commit Graph

Author	SHA1	Message	Date
Andy Christiansen	4931aefb51	Fix broken PFM-enabled tests (#1623 ) * Add pfm CI actions for bazel * Fix problems in unit test. * Undo enabling the CI tests for pfm - github CI machines seemingly do not support performance counters. * Remove commented code - can be revisited in github history when needed, and there's a comment explaining the rationale behind the new test code. --------- Co-authored-by: Andy Christiansen <achristiansen@google.com> Co-authored-by: dominic <510002+dmah42@users.noreply.github.com>	2023-07-07 09:58:16 +01:00
Enrico Seiler	e730f91d8c	Fix passing non-const lvalue refs to DoNotOptimize (#1622 )	2023-07-05 18:05:08 +01:00
Bulat Gaifullin	df9a99d998	Fix pass rvalue to DoNotOptimize (#1608 ) * Fix pass rvalue to DoNotOptimize #1584 * Add test	2023-06-19 08:35:52 +01:00
Vy Nguyen	fec77322b4	Fix code triggering -Wsign-conversion (#1596 ) * Fix code triggering -Wsign-conversion * more test	2023-05-11 08:40:05 +01:00
Andy Christiansen	318dd44225	Disable debug-only test in release builds to avoid expected failures. (#1595 ) Co-authored-by: Andy Christiansen <achristiansen@google.com>	2023-05-10 10:18:43 +01:00
pkasting	d29044d5da	Fix compile warnings about ignoring a [[nodiscard]] type. (#1577 ) Co-authored-by: dominic <510002+dmah42@users.noreply.github.com>	2023-03-16 11:07:13 +00:00
dominic	efc89f0b52	link to benchmark directly for tests that aren't link_main_test (#1576 )	2023-03-14 13:35:32 +00:00
dominic	46d3c84518	Convert uses of `const char` to `std::string` (#1567 ) Convert uses of const char* to std::string * fix sanitizer builds * reformat user guide * include python bindings * clang-format	2023-03-14 13:10:27 +00:00
Henrique Bucher	9f7dc386be	Address warnings on NVIDIA nvc++ (#1573 ) * Address warnings on NVIDIA nvc++ Types of warnings were being generated: 1. Deprecated warnings - solved by defining the relevant BENCHMARK_* macros for nvc++ and adding pragma suppress on a couple of .cc files 2. Setup/TearDown const vs non-const partial override - solved by adding non-const version 3. Static but not referenced - added diagnostic suppress for that file * Modified manually to comply with CD/CI * Revert partial override * Suppress warnings from tests if compiler is NVHPC --------- Co-authored-by: dominic <510002+dmah42@users.noreply.github.com>	2023-03-13 12:34:12 +00:00
Mike Apodaca	adb0d3d0bf	[FR] state.SkipWithMessage #963 (#1564 ) * Add `SkipWithMessage` * Added `enum Skipped` * Fix: error at end of enumerator list * Fix lint errors --------- Co-authored-by: dominic <510002+dmah42@users.noreply.github.com>	2023-03-08 18:24:48 +00:00
Henrique Bucher	fbc6efa9b5	Refactoring of PerfCounters infrastructure (#1559 ) * Refactoring of PerfCounters infrastructure The main feature in this pull request is the removal of the static sharing of PerfCounters and instead creating them at the top `RunBenchmarks()` function where all benchmark runners are created. A single PerfCountersMeasurement object is created and then shared with all the new BenchmarkRunners objects, one per existing benchmark. Other features conflated here in this PR are: - Added BENCHMARK_DONT_OPTIMIZE macro in global scope - Removal of the `IsValid()` query, being replaced by checking the number of remaining counters after validity tests - Refactoring of all GTests to reflect the changes and new semantics - extra comments throughout the new code to clarify intent It was extremely hard to separate all those features in different PRs as requested since they are so interdependent on each other so I'm just pushing them altogether and asking for forgiveness. This PR comes replacing PRs 1555 and 1558 which have been closed. * Fixed whitespace issue with clang-format My clang-format insists in deleting this single white space on line 601 while Github's clang format breaks when it is added. I had to disable format-on-save to check-in this revert change. I'm using clang 14.0.6.	2023-03-07 10:27:52 +00:00
dominic	9885aefb96	get rid of warnings in tests (#1562 )	2023-03-06 14:47:54 +00:00
Henrique Bucher	2d5012275a	Filter performance counter names, not invalidate all (#1554 ) * Filter performance counter names, not invalidate all Currently, the performance counters are validated while they are being created and one failure returns NoCounters(), ie it effecitvely invalidates all the counters. I would like to propose a new behavior: filter instead. If an invalid name is added to the counter list, or if that particular counter is not supported on this platform, that counter is dropped from the list and an error messages is created, while all the other counters remain active. This will give testers a peace of mind that if one mistake is made or if something is changed or removed from libpfm, their entire test will not be invalidated. This feature gives more tolerance with respect to versioning. Another positive is that testers can now input a superset of all desired counters for all platforms they support and just let Benchmark drop all those that are not supported, although it will create quite a lot of noise down the line, in which case perhaps we should drop silently or make a consolidated, single error line but this was not implemented in this change set. * Removed unused helper type.	2023-03-02 14:56:13 +00:00
Henrique Bucher	27c1d8ace9	Implement unlimited number of performance counters (#1552 ) * Implement unlimited number of performance counters Linux performance counters will limit the number of hardware counters per reading group. For that reason the implementation of PerfCounters is limited to 3. However if only software counters are added, there is no reason to limit the counters. For hardware counters, we create multiple groups and store a vector or leaders in the PerfCounters object. When reading, there is an extra time waste by iterating through all the group leaders. However this should be the same performance as with today. Reading is done by groups and it had to be heavily adjusted with the logic being moved to PerfCounterValues. I created a test for x86-64 and took care of filtering out the events in case it runs in a platform that does not support those counters - the test will not fail. The current tests were already failing (ReOpenExistingCounters, CreateExistingMeasurements and MultiThreaded) on the main branch and they continue to fail after this implementation - I did not fix those not to conflate all here. * Moved the PerfCounterValues::Read() implementation from header to body. * Added missing implementation of PerfCounters::IsCounterSupported when HAVE_LIBPFM is not defined. * Changed comments to reflect the implementation * Removed arg name so it does not generate an error when HAVE_LIBPBM is not defined. * Made loop counter a const reference for clang-tidy * Added missig BENCHMARK_EXPORT to PerfCounterValues	2023-03-01 15:30:41 +00:00
Yury Fedorov	bd721f9859	Removing warnings appearing with C++20 / CLang 15 (#1542 ) * Removing warnings appearing with C++20 / CLang 15 ``` [ 70%] Building CXX object _deps/benchmark-build/test/CMakeFiles/benchmark_min_time_flag_time_test.dir/benchmark_min_time_flag_time_test.cc.o /home/xxx/cpp/_deps/benchmark-src/test/benchmark_min_time_flag_time_test.cc:31:55: warning: unused parameter 'has_explicit_iters' [-Wunused-parameter] virtual void ReportRunsConfig(double min_time, bool has_explicit_iters, ^ /home/xxx/cpp/_deps/benchmark-src/test/benchmark_min_time_flag_time_test.cc:32:48: warning: unused parameter 'iters' [-Wunused-parameter] IterationCount iters) BENCHMARK_OVERRIDE { ^ 2 warnings generated. ``` ``` [ 70%] Building CXX object _deps/benchmark-build/test/CMakeFiles/benchmark_min_time_flag_iters_test.dir/benchmark_min_time_flag_iters_test.cc.o /home/xxx/cpp/_deps/benchmark-src/test/benchmark_min_time_flag_iters_test.cc:22:36: warning: implicit conversion loses integer precision: 'const benchmark::IterationCount' (aka 'const long') to 'std::vector<int>::value_type' (aka 'int') [-Wshorten-64-to-32] iter_nums_.push_back(report[0].iterations); ~~~~~~~~~ ~~~~~~~~~~^~~~~~~~~~ 1 warning generated. ``` * Refactoring to get the proper type of collection * Refactoring to get the proper type of collection * clang format * bug fix in main	2023-02-13 11:18:07 +00:00
Vy Nguyen	6cf7725ea1	Allow specifying number of iterations via --benchmark_min_time. (#1525 ) * Allow specifying number of iterations via --benchmark_min_time. Make the flag accept two new suffixes: + <integer>x: number of iterations + <floag>s: minimum number of seconds. This matches the internal benchmark API. * forgot to change flag type to string * used tagged union instead of std::variant, which is not available pre C++14 * update decl in benchmark_runner.h too * fixed errors * refactor * backward compat * typo * use IterationCount type * fixed test * const_cast * ret type * remove extra _ * debug * fixed bug from reporting that caused the new configs not to be included in the final report * addressed review comments * restore unnecessary changes in test/BUILD * fix float comparisons warnings from Release builds * clang format * fix visibility warning * remove misc file * removed backup files * addressed review comments * fix shorten in warning * use suffix for existing min_time specs to silent warnings in tests * fix leaks * use default min-time value in flag decl for consistency * removed double kMinTimeDecl from benchmark.h * dont need to preserve errno * add death tests * Add BENCHMARK_EXPORT to hopefully fix missing def errors * only enable death tests in debug mode because bm_check is no-op in release mode * guard death tests with additional support-check macros * Add additional guard to prevent running in Release mode --------- Co-authored-by: dominic <510002+dmah42@users.noreply.github.com>	2023-02-07 11:45:18 +00:00
Dominic Hamon	f15f332fd1	get rid of some deprecation warnings from tests	2023-02-06 16:38:53 +00:00
dominic	ff8d44c928	fix #1446 by removing the address operator (#1538 ) * fix #1446 by removing the address operator * add test * format --------- Co-authored-by: Thomas <thomas.maierbacher@rohde-schwarz.com> Co-authored-by: Dominic Hamon <dominichamon@users.noreply.github.com>	2023-02-06 16:34:47 +00:00
SunBlack	cfbc94960f	Fix Clang-Tidy warnings readability-else-after-return (#1528 )	2023-01-16 12:28:48 +00:00
Vy Nguyen	a3235d7b69	Include the benchmark's family-name in State (#1511 ) * Include the benchmark's family-name in State For compat with internal library, where State::name() returns the benchmark's family name. * added missing files from prev commit * fix field-init order error * added test	2023-01-10 16:48:17 +00:00
SunBlack	fe65457e80	Fix typos found by codespell (#1519 )	2023-01-10 12:25:32 +00:00
SunBlack	37faf6f975	Fix Clang-Tidy warnings related to modernize-use-override (#1523 )	2023-01-09 17:52:18 +00:00
Yury Fedorov	62edc4fb00	Bug fix variable 'actual_iterations' set but not used (#1517 ) * Bug fix variable 'actual_iterations' set but not used Compiling the project in clang 15 without -Wno-unused-but-set-variable flag the following error is generated: benchmark-src/test/options_test.cc:70:10: error: variable 'actual_iterations' set but not used [-Werror,-Wunused-but-set-variable] size_t actual_iterations = 0; ^ * Adjust according formatting of `clang-format` Co-authored-by: dominic hamon <510002+dmah42@users.noreply.github.com>	2022-12-19 14:03:11 +00:00
Vy Nguyen	9714eb8d11	Removed deprecated function (#1506 ) * Removed deprecated function * updated tests too * restore comment Co-authored-by: dominic hamon <dominichamon@users.noreply.github.com>	2022-11-11 15:12:12 +00:00
Roman Lebedev	db4f581fbb	Partially revert "Do not depend on unversioned python binary (#1496 )" (#1501 ) As predicted, the cmake part of the change is contentious. https://github.com/google/benchmark/pull/1496#issuecomment-1276508266 This partially reverts commit `229bc5a937`.	2022-10-13 10:03:29 +01:00
Matthias Braun	229bc5a937	Do not depend on unversioned python binary (#1496 ) Some linux distributions no longer provide `python` binary and require usage of `python3` instead. This changes the scripts here and uses cmake `find_package(Python3` when running python. Co-authored-by: Dominic Hamon <dominichamon@users.noreply.github.com>	2022-10-10 14:46:41 +03:00
rorth	12e0d70a43	Fix Solaris compilation (#1499 ) (#1500 ) This patch fixes compilation on Solaris, addressing the problems reported in Issue #1499: * Provide `HOST_NAME_MAX` definition. * Match `sysconf(3C)` return type. * Avoid `-Wcast-qual` warnings with `libkstat(3KSTAT)` functions. * Avoid clash with `<floatingpoint.h>` `single` typedef.	2022-10-06 09:18:55 +01:00
AJ Heller	13196fff84	Clean up test documentation formatting (#1475 )	2022-08-27 20:41:33 +03:00
Dominic Hamon	974cd5a5c5	Ensure we don't need benchmark installed to pass c++ feature checks (#1456 ) * Ensure we don't need benchmark installed to pass c++ feature checks Requires removal of some dependencies on benchmark.h from internal low-level headers, which is a good thing. Also added better logging to the feature check cmake module.	2022-08-04 15:33:35 +01:00
Vy Nguyen	5eb16eebb3	Explicitly cast int literals to int8_t in tests to silence implicit-conversion warnings (#1455 ) * Explicitly cast int literals to int8_t in tests so silence implicit-conversion warnings Error came from: ``` : error: implicit conversion loses integer precision: 'const int' to 'const signed char' [-Werror,-Wimplicit-int-conversion] ``` * clang format * undo deleted line	2022-08-04 09:18:19 +01:00
Vy Nguyen	1cca1d091c	Fixed build issues on window (#1449 ) * Fixed build issues on window - Added missing dlimport/export attributes in function definitions. (They are needed in both decls and defs) - Removed dlimport/dlexprt attribute in private field. (global_context is not exported anywhere). * fixed incorrect include path * undo changes w.r.t HelperPrintf * removed forward decl of private variable - instead, introduce a getter and use it. * Removed forward decl from benchmark_gtest too Co-authored-by: Dominic Hamon <dominichamon@users.noreply.github.com>	2022-08-03 09:44:35 +01:00
Dominic Hamon	7b3ac07517	Stop generating the export header and just check it in (#1435 ) * Stop generating the export header and just check it in * format the new header * support windows * format the header again * avoid depending on internal macro * ensure we define the right thing for windows static builds * support older cmake * and for tests	2022-07-20 20:34:39 +01:00
Alexander Popov	dfdda57a12	Fix DoNotOptimize() GCC compile error with some types (#1340 ) (#1424 ) Non-const DoNotOptimize() can't compile when used with some types. Example of code which can't compile: char buffer3[3] = ""; benchmark::DoNotOptimize(buffer3); Error message: error: impossible constraint in 'asm' asm volatile("" : "+r"(value) : : "memory"); Introduced in `8545dfb` (Fix DoNotOptimize() GCC copy overhead (#1340) (#1410)) The cause is compiler can't work with the +r constraint for types that can't be placed perfectly in registers. For example, char array[3] can't be perfectly fit in register on x86_64 so it requires placed in memory but constraint doesn't allow that. Solution - Use +m,r constraint for the small objects so the compiler can decide to use register or/and memory - For the big objects +m constraint is used which allows avoiding extra copy bug(see #1340) - The same approach is used for the const version of DoNotOptimize() although the const version works fine with the "r" constraint only. Using mixed r,m constraint looks more general solution. See - Issue #1340 ([BUG] DoNotOptimize() adds overhead with extra copy of argument(gcc)) - Pull request #1410 (Fix DoNotOptimize() GCC copy overhead (#1340) #1410) - Commit `8545dfb` (Fix DoNotOptimize() GCC copy overhead (#1340) (#1410))	2022-07-04 10:27:05 +01:00
Dominic Hamon	b7afda2cd2	Revert "Add possibility to ask for libbenchmark version number (#1004 ) (#1403 )" (#1417 ) This reverts commit `efadf67a12`.	2022-06-20 17:52:03 +01:00
Alexander Popov	8545dfb3ea	Fix DoNotOptimize() GCC copy overhead (#1340 ) (#1410 ) * Fix DoNotOptimize() GCC copy overhead (#1340) The issue is that GCC DoNotOptimize() does a full copy of an argument if it's not a pointer and it slows down a benchmark. If an argument is big enough there is a memcpy() call for copying the argument. An argument object can be a big object so DoNotOptimize() could add sufficient overhead and affects benchmark results. The cause is in GCC behavior with asm volatile constraints. Looks like GCC trying to use r(register) constraint for all cases despite object size. See: https://gcc.gnu.org/bugzilla/show_bug.cgi?id=105519 The solution is the split DoNotOptimize() in two cases - value fits in register and value doesn't fit in register. And use case specific asm constraint. std::is_trivially_copyable trait is needed because "+r" constraint doesn't work with non trivial copyable objects. - Fix requires support C++11 feature std::is_trivially_copyable from GCC compiler. The feature has been supported since GCC 5 - Fallback for GCC version < 5 still exists but it uses "m" constraint which means a little bit more overhead in some cases - Add assembly tests for issued cases Fixes #1340 * Add supported compiler versions info for assembly tests - Assembly tests are inherently non-portable. So explicitly add GCC and Clang versions required for reliable tests passed - Write a warning message if the current compiler version isn't supported	2022-06-20 10:12:58 +01:00
Matthias Donaubauer	efadf67a12	Add possibility to ask for libbenchmark version number (#1004 ) (#1403 ) * Add possibility to ask for libbenchmark version number (#1004) Add a header which holds the current major, minor, and patch number of the library. The header is auto generated by CMake. * Do not generate unused functions (#1004) * Add support for version number in bazel (#1004) * Fix clang format #1004 * Fix more clang format problems (#1004) * Use git version feature of cmake to determine current lib version * Rename version_config header to version * Bake git version into bazel build * Use same input config header as in cmake for version.h * Adapt the releasing.md to include versioning in bazel	2022-06-20 09:45:50 +01:00
Dominic Hamon	2365c4a603	add multiple OSes to bazel workflow (#1412 ) * add multiple OSes to bazel workflow * correct indent * only set copts when they're supported by the OS * os check should work * pull out cxx03_test for per-platform stuff * attempt to fix windows test output	2022-06-13 17:45:20 +01:00
Dominic Hamon	920fa14898	fix some build warnings on type conversions	2022-06-08 10:32:20 +01:00
Matthdonau	7eb8c0fe45	Introduce warmup phase to BenchmarkRunner (#1130 ) (#1399 ) * Introduce warmup phase to BenchmarkRunner (#1130) In order to account for caching effects in user benchmarks introduce a new command line option "--benchmark_min_warmup_time" which allows to specify an amount of time for which the benchmark should be run before results are meaningful. * Adapt review suggestions regarding introduction of warmup phase (#1130) * Fix BM_CHECK call in MinWarmUpTime (#1130) * Fix comment on requirements of MinWarmUpTime (#1130) * Add basic description of warmup phase mechanism to user guide (#1130)	2022-05-23 13:50:17 +01:00
Matthdonau	37be1e8252	Add option to get the verbosity provided by commandline flag -v (#1330 ) (#1397 ) * Add option to get the verbosity provided by commandline flag -v (#1330) * replace assert with test failure asserts are stripped out in non debug builds, and we run tests in non-debug CI bots. * clang-format my own tweak Co-authored-by: Dominic Hamon <dominichamon@users.noreply.github.com>	2022-05-17 17:59:36 +01:00
Dominic Hamon	8d86026c67	Enable -Wconversion (#1390 ) Requires some casts here and there, but nothing unreasonable. Fixes #1268	2022-05-01 19:56:30 +01:00
Dominic Hamon	a162a38ca0	Filter out benchmarks that start with "DISABLED_" (#1387 ) * Filter out benchmarks that start with "DISABLED_" This could be slightly more elegant, in that the registration and the benchmark definition names have to change. Ideally, we'd still register without the DISABLED_ prefix and it would all "just work". Fixes #1365 * add some documentation	2022-05-01 10:41:34 +01:00
Vy Nguyen	eacce0b503	Add SetBenchmarkFilter() to set --benchmark_filter flag value in user code (#1362 ) * Add SetBenchmarkFilter() to set --benchmark_filter flag value in user code. Use case: Provide an API to set this flag indepedence of the flag's implementation (ie., absl flag vs benchmark's flag facility) * add test * added notes on Initialize()	2022-03-08 16:02:37 +00:00
Bátor Tallér	d08e7b6056	Allow setting the default time unit globally (#1337 ) * Add option to set the default time unit globally This commit introduces the `--benchmark_time_unit={ns\|us\|ms\|s}` command line argument. The argument only affects benchmarks where the time unit is not set explicitly. * Update AUTHORS and CONTRIBUTORS * Test `SetDefaultTimeUnit` * clang format * Use `GetDefaultTimeUnit()` for initializing `TimeUnit` variables * Review fixes * Export functions * Add comment	2022-03-04 11:07:01 +00:00
Sergiu Deitsch	9e47d070fe	annotate and export public symbols (#1321 )	2022-02-14 10:48:53 +00:00
Dominic Hamon	6e51dcbcc3	Expose default display reporter creation in public API (#1344 ) * Expose default display reporter creation in public API this is useful when a custom reporter wants to fall back on the default display reporter, but doesn't necessarily have access to the benchmark library flag configuration. * Make use of unique_ptr in the random interleaving test. * clang-format	2022-02-11 10:23:05 +00:00
staffantj	d2cbd4b26a	Avoid potential truncation issues for the integral type parameterized tests. (#1341 ) * The parameterized tests check both floating point and integral types. We might as well use types that avoid truncation warnings across the platforms * static_cast version of how to avoid truncation warnings in basic_test Co-authored-by: Staffan Tjernstrom <staffantj@users.noreply.github.com>	2022-02-08 16:40:43 +00:00
Liqiang TAO	d0fbf8ac23	Cache PerfCounters instance in PerfCountersMeasurement (#1308 ) This patch fixes #1306, by reducing the pinned instances of PerfCounters. The issue is caused by creating multiple pinned events in the same thread, doing so results in the Snapshot(PerfCounterValues* values) failing, and that's now discoverable. Creating multile pinned events is an unsupported behavior currently. The error would be detected at read() time, not perf_event_open() / iotcl() time. The unsupported benavior above is confirmed by Stephane Eranian @seranian, and he also pointed the dectection method. Finished this patch under the guidance of Mircea Trofin @mtrofin.	2022-01-25 10:14:20 +00:00
dominc8	ab867074da	clang-tidy: readability-redundant and performance (#1298 ) * clang-tidy: readability-redundant-* * clang-tidy: performance-*	2021-12-06 11:18:04 +00:00
dominc8	680d3fdbb5	Add clang-tidy check (#1290 ) * Add clang-tidy.yml and .clang-tidy * Add mention to authors/contributors * Temp fix 2 clang-tidy issues * Enable clang-tidy on pull requests * Exclude gtest source files from clang-tidy	2021-11-25 15:47:44 +00:00

1 2 3 4 5 ...

359 Commits