Add stress testing framework, with basic metrics example to demonstrate. #3241

lalitb · 2025-01-10T19:30:20Z

Changes

This PR adds a basic stress testing framework to validate the scalability and reliability of the functionality under high-concurrency and long-running workloads. Unlike Google Benchmark, which focuses on micro-benchmarking and latency measurements for isolated operations, this framework tries to simulate sustained, multi-threaded workloads to test a given workload. The idea is to complement the existing benchmarks by adding stress-tests to addressing long-duration and high-concurrency use-cases.

This is already implemented for .Net and Rust, and most of the ideas are taken from there. I felt the need for this to test some optimizations I am doing for metrics, but feel to comment if this doesn't seem helpful.

Also added a basic stress-testing example for metrics to demonstrate. Below are the results from the metrics stress test as an example:

$ ./stress_metrics
Starting stress test with 16 threads...
Throughput: 5009490 it/s | Avg: 4885764 | Min: 4734280 | Max: 5132395
 
Test completed:
Total iterations: 203373637
Duration: 42 seconds
Average throughput: 4885764 iterations/sec
$

It’s still in the early stages and will need further enhancements but should be a good starting point. Future improvements could include adding memory and CPU usage information alongside the existing throughput, as well as refining the initial warm-up period to sustain consistent data collection.

Implementation Details:

Worker Threads:
- The worker threads (default to number of cores) are spawned to execute the workload.
- Each worker thread executes the workload function (func) in a loop until a global STOP flag is set. (ctrl-c)
- Each thread maintains its own iteration count to minimize contention.

Throughput Monitoring:
- A separate controller thread monitors throughput by periodically summing up iteration counts across threads.
- Throughput is calculated over a sliding window (SLIDING_WINDOW_SIZE) and displayed dynamically.

Final Summary:
- At the end of the test, the program calculates and prints the total iterations, duration, and average throughput.

For significant contributions please make sure you have completed the following items:

CHANGELOG.md updated for non-trivial changes
Unit tests have been added
Changes in public API reviewed

[pull] main from open-telemetry:main

netlify · 2025-01-10T19:30:37Z

✅ Deploy Preview for opentelemetry-cpp-api-docs canceled.

Name	Link
🔨 Latest commit	`eead3a0`
🔍 Latest deploy log	https://app.netlify.com/sites/opentelemetry-cpp-api-docs/deploys/6784eef5722f2a000895043d

codecov · 2025-01-13T05:17:28Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 87.73%. Comparing base (d693e95) to head (eead3a0).

Additional details and impacted files

@@           Coverage Diff           @@
##             main    #3241   +/-   ##
=======================================
  Coverage   87.73%   87.73%           
=======================================
  Files         198      198           
  Lines        6258     6258           
=======================================
  Hits         5490     5490           
  Misses        768      768

…o stress-test

lalitb and others added 5 commits December 18, 2024 11:04

Merge pull request #308 from open-telemetry/main

2564cc6

[pull] main from open-telemetry:main

Merge branch 'main' of github.com:lalitb/opentelemetry-cpp into main

9433197

initial commit

11bd32c

formar

22a178b

add docs

449f360

lalitb requested a review from a team as a code owner January 10, 2025 19:30

lalitb marked this pull request as draft January 10, 2025 19:30

lalitb added 2 commits January 11, 2025 01:00

Merge branch 'main' into stress-test

a385503

remove extra endif

f9b0814

lalitb and others added 9 commits January 12, 2025 21:34

maintainer mode build

5f9d0da

fix copyright

32d06ff

copyright

57d99c3

fix format

5a222fd

Add copyright and license information

03ffa54

fix msvc error

ab07553

Merge branch 'stress-test' of github.com:lalitb/opentelemetry-cpp int…

a2f17b1

…o stress-test

add newline

b56f996

Merge branch 'main' into stress-test

eead3a0

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add stress testing framework, with basic metrics example to demonstrate. #3241

Add stress testing framework, with basic metrics example to demonstrate. #3241

lalitb commented Jan 10, 2025

netlify bot commented Jan 10, 2025 •

edited

Loading

codecov bot commented Jan 13, 2025 •

edited

Loading

Add stress testing framework, with basic metrics example to demonstrate. #3241

Are you sure you want to change the base?

Add stress testing framework, with basic metrics example to demonstrate. #3241

Conversation

lalitb commented Jan 10, 2025

Changes

Implementation Details:

netlify bot commented Jan 10, 2025 • edited Loading

✅ Deploy Preview for opentelemetry-cpp-api-docs canceled.

codecov bot commented Jan 13, 2025 • edited Loading

Codecov Report

netlify bot commented Jan 10, 2025 •

edited

Loading

codecov bot commented Jan 13, 2025 •

edited

Loading