use prom server registry for load generator & adjust buckets #4581

longbowlu · 2022-09-12T20:02:58Z

such that NetworkAuthorityClient's metrics can be reported to prom server

and adjust buckets again following https://prometheus.io/docs/practices/histograms/#errors-of-quantile-estimation

andll

Thanks for updating buckets @longbowlu!

velvia · 2022-09-13T17:19:38Z

crates/sui-benchmark/src/drivers/bench_driver.rs

@@ -43,7 +43,9 @@ pub struct BenchMetrics {
    pub latency_s: HistogramVec,
 }

-const LATENCY_SEC_BUCKETS: &[f64] = &[0.01, 0.1, 1., 2., 3., 5., 10., 20., 30., 60., 180.];
+const LATENCY_SEC_BUCKETS: &[f64] = &[
+    0.01, 0.05, 0.1, 0.5, 1., 2.5, 5., 10., 20., 30., 40., 50., 60., 90.,


Are we sure we don't need a bucket beyond 90?

I think it should be sufficient to use the same buckets as for 1-10, i.e. 10, 25, 50, 100.

Technically, if you want to minimize the amount of error, the buckets should be strictly geometric.
It doesn't really matter where the bucket boundaries are because Prom will linearly interpolate anyways.

The geometric series for 4 buckets between 10 and 100 would be (rounded):
10, 18, 32, 56, 100
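As a rough illustration of how such a series can be derived, here is a minimal Rust sketch using only the standard library. The function name `geometric_buckets` and its parameters are hypothetical and are not part of this PR or of any Prometheus client API.

```rust
/// Returns `n + 1` bucket boundaries from `start` to `end` (both included),
/// where each boundary is a constant factor larger than the previous one.
/// Hypothetical helper for illustration only.
fn geometric_buckets(start: f64, end: f64, n: usize) -> Vec<f64> {
    assert!(start > 0.0 && end > start && n > 0);
    // Constant ratio between neighbouring boundaries: (end / start)^(1 / n).
    let ratio = (end / start).powf(1.0 / n as f64);
    (0..=n).map(|i| start * ratio.powi(i as i32)).collect()
}
```

Calling `geometric_buckets(10.0, 100.0, 4)` yields approximately 10, 17.8, 31.6, 56.2, 100, which rounds to the series quoted above.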

The reason is that 10-20 is a much bigger relative gap (2x) than 50-60 (1.2x), so there will be a much higher rate of relative error in the 10-20 bucket.
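To make the relative-error argument concrete, here is a simplified Rust sketch of quantile estimation by linear interpolation inside a bucket, in the spirit of what Prometheus does. It assumes finite upper bounds only (no `+Inf` bucket) and a lower bound of 0 for the first bucket. The names `estimate_quantile` and `worst_case_relative_error` are hypothetical and exist only for this example.

```rust
/// Estimates the `q`-quantile from cumulative bucket counts by interpolating
/// linearly inside the bucket that contains the target rank.
/// Simplified sketch: no `+Inf` bucket, first bucket assumed to start at 0.
fn estimate_quantile(q: f64, upper_bounds: &[f64], cumulative: &[u64]) -> Option<f64> {
    if upper_bounds.is_empty()
        || upper_bounds.len() != cumulative.len()
        || !(0.0..=1.0).contains(&q)
    {
        return None;
    }
    let total = *cumulative.last()? as f64;
    if total == 0.0 {
        return None; // no observations, nothing to estimate
    }
    let rank = q * total;
    // First bucket whose cumulative count reaches the target rank.
    let idx = cumulative.iter().position(|&c| c as f64 >= rank)?;
    let lo = if idx == 0 { 0.0 } else { upper_bounds[idx - 1] };
    let hi = upper_bounds[idx];
    let below = if idx == 0 { 0.0 } else { cumulative[idx - 1] as f64 };
    let in_bucket = cumulative[idx] as f64 - below;
    if in_bucket == 0.0 {
        return Some(lo); // only reachable for q = 0 with an empty first bucket
    }
    Some(lo + (hi - lo) * (rank - below) / in_bucket)
}

/// Worst-case relative error when the true value sits at `lo` but the
/// estimate lands at `hi`: the bucket width relative to its lower bound.
fn worst_case_relative_error(lo: f64, hi: f64) -> f64 {
    (hi - lo) / lo
}
```

With bounds `[10, 20, 50, 60]`, if all 100 observations fall in the 10-20 bucket, the p50 estimate is 15 regardless of where they actually lie, and the worst-case relative error for that bucket is 100%. For the 50-60 bucket the estimate is 55 and the worst case is only 20%.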

Also, a global const would be good for LATENCY_SEC_BUCKETS. Maybe we should put some in Mysten-infra; I may do this myself.

We may be interested in different buckets for different metrics.

I thought a little bit about it and yes, I would even say we don't need buckets beyond ~10s.

However, I also think it is hard to come up with a decent set of buckets. I have a PR in progress that will try to calculate real percentiles rather than requiring buckets to be specified up front. It is not going to be trivial, but I think it should work. In the meantime, I thought it would be OK to merge some improvement to the bucket list so that we get better numbers if there is another deployment.

Note that we already have automatic histograms for all spans. So we shouldn't need to explicitly add histograms where we already have spans - we should instead just add more spans. The span histograms use exponential buckets already.

This is preferred because we will need spans for tracing anyways.

I'll go through and clean this up when I have a chance.

longbowlu requested review from lxfind, mystenmark, sadhansood, velvia and bmwill September 12, 2022 20:02

bmwill approved these changes Sep 12, 2022

sadhansood approved these changes Sep 12, 2022

longbowlu force-pushed the use-registry-for-load-gen branch from 877d951 to fd43220 Compare September 13, 2022 00:11

longbowlu requested a review from andll September 13, 2022 00:11

longbowlu changed the title ~~use prom server registry for load generator~~ use prom server registry for load generator & adjust buckets Sep 13, 2022

andll approved these changes Sep 13, 2022

velvia reviewed Sep 13, 2022

longbowlu added 2 commits September 13, 2022 13:36

update/add buckets

60005d8

remove 40 and 50

85c24a4

longbowlu force-pushed the use-registry-for-load-gen branch from fd43220 to 85c24a4 Compare September 13, 2022 20:51

longbowlu enabled auto-merge (squash) September 13, 2022 20:59

longbowlu merged commit 09e052a into main Sep 13, 2022

longbowlu deleted the use-registry-for-load-gen branch September 13, 2022 21:20
