Add metrics reporting #37

yusefnapora · 2019-07-25T14:49:51Z

So, last week I was really bothered about libp2p/go-libp2p-kad-dht#283, but after I got over my stomach bug and got my optimism back, I realized that this is a great opportunity to flex our new testlab and measure the impact of requiring peers to be connected.

To get us towards a meaningful test scenario, this adds some metrics about k-bucket utilization. It's a bit awkward because the Update method, where pretty much all the metrics are recorded, doesn't have a Context parameter, so I made it default to context.Background() and added a new UpdateAndRecordMetrics method that takes a Context.

Also not sure if I like how the measures and views are exported - I came across the discussion at libp2p/go-libp2p-kad-dht#327 and agree that long-term we need to have a standard way to do this. For now I stuffed the views in a map keyed by measure name, but that was before I realized that Go doesn't have a Map.Values method, so now I kind of hate it and will probably just put them in a slice after all.

Anyway, hopefully this is close-ish to what we want - all feedback welcome.

Stebalien · 2019-07-25T21:13:24Z

We may want to make this PR against the stabilize branch.

bigs · 2019-07-26T03:33:59Z

@yusefnapora the slice definitely plays nicely with our metrics package. you can always iterate the values and build the slice yourself. it's equivalent to a Values method anyway.
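For reference, a minimal sketch of building the slice by iterating the map. The `View` type here is a hypothetical stand-in for the OpenCensus view type so the example stays stdlib-only, and the sort is an extra step added only to make the order deterministic, since Go map iteration order is not.

```go
package main

import (
	"fmt"
	"sort"
)

// View is a hypothetical stand-in for the real view type.
type View struct {
	Name string
}

// viewSlice collects the map's values into a slice, sorted by name
// so the result does not depend on map iteration order.
func viewSlice(m map[string]*View) []*View {
	out := make([]*View, 0, len(m))
	for _, v := range m {
		out = append(out, v)
	}
	sort.Slice(out, func(i, j int) bool { return out[i].Name < out[j].Name })
	return out
}

func main() {
	views := map[string]*View{
		"peers_removed": &View{Name: "peers_removed"},
		"peers_added":   &View{Name: "peers_added"},
	}
	for _, v := range viewSlice(views) {
		fmt.Println(v.Name)
	}
}
```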

bigs

generally looks pretty good. probably good to target the stabilize branch as steven mentioned. let's test this out w/ prometheus and grafana!

bigs · 2019-07-26T03:46:34Z

table.go

@@ -69,23 +73,44 @@ func (rt *RoutingTable) Update(p peer.ID) (evicted peer.ID, err error) {
 		bucketID = len(rt.Buckets) - 1
 	}

+	var full = 0


something as intensive as this should maybe be enabled with a flag. could just be a module scope var
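A rough sketch of what such a flag could look like. `DetailedMetrics`, `bucket`, and `countFull` are hypothetical names for illustration, not identifiers from this PR; the point is only that the loop over all buckets is skipped unless the package-level switch is on.

```go
package main

import "fmt"

// DetailedMetrics is a hypothetical package-level switch that gates
// the expensive scan over every bucket. It is off by default.
var DetailedMetrics = false

// bucket is a toy stand-in for a k-bucket.
type bucket struct {
	peers    int
	capacity int
}

// countFull returns the number of full buckets. The second result is
// false when the scan was skipped because DetailedMetrics is off.
func countFull(buckets []bucket) (int, bool) {
	if !DetailedMetrics {
		return 0, false
	}
	full := 0
	for _, b := range buckets {
		if b.peers >= b.capacity {
			full++
		}
	}
	return full, true
}

func main() {
	buckets := []bucket{{peers: 20, capacity: 20}, {peers: 3, capacity: 20}}
	fmt.Println(countFull(buckets)) // scan skipped by default
	DetailedMetrics = true
	fmt.Println(countFull(buckets))
}
```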

lanzafame

Looks good, mainly just adjustments to reduce the perf overhead, which, in general, equates to reducing the number of tags added and not recording any stats in loops.

lanzafame · 2019-07-29T07:25:14Z

metrics/metrics.go

+	recordWithBucketIndex(ctx, bucketIndex, KBucketPeersRemoved.M(1))
+}
+
+var DefaultViews = map[string]*view.View{


Initialisation of views should be separate from the construction of the default views. Also DefaultViews should be a slice so that when registered it looks like, err := view.Register(kbmetrics.DefaultViews...).

lanzafame · 2019-07-29T08:03:17Z

metrics/metrics.go

+// aren't sufficient. However, they should be updated using the functions below to avoid
+// leaking OpenCensus cruft throughout the rest of the code.
+var (
+	KBucketsFull = stats.Int64(MeasureBucketsFull,


nitpick: If you are multi-lining a function call, put each parameter on a new line and not a combination. In this particular case, the third parameter is lost at the end of the line.

lanzafame · 2019-07-29T08:07:00Z

table.go

-func (rt *RoutingTable) Update(p peer.ID) (evicted peer.ID, err error) {
+// UpdateAndRecordMetrics adds or moves the given peer to the front of its respective bucket, while recording
+// metrics about bucket capacities and peer additions and removals.
+func (rt *RoutingTable) UpdateAndRecordMetrics(ctx context.Context, p peer.ID) (evicted peer.ID, err error) {


Recording metrics shouldn't be considered 'extra' functionality; it is just a requirement of running a production system. If the rename is because of the change to the parameters, it is a common pattern to append Ctx to the function name when the only change is to add the ctx parameter to the function signature, i.e. UpdateCtx.

Thanks, I didn't like the new name and am glad there's an idiom I can use 😄
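A minimal sketch of the Ctx-suffix pattern discussed above, assuming the existing method keeps its signature and delegates with context.Background(). The `RoutingTable` here is a toy that stores peers as strings in a single list; it is not the real k-bucket implementation.

```go
package main

import (
	"context"
	"fmt"
)

// RoutingTable is a toy stand-in that keeps peers in one ordered list.
type RoutingTable struct {
	peers []string
}

// Update keeps the original signature and delegates to UpdateCtx
// with a background context.
func (rt *RoutingTable) Update(p string) error {
	return rt.UpdateCtx(context.Background(), p)
}

// UpdateCtx moves p to the front of the list, adding it if absent.
// It returns early if the context is already done.
func (rt *RoutingTable) UpdateCtx(ctx context.Context, p string) error {
	if err := ctx.Err(); err != nil {
		return err
	}
	for i, q := range rt.peers {
		if q == p {
			rt.peers = append(rt.peers[:i], rt.peers[i+1:]...)
			break
		}
	}
	rt.peers = append([]string{p}, rt.peers...)
	return nil
}

func main() {
	rt := &RoutingTable{}
	_ = rt.Update("a")
	_ = rt.Update("b")

	ctx, cancel := context.WithCancel(context.Background())
	cancel()
	fmt.Println(rt.UpdateCtx(ctx, "c")) // non-nil: the context was canceled
	fmt.Println(rt.peers)
}
```

Existing callers of Update are unaffected, while callers that already carry a context can pass it through.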

lanzafame · 2019-07-29T09:10:28Z

metrics/metrics.go

+// indicating the index of the bucket to which the measurement applies.
+func recordWithBucketIndex(ctx context.Context, bucketIndex int, ms ...stats.Measurement) {
+	_ = stats.RecordWithTags(ctx,
+		[]tag.Mutator{tag.Upsert(keyBucketIndex, string(bucketIndex))},


tag.Upsert is a performance killer, unfortunately, so the way that this helper operates results in a lot of extra allocations. I suggest creating a helper that does the same as LocalContext but for bucket index and gets called after the bucketID is defined, just once.
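A side note on the quoted line, separate from the allocation concern: in Go, converting an int with string(...) yields the UTF-8 encoding of that code point, not its decimal digits. Assuming the tag value is meant to be the decimal bucket index (an assumption, the PR does not say), strconv.Itoa is the usual choice. The helper names below are illustrative only.

```go
package main

import (
	"fmt"
	"strconv"
)

// asRune mirrors what string(i) does for an int: it interprets the
// value as a Unicode code point.
func asRune(i int) string {
	return string(rune(i))
}

// asDecimal formats the value as base-10 digits.
func asDecimal(i int) string {
	return strconv.Itoa(i)
}

func main() {
	fmt.Printf("%q %q\n", asRune(65), asDecimal(65)) // "A" "65"
}
```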

lanzafame · 2019-07-29T09:18:23Z

table.go

@@ -69,23 +73,44 @@ func (rt *RoutingTable) Update(p peer.ID) (evicted peer.ID, err error) {
 		bucketID = len(rt.Buckets) - 1
 	}

+	var full = 0
+	var nonEmpty = 0
+	for i, buck := range rt.Buckets {


Ideally, we could get a snapshot of all the buckets straight away, but it is really expensive. I think a better way to approach this would be to build up the view of all buckets one bucket at a time. So whichever bucket is chosen for the Update operation is the one we record metrics for.

Good idea - the measures for the # of full and non-empty buckets are redundant with the utilization measure anyway. I'll rewrite this to just record utilization for the buckets we actually visit in the Update method and remove the loop.

- renames the Update and Remove methods that accept a Context to UpdateCtx and RemoveCtx.
- removes the full and non-empty bucket measures (since they can be derived from utilization).
- removes the iteration of all buckets on update and only records utilization for the bucket being modified.
- exports default views individually and in a slice.

defining the metrics ctx just before use looks nicer

lanzafame

Would love for the multiline nitpick to be addressed, but am happy to see this merged either way.

yusefnapora added 2 commits July 25, 2019 10:37

add metrics reporting

1d8019d

someday I will totally add a go fmt pre-commit hook, I swear

bdb78a4

Stebalien requested a review from bigs July 25, 2019 21:13

bigs suggested changes Jul 26, 2019

lanzafame suggested changes Jul 29, 2019

yusefnapora added 4 commits July 29, 2019 10:02

go fmt

b68529c

combine LocalContext and BucketContext

9e2000b

use BucketContext & record utilization on removal

487d482

yusefnapora force-pushed the feat/bucket-metrics branch from 189b505 to 487d482 Compare July 29, 2019 14:34

cosmetic line reordering

cf1e083

lanzafame approved these changes Jul 30, 2019

iand mentioned this pull request Jun 29, 2023

Instrument DHT with metrics probe-lab/zikade#24

Open
