Implemented metrics for LoadBasedPartitionAssignmentStrategy #840

jzakaryan · 2021-06-25T00:58:49Z

Implemented the following metrics for LoadBasedPartitionAssignmentStrategy:

Minimum partitions across tasks (per datastream)
Maximum partitions across tasks (per datastream)
Throughput information fetch rate

Important: DO NOT REPORT SECURITY ISSUES DIRECTLY ON GITHUB.
For reporting security issues and contributing security fixes,
please, email security@linkedin.com instead, as described in
the contribution guidelines.

Please, take a minute to review the contribution guidelines at:
https://github.com/linkedin/Brooklin/blob/master/CONTRIBUTING.md

vmaheshw

My recommendation is to not pass the metrics from the assigner class. Instead emit the metric from the assigner class itself and expose additional methods like getMetricsInfos() that you can use. This way all the metric logic will remain in one place and the code will look cleaner.

...rver/src/main/java/com/linkedin/datastream/server/assignment/LoadBasedPartitionAssigner.java

jzakaryan · 2021-06-29T20:48:29Z

...ain/java/com/linkedin/datastream/server/assignment/LoadBasedPartitionAssignmentStrategy.java

+
+  @Override
+  protected void unregisterMetrics(String datastream) {
+    _assigner.unregisterMetricsForDatastream(datastream);


This method still has to be here, because there's a path for cleaning metrics in StickyPartitionAssignmentStrategy which calls this method.

...rver/src/main/java/com/linkedin/datastream/server/assignment/LoadBasedPartitionAssigner.java

vmaheshw · 2021-06-30T18:34:07Z

...rver/src/main/java/com/linkedin/datastream/server/assignment/LoadBasedPartitionAssigner.java

@@ -144,6 +169,14 @@
      newAssignments.put(instance, newTasks);
    });

+    // update metrics
+    PartitionAssignmentStats stats = new PartitionAssignmentStats(minPartitionsAcrossTasks.get(),


Do you need a separate PartitionAssignmentStats class? You can probably get rid of it.

I think it makes the code cleaner. If I remove it, I'll have to maintain two separate maps for min/max partitions across tasks per datastream. This is more extensible as we can add more stats and add more per-datastream metrics in the future (e.g. the maxThroughputAcrossTasks we discussed earlier).

The major problem I can see with this future extension is, if there is another method created and that method calculates some stats, then either you will have to define all the getter and setters and make the fields non-final. Suppose we want to create any alert based metric, then all the other fields also need to be set. To overcome this, you may have to optional to figure out which fields are set and it will be an overkill.

You can have separate methods for each metric for create/Update the metric and just call those method, just like it is done in other classes in this repo.

I'm not sure what you mean by an alert-based metric, but from the description it seems like it doesn't fit with what I see as partition assignment stats. Semantically they might be incompatible, and we don't have to put it in the class. I still see value in keeping PartitionAssignmentStats around, for the reasons I mentioned above.

I mean if you have to emit a new metric which is in the exception path (and you don't calculate the other stats), then reusing this stat will be an issue and will require refactoring, V/s addition of new metric not impacting any other code change.

...rver/src/main/java/com/linkedin/datastream/server/assignment/LoadBasedPartitionAssigner.java

shrinandthakkar

Also, in the case when a datastream is deleted, how would this work? Since we are never deleting the record for a datastream from the _partitionAssignmentStatsMap.

...rver/src/main/java/com/linkedin/datastream/server/assignment/LoadBasedPartitionAssigner.java

shrinandthakkar · 2021-07-02T18:24:26Z

Also, in the case when a datastream is deleted, how would this work? Since we are never deleting the record for a datastream from the _partitionAssignmentStatsMap.

talked with jhora offline and he said, "on datastream delete/stop, the metric gets unregistered i.e. the values for deleted/stopped metrics don’t get accessed and emitted"

vmaheshw

Approving with the note that there might be refactoring required for stats in the future based on the requirement.

…n#840)

Implemented metrics

d53286f

jzakaryan requested review from atoomula, vmaheshw and shrinandthakkar June 25, 2021 00:58

jzakaryan mentioned this pull request Jun 28, 2021

Make the Throughput based assignment and task estimation based on partition assignment configurable. #841

Merged

vmaheshw requested changes Jun 28, 2021

View reviewed changes

...rver/src/main/java/com/linkedin/datastream/server/assignment/LoadBasedPartitionAssigner.java Outdated Show resolved Hide resolved

jzakaryan added 2 commits June 29, 2021 10:20

Moved some of the metrics to LoadBasedPartitionAssigner

162f197

Merged changes from upstream master

2583181

jzakaryan requested a review from vmaheshw June 29, 2021 20:45

jzakaryan commented Jun 29, 2021

View reviewed changes

...rver/src/main/java/com/linkedin/datastream/server/assignment/LoadBasedPartitionAssigner.java Outdated Show resolved Hide resolved

jzakaryan commented Jun 29, 2021

View reviewed changes

Minor fix

752ac9a

vmaheshw reviewed Jun 30, 2021

View reviewed changes

jzakaryan requested a review from vmaheshw July 1, 2021 01:16

shrinandthakkar reviewed Jul 1, 2021

View reviewed changes

Minor improvements

9d74133

jzakaryan requested a review from shrinandthakkar July 2, 2021 18:01

shrinandthakkar approved these changes Jul 2, 2021

View reviewed changes

vmaheshw approved these changes Jul 3, 2021

View reviewed changes

jzakaryan merged commit ddde7ae into linkedin:master Jul 12, 2021

vmaheshw pushed a commit to vmaheshw/brooklin that referenced this pull request Mar 1, 2022

Implemented metrics for LoadBasedPartitionAssignmentStrategy (linkedi…

1741d1f

…n#840)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Implemented metrics for LoadBasedPartitionAssignmentStrategy #840

Implemented metrics for LoadBasedPartitionAssignmentStrategy #840

jzakaryan commented Jun 25, 2021

vmaheshw left a comment

jzakaryan Jun 29, 2021

vmaheshw Jun 30, 2021

jzakaryan Jul 1, 2021

vmaheshw Jul 2, 2021

jzakaryan Jul 2, 2021

vmaheshw Jul 3, 2021

shrinandthakkar left a comment

shrinandthakkar commented Jul 2, 2021

vmaheshw left a comment

Implemented metrics for LoadBasedPartitionAssignmentStrategy #840

Implemented metrics for LoadBasedPartitionAssignmentStrategy #840

Conversation

jzakaryan commented Jun 25, 2021

vmaheshw left a comment

Choose a reason for hiding this comment

jzakaryan Jun 29, 2021

Choose a reason for hiding this comment

vmaheshw Jun 30, 2021

Choose a reason for hiding this comment

jzakaryan Jul 1, 2021

Choose a reason for hiding this comment

vmaheshw Jul 2, 2021

Choose a reason for hiding this comment

jzakaryan Jul 2, 2021

Choose a reason for hiding this comment

vmaheshw Jul 3, 2021

Choose a reason for hiding this comment

shrinandthakkar left a comment

Choose a reason for hiding this comment

shrinandthakkar commented Jul 2, 2021

vmaheshw left a comment

Choose a reason for hiding this comment