Add in-memory storage support for adaptive sampling #3335

srikanthccv · 2021-10-21T05:25:40Z

Signed-off-by: Srikanth Chekuri srikanth.chekuri92@gmail.com

Which problem is this PR solving?

Part of Storage backends for adaptive sampling #3305

Signed-off-by: Srikanth Chekuri <srikanth.chekuri92@gmail.com>

yurishkuro

In this form the implementation has a memory leak where data will grow unbounded. Note these two things:

The storage only needs to keep N buckets of throughput because the processor uses them with exponential decay (there is a parameter in the processor that specifies how many). So we could bound the storage by that number.
Even though the final probabilities are appended to the storage, the processor only uses the latest record. The only reason we used to store the previous ones is to have a history of calculations for debugging purposes (it's not even exposed via API, only by looking at the database directly). So the simplest implementation would be to keep exactly one record of probabilities in memory, 2nd simplest to keep a fixed-length round-robin buffer (but then there should be a way to inspect it since there's no database we can query, e.g. by exposing via expvar).

plugin/storage/memory/lock.go

plugin/storage/memory/lock_test.go

plugin/storage/memory/sampling.go

plugin/storage/memory/sampling_test.go

srikanthccv · 2021-10-22T02:19:38Z

Thanks @yurishkuro for the review. I took the default unbounded number of traces as an inspiration but given the processor implementation it makes sense to limit the number of entries to N. Regarding probabilities, I think we can go with the simplest approach first.

Co-authored-by: Yuri Shkuro <yurishkuro@users.noreply.github.com> Signed-off-by: Srikanth Chekuri <srikanth.chekuri92@gmail.com>

Signed-off-by: Srikanth Chekuri <srikanth.chekuri92@gmail.com>

…nto srikanth/in-memory Signed-off-by: Srikanth Chekuri <srikanth.chekuri92@gmail.com>

plugin/storage/memory/lock_test.go

plugin/storage/memory/sampling.go

codecov · 2021-10-25T03:40:07Z

Codecov Report

Merging #3335 (fab68ef) into master (b53d901) will increase coverage by 0.03%.
The diff coverage is 100.00%.

@@            Coverage Diff             @@
##           master    #3335      +/-   ##
==========================================
+ Coverage   96.01%   96.04%   +0.03%     
==========================================
  Files         259      261       +2     
  Lines       15422    15464      +42     
==========================================
+ Hits        14807    14852      +45     
+ Misses        523      520       -3     
  Partials       92       92

Impacted Files	Coverage Δ
plugin/sampling/strategystore/adaptive/factory.go	`100.00% <100.00%> (ø)`
plugin/storage/cassandra/factory.go	`97.08% <100.00%> (ø)`
plugin/storage/memory/factory.go	`100.00% <100.00%> (ø)`
plugin/storage/memory/lock.go	`100.00% <100.00%> (ø)`
plugin/storage/memory/sampling.go	`100.00% <100.00%> (ø)`
cmd/collector/app/server/zipkin.go	`68.29% <0.00%> (-2.44%)`	⬇️
cmd/query/app/static_handler.go	`97.60% <0.00%> (-1.20%)`	⬇️
plugin/storage/integration/integration.go	`79.28% <0.00%> (+0.39%)`	⬆️
plugin/storage/badger/spanstore/reader.go	`96.21% <0.00%> (+0.70%)`	⬆️
pkg/config/tlscfg/cert_watcher.go	`94.73% <0.00%> (+2.10%)`	⬆️

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update b53d901...fab68ef. Read the comment docs.

Signed-off-by: Srikanth Chekuri <srikanth.chekuri92@gmail.com>

yurishkuro · 2021-10-25T04:51:56Z

plugin/storage/memory/sampling.go

+	maxBuckets          int
+}
+
+type Throughput struct {


nit: these don't look like they need to be public

Please suggest some name. I had a hard time differentiating b/w throughputs and lowercase only made it more difficult.

storedThroughput?

Signed-off-by: Srikanth Chekuri <srikanth.chekuri92@gmail.com>

yurishkuro · 2021-10-25T12:46:07Z

Thanks! There's probably a place in the docs we can tweak to mention this new backend.

srikanthccv · 2021-10-25T12:59:21Z

It is the docs https://www.jaegertracing.io/docs/1.27/sampling/#adaptive-sampling that led me to the issue. Do you want me to send a PR updating the docs now itself or once the remaining backends are also supported? I am planning to send a follow up PR for es (and badger later).

yurishkuro · 2021-10-25T13:51:46Z

Hey, it's up to you. We're approaching the next release (in a week), I am not sure if there's enough time to implement other (real) backends. Maybe makes sense to just send a small PR for the next-release docs mentioning memory option.

Add in-memory storage support for adaptive sampling

8d97f89

Signed-off-by: Srikanth Chekuri <srikanth.chekuri92@gmail.com>

srikanthccv marked this pull request as ready for review October 21, 2021 05:28

srikanthccv requested a review from a team as a code owner October 21, 2021 05:28

srikanthccv requested a review from vprithvi October 21, 2021 05:28

Merge branch 'master' into srikanth/in-memory

11c81ae

yurishkuro reviewed Oct 21, 2021

View reviewed changes

Apply suggestions from code review

9dc6d6d

Co-authored-by: Yuri Shkuro <yurishkuro@users.noreply.github.com> Signed-off-by: Srikanth Chekuri <srikanth.chekuri92@gmail.com>

srikanthccv force-pushed the srikanth/in-memory branch from 64e8335 to 9dc6d6d Compare October 22, 2021 02:22

srikanthccv added 3 commits October 22, 2021 07:53

Merge branch 'master' into srikanth/in-memory

7e7d768

Address review comments

7436ec9

Signed-off-by: Srikanth Chekuri <srikanth.chekuri92@gmail.com>

Merge branch 'srikanth/in-memory' of github.com:lonewolf3739/jaeger i…

3a9a96d

…nto srikanth/in-memory Signed-off-by: Srikanth Chekuri <srikanth.chekuri92@gmail.com>

yurishkuro reviewed Oct 24, 2021

View reviewed changes

srikanthccv mentioned this pull request Oct 24, 2021

Remove unused method GetProbabilitiesAndQPS from samplingstore #3339

Merged

Merge branch 'master' into srikanth/in-memory

ebb2dc5

Address review comments and incorporate suggestions

9184996

Signed-off-by: Srikanth Chekuri <srikanth.chekuri92@gmail.com>

yurishkuro reviewed Oct 25, 2021

View reviewed changes

srikanthccv added 2 commits October 25, 2021 10:52

Code coverage for storage factory methods

88feae3

Signed-off-by: Srikanth Chekuri <srikanth.chekuri92@gmail.com>

Make Throughput and ServiceOpProbs private

fab68ef

Signed-off-by: Srikanth Chekuri <srikanth.chekuri92@gmail.com>

yurishkuro enabled auto-merge (squash) October 25, 2021 12:42

yurishkuro approved these changes Oct 25, 2021

View reviewed changes

yurishkuro merged commit e9bf6ed into jaegertracing:master Oct 25, 2021

yurishkuro mentioned this pull request Oct 25, 2021

Storage backends for adaptive sampling #3305

Open

6 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add in-memory storage support for adaptive sampling #3335

Add in-memory storage support for adaptive sampling #3335

srikanthccv commented Oct 21, 2021

yurishkuro left a comment

srikanthccv commented Oct 22, 2021

codecov bot commented Oct 25, 2021 •

edited

Loading

yurishkuro Oct 25, 2021

srikanthccv Oct 25, 2021

yurishkuro Oct 25, 2021

yurishkuro commented Oct 25, 2021

srikanthccv commented Oct 25, 2021

yurishkuro commented Oct 25, 2021

Add in-memory storage support for adaptive sampling #3335

Add in-memory storage support for adaptive sampling #3335

Conversation

srikanthccv commented Oct 21, 2021

Which problem is this PR solving?

yurishkuro left a comment

Choose a reason for hiding this comment

srikanthccv commented Oct 22, 2021

codecov bot commented Oct 25, 2021 • edited Loading

Codecov Report

yurishkuro Oct 25, 2021

Choose a reason for hiding this comment

srikanthccv Oct 25, 2021

Choose a reason for hiding this comment

yurishkuro Oct 25, 2021

Choose a reason for hiding this comment

yurishkuro commented Oct 25, 2021

srikanthccv commented Oct 25, 2021

yurishkuro commented Oct 25, 2021

codecov bot commented Oct 25, 2021 •

edited

Loading