Add DB Read/Write Tracking to Benchmarking Pipeline #6386

shawntabrizi · 2020-06-18T03:18:52Z

This PR is the first step toward adding full automation to the benchmarking pipeline.

Here, we modify the Bench DB to add an additional HashMap to track keys that have been read from and written to.

We are then able to run out bechmarks, and inspect this read/write tracker to see the number of times we do a read, repeat read, write, or repeat write.

This information is then stored as part of our BenchmarkResults output and displayed as part of the CLI.

Next steps for this is:

Add a whitelist of keys to ignore for counting
Add regression analysis to the reads/writes
Output to Rust module

shawntabrizi · 2020-06-18T10:53:41Z

Here is an example of the output:

➜  cli git:(shawntabrizi-bench-db-tracking) ✗ ../../../target/debug/substrate benchmark --pallet balances --extrinsic transfer --raw
2020-06-18 12:48:11 💸 new validator set of size 1 has been elected via ElectionCompute::OnChain for era 0
Pallet: "balances", Extrinsic: "transfer", Lowest values: [], Highest values: [], Steps: [], Repeat: 1
u,e,extrinsic_time,storage_root_time,reads,repeat_reads,writes,repeat_writes
1,1000,724000,53000,7,10,5,0
100,1000,715000,49000,7,10,5,0
199,1000,745000,33000,7,10,5,0
298,1000,743000,35000,7,10,5,0
397,1000,743000,33000,7,10,5,0
496,1000,790000,31000,7,10,5,0
595,1000,650000,29000,7,10,5,0
694,1000,595000,27000,7,10,5,0
793,1000,593000,28000,7,10,5,0
892,1000,608000,28000,7,10,5,0
991,1000,610000,28000,7,10,5,0
1000,2,600000,101000,7,10,5,0
1000,101,602000,28000,7,10,5,0
1000,200,603000,28000,7,10,5,0
1000,299,610000,28000,7,10,5,0
1000,398,613000,27000,7,10,5,0
1000,497,604000,29000,7,10,5,0
1000,596,604000,28000,7,10,5,0
1000,695,611000,28000,7,10,5,0
1000,794,612000,29000,7,10,5,0
1000,893,648000,27000,7,10,5,0
1000,992,604000,28000,7,10,5,0

client/db/src/bench.rs

kianenigma

Looks good to me, I'll try and do another review later.

This means that we don't have to do that manual code analysis counting reads and writes right? I am curious to just see how off we were before 😁

shawntabrizi · 2020-06-18T12:37:13Z

@kianenigma This fixes the static read/write counting we were doing for extrinsics. It should give us the data so that we can create a weight formula with a "maximum" number of reads/writes the extrinsic will do.

But it does not fix the on_initialize/on_finalize counting that we are doing and still need to do.

shawntabrizi · 2020-06-18T22:17:00Z

Could I also get a review on: https://github.com/paritytech/substrate/pull/6405/files

This is a PR on top of this PR to add whitelisting to to the DB Read/Write tracking

client/db/src/bench.rs

* hardcoded whitelist * Add whitelist to pipeline * Remove whitelist pipeline from CLI, add to runtime * clean-up unused db initialized whitelist

* Add selector * add tests * debug formatter for easy formula

kianenigma · 2020-06-24T06:49:53Z

client/db/src/bench.rs

@@ -109,6 +151,86 @@ impl<B: BlockT> BenchmarkingState<B> {
 		));
 		Ok(())
 	}
+
+	fn add_whitelist_to_tracker(&self) {


correct me if I am wrong: by whitelisting, basically we already assume that they have been read once? there's no inherent meaning to being whitelisted other than that.

Yes, this key is free to read and write to, and does not count in DB tracking.

Things like BlockNumber, the Sender Account, Events, etc...

Okay, one question then: won't this cause confusion with the read-write count? Say I submit a tx that has no reads and writes. Won't the read/write count of my tx then be equal to all the whitelisted ones?

(Either way I think it is all okay, I am just trying to make it clear for myself.)

no, I dont increment the read/write count when I do whitelisting. So assuming nothing is actually read/written from, you will get 0 reads and 0 writes.

client/db/src/bench.rs

kianenigma · 2020-06-24T07:07:02Z

frame/benchmarking/src/lib.rs

 							frame_support::debug::trace!(target: "benchmark", "End Benchmark: {} ns", elapsed_extrinsic);
+							let read_write_count = $crate::benchmarking::read_write_count();
+							frame_support::debug::trace!(target: "benchmark", "Read/Write Count {:?}", read_write_count);


this logs also in wasm, which is probably not desirable I think?

Suggested change

frame_support::debug::trace!(target: "benchmark", "Read/Write Count {:?}", read_write_count);

frame_support::debug::native::trace!(target: "benchmark", "Read/Write Count {:?}", read_write_count);

I also want these logs to emit during wasm execution, which is the standard way to run these benchmarks. Per conversation in "dumb questions", this should have no overhead when the log flag is not included

kianenigma

I gave this another read and still positive about it.

shawntabrizi · 2020-06-24T08:13:25Z

@kianenigma i agree, I would like to keep it separate if possible, and start merging in this working first step

Co-authored-by: Kian Paimani <5588131+kianenigma@users.noreply.github.com>

shawntabrizi · 2020-06-24T16:45:13Z

For anyone reviewing, line width error here I think is unavoidable (caused by string literal in hex! macro.

shawntabrizi added 4 commits June 17, 2020 13:41

initial mockup

7468464

add and wipe

947d68c

track writes

9c77f50

start to add to pipeline

2f198b9

shawntabrizi added the A3-in_progress Pull request is in progress. No review needed at this stage. label Jun 18, 2020

shawntabrizi and others added 11 commits June 18, 2020 09:21

return all reads/writes

12395ae

Log reads and writes from bench db

69f137d

causes panic

60f22c3

Allow multiple commits

e08ffef

commit before ending benchmark

86ecbeb

Merge branch 'master' into shawntabrizi-bench-db-tracking

f7305a2

doesn't work???

004bea1

fix

1b25a8b

Update lib.rs

c995215

switch to struct for BenchmarkResults

2eef7fa

add to output

473f492

shawntabrizi marked this pull request as ready for review June 18, 2020 10:53

github-actions bot added A0-please_review Pull request needs code review. and removed A3-in_progress Pull request is in progress. No review needed at this stage. labels Jun 18, 2020

shawntabrizi added 2 commits June 18, 2020 13:14

fix test

0ae5300

line width

fd99bda

shawntabrizi added B0-silent Changes should not be mentioned in any release notes C1-low PR touches the given topic and has a low impact on builders. labels Jun 18, 2020

kianenigma reviewed Jun 18, 2020

View reviewed changes

client/db/src/bench.rs Outdated Show resolved Hide resolved

kianenigma approved these changes Jun 18, 2020

View reviewed changes

@kianenigma review

79cd015

shawntabrizi mentioned this pull request Jun 18, 2020

Companion PR for #6386 (Read/Write Tracking) paritytech/polkadot#1284

Merged

cheme reviewed Jun 19, 2020

View reviewed changes

client/db/src/bench.rs Show resolved Hide resolved

Add Whitelist to DB Tracking in Benchmarks Pipeline (#6405)

9464475

* hardcoded whitelist * Add whitelist to pipeline * Remove whitelist pipeline from CLI, add to runtime * clean-up unused db initialized whitelist

github-actions bot added the A7-needspolkadotpr label Jun 20, 2020

Add regression analysis to DB Tracking (#6475)

956cda1

* Add selector * add tests * debug formatter for easy formula

kianenigma reviewed Jun 24, 2020

View reviewed changes

client/db/src/bench.rs Outdated Show resolved Hide resolved

kianenigma reviewed Jun 24, 2020

View reviewed changes

shawntabrizi and others added 2 commits June 24, 2020 18:12

Update client/db/src/bench.rs

893bc52

Co-authored-by: Kian Paimani <5588131+kianenigma@users.noreply.github.com>

Merge branch 'master' into shawntabrizi-bench-db-tracking

00cb79b

github-actions bot removed the A7-needspolkadotpr label Jun 24, 2020

gavofyork merged commit 7f5dd73 into master Jun 24, 2020

gavofyork deleted the shawntabrizi-bench-db-tracking branch June 24, 2020 19:03

shawntabrizi mentioned this pull request Jul 6, 2020

Fully Automated Benchmarking and Weight Generation #6168

Closed

23 tasks

shawntabrizi mentioned this pull request Sep 27, 2022

Member Request polkadot-fellows/seeding#8

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add DB Read/Write Tracking to Benchmarking Pipeline #6386

Add DB Read/Write Tracking to Benchmarking Pipeline #6386

shawntabrizi commented Jun 18, 2020 •

edited

Loading

shawntabrizi commented Jun 18, 2020

kianenigma left a comment

shawntabrizi commented Jun 18, 2020

shawntabrizi commented Jun 18, 2020

kianenigma Jun 24, 2020

shawntabrizi Jun 24, 2020

kianenigma Jun 24, 2020

kianenigma Jun 24, 2020

shawntabrizi Jun 24, 2020

kianenigma Jun 24, 2020 •

edited

Loading

shawntabrizi Jun 24, 2020

kianenigma left a comment

shawntabrizi commented Jun 24, 2020

shawntabrizi commented Jun 24, 2020

	frame_support::debug::trace!(target: "benchmark", "Read/Write Count {:?}", read_write_count);
	frame_support::debug::native::trace!(target: "benchmark", "Read/Write Count {:?}", read_write_count);

Add DB Read/Write Tracking to Benchmarking Pipeline #6386

Add DB Read/Write Tracking to Benchmarking Pipeline #6386

Conversation

shawntabrizi commented Jun 18, 2020 • edited Loading

shawntabrizi commented Jun 18, 2020

kianenigma left a comment

Choose a reason for hiding this comment

shawntabrizi commented Jun 18, 2020

shawntabrizi commented Jun 18, 2020

kianenigma Jun 24, 2020

Choose a reason for hiding this comment

shawntabrizi Jun 24, 2020

Choose a reason for hiding this comment

kianenigma Jun 24, 2020

Choose a reason for hiding this comment

kianenigma Jun 24, 2020

Choose a reason for hiding this comment

shawntabrizi Jun 24, 2020

Choose a reason for hiding this comment

kianenigma Jun 24, 2020 • edited Loading

Choose a reason for hiding this comment

shawntabrizi Jun 24, 2020

Choose a reason for hiding this comment

kianenigma left a comment

Choose a reason for hiding this comment

shawntabrizi commented Jun 24, 2020

shawntabrizi commented Jun 24, 2020

shawntabrizi commented Jun 18, 2020 •

edited

Loading

kianenigma Jun 24, 2020 •

edited

Loading