Display walltime benchmarks with subnanosecond precision #124774

the8472 · 2024-05-05T21:13:22Z

With modern CPUs running at more than one cycle per nanosecond the current precision is insufficient to resolve differences worth several cycles per iteration.

Granted, walltime benchmarks often are noisy but occasionally, especially when no allocations are involved, the difference really is just a few cycles.

example results when benchmarking 1-4 serialized ADD instructions and an empty bench body

running 4 tests
test add  ... bench:           0.24 ns/iter (+/- 0.00)
test add2 ... bench:           0.48 ns/iter (+/- 0.01)
test add3 ... bench:           0.72 ns/iter (+/- 0.01)
test add4 ... bench:           0.96 ns/iter (+/- 0.01)
test empty ... bench:           0.24 ns/iter (+/- 0.00)

rustbot · 2024-05-05T21:13:29Z

r? @jhpratt

rustbot has assigned @jhpratt.
They will have a look at your PR within the next two weeks and either review your PR or reassign to another reviewer.

Use r? to explicitly pick a reviewer

example results when benchmarking 1-4 serialized ADD instructions ``` running 4 tests test add ... bench: 0.24 ns/iter (+/- 0.00) test add2 ... bench: 0.48 ns/iter (+/- 0.01) test add3 ... bench: 0.72 ns/iter (+/- 0.01) test add4 ... bench: 0.96 ns/iter (+/- 0.01) ```

rustbot · 2024-05-05T22:25:07Z

Some changes occurred in run-make tests.

cc @jieyouxu

jhpratt · 2024-05-05T23:34:19Z

Do you know if there's a reason that it's always calculated in nanoseconds? Being most familiar with criterion (as are many others), I suspect that displaying ms, µs, ns, and ps as appropriate is reasonable.

the8472 · 2024-05-06T08:00:48Z

Switching units when doing a before/after comparison would makes thing more difficult to eyeball.

jhpratt · 2024-05-09T21:52:43Z

@bors r+

bors · 2024-05-09T21:52:45Z

📌 Commit 2a7c42f has been approved by jhpratt

It is now in the queue for this repository.

bors · 2024-05-10T08:59:11Z

⌛ Testing commit 2a7c42f with merge e93f342...

bors · 2024-05-10T11:05:25Z

☀️ Test successful - checks-actions
Approved by: jhpratt
Pushing e93f342 to master...

rust-timer · 2024-05-10T12:24:01Z

Finished benchmarking commit (e93f342): comparison URL.

Overall result: no relevant changes - no action needed

@rustbot label: -perf-regression

Instruction count

This benchmark run did not return any relevant results for this metric.

Max RSS (memory usage)

Results

This is a less reliable metric that may be of interest but was not used to determine the overall result at the top of this comment.

	mean	range	count
Regressions ❌ (primary)	-	-	0
Regressions ❌ (secondary)	-	-	0
Improvements ✅ (primary)	-2.7%	[-2.7%, -2.7%]	1
Improvements ✅ (secondary)	-	-	0
All ❌✅ (primary)	-2.7%	[-2.7%, -2.7%]	1

Cycles

This benchmark run did not return any relevant results for this metric.

Binary size

This benchmark run did not return any relevant results for this metric.

Bootstrap: 674.951s -> 674.435s (-0.08%)
Artifact size: 315.92 MiB -> 315.80 MiB (-0.04%)

rustbot assigned jhpratt May 5, 2024

rustbot added S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. T-bootstrap Relevant to the bootstrap subteam: Rust's build system (x.py and src/bootstrap) T-libs Relevant to the library team, which will review and decide on the PR/issue. labels May 5, 2024

the8472 added the A-libtest Area: #[test] related label May 5, 2024

This comment has been minimized.

Sign in to view

the8472 added 3 commits May 6, 2024 00:25

emit fractional benchmark nanoseconds in libtest's JSON output format

e867d7c

bootstrap should also render fractional nanoseconds for benchmarks

2a7c42f

the8472 force-pushed the subnanosecond-benches branch from 1d9e681 to 2a7c42f Compare May 5, 2024 22:25

bors added S-waiting-on-bors Status: Waiting on bors to run and complete tests. Bors will change the label on completion. and removed S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. labels May 9, 2024

bors added the merged-by-bors This PR was explicitly merged by bors. label May 10, 2024

bors merged commit e93f342 into rust-lang:master May 10, 2024
7 checks passed

rustbot added this to the 1.80.0 milestone May 10, 2024

Swatinem mentioned this pull request May 12, 2024

Add suppor for fractional libtest output BurntSushi/cargo-benchcmp#48

Open

blaine-arcjet mentioned this pull request May 13, 2024

fix: Support sub-nanosecond precision on Cargo benchmarks benchmark-action/github-action-benchmark#243

Closed

epompeii mentioned this pull request May 14, 2024

New version error: Failed to convert results with adapter rust/rust_bench bencherdev/bencher#390

Closed

ktrz mentioned this pull request May 19, 2024

Support sub-nanosecond precision on Cargo benchmarks benchmark-action/github-action-benchmark#244

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Display walltime benchmarks with subnanosecond precision #124774

Display walltime benchmarks with subnanosecond precision #124774

the8472 commented May 5, 2024

rustbot commented May 5, 2024

This comment has been minimized.

rustbot commented May 5, 2024

jhpratt commented May 5, 2024

the8472 commented May 6, 2024

jhpratt commented May 9, 2024

bors commented May 9, 2024

bors commented May 10, 2024

bors commented May 10, 2024

rust-timer commented May 10, 2024

Display walltime benchmarks with subnanosecond precision #124774

Display walltime benchmarks with subnanosecond precision #124774

Conversation

the8472 commented May 5, 2024

rustbot commented May 5, 2024

This comment has been minimized.

rustbot commented May 5, 2024

jhpratt commented May 5, 2024

the8472 commented May 6, 2024

jhpratt commented May 9, 2024

bors commented May 9, 2024

bors commented May 10, 2024

bors commented May 10, 2024

rust-timer commented May 10, 2024

Overall result: no relevant changes - no action needed

Instruction count

Max RSS (memory usage)

Cycles

Binary size