Benchmarking #1729

kvark · 2022-02-16T02:48:50Z

Integrates Criterion and also adds support for bincode-encoded IR form. The purpose here is to see the upper bound for shader loading performance. Today, bincode IR deserialization is roughly 7x faster than WGSL parsing.

kvark · 2022-02-16T03:08:09Z

Results from my machine:

Gnuplot not found, using plotters backend
front/bin               time:   [143.70 us 146.65 us 149.74 us]                      
Found 4 outliers among 100 measurements (4.00%)
  3 (3.00%) high mild
  1 (1.00%) high severe
Benchmarking front/wgsl: Warming up for 3.0000 s
Warning: Unable to complete 100 samples in 5.0s. You may wish to increase target time to 5.7s, enable flat sampling, or reduce sample count to 60.
front/wgsl              time:   [1.1065 ms 1.1268 ms 1.1453 ms]                        
front/spv               time:   [56.855 us 58.214 us 59.836 us]                      
Found 14 outliers among 100 measurements (14.00%)
  5 (5.00%) high mild
  9 (9.00%) high severe
front/glsl              time:   [3.7642 ms 3.8215 ms 3.8864 ms]                        
Found 11 outliers among 100 measurements (11.00%)
  5 (5.00%) high mild
  6 (6.00%) high severe

valid/safe              time:   [122.50 us 123.12 us 123.97 us]                       
Found 2 outliers among 100 measurements (2.00%)
  2 (2.00%) high severe
valid/unsafe            time:   [86.239 us 86.530 us 86.842 us]                         
Found 2 outliers among 100 measurements (2.00%)
  1 (1.00%) high mild
  1 (1.00%) high severe

back/wgsl               time:   [637.90 us 640.21 us 642.65 us]                      
back/spv                time:   [402.15 us 403.86 us 405.86 us]                     
Found 5 outliers among 100 measurements (5.00%)
  2 (2.00%) high mild
  3 (3.00%) high severe
back/spv-separate       time:   [323.96 us 324.86 us 325.73 us]                              
Found 1 outliers among 100 measurements (1.00%)
  1 (1.00%) high severe
back/msl                time:   [895.71 us 900.71 us 906.08 us]                     
Found 3 outliers among 100 measurements (3.00%)
  3 (3.00%) high mild
back/hlsl               time:   [774.44 us 778.98 us 784.89 us]                      
Found 4 outliers among 100 measurements (4.00%)
  1 (1.00%) high mild
  3 (3.00%) high severe
back/glsl-separate      time:   [930.78 us 932.80 us 934.88 us]                               
Found 6 outliers among 100 measurements (6.00%)
  3 (3.00%) high mild
  3 (3.00%) high severe
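
As a quick arithmetic check of the "roughly 7x" figure, the ratios can be computed from the median times reported above. This is only a sketch: the `speedup` helper is hypothetical and not part of the PR, and the inputs are the medians copied from this run (front/wgsl at 1126.8 us, front/bin at 146.65 us, valid/safe at 123.12 us, valid/unsafe at 86.530 us).

```rust
/// Hypothetical helper: ratio of a baseline time to a candidate time.
/// Both arguments must be in the same unit (microseconds here).
fn speedup(baseline_us: f64, candidate_us: f64) -> f64 {
    baseline_us / candidate_us
}

/// Median times from the run above, in microseconds.
const FRONT_WGSL_US: f64 = 1126.8;
const FRONT_BIN_US: f64 = 146.65;
const VALID_SAFE_US: f64 = 123.12;
const VALID_UNSAFE_US: f64 = 86.530;

/// How much faster bincode deserialization is than WGSL parsing.
fn bin_vs_wgsl() -> f64 {
    speedup(FRONT_WGSL_US, FRONT_BIN_US)
}

/// How much faster validation is with the "unsafe" configuration.
fn unsafe_vs_safe() -> f64 {
    speedup(VALID_SAFE_US, VALID_UNSAFE_US)
}
```

With these medians, bincode loading comes out at about 7.7 times faster than WGSL parsing, and the unsafe validation path at about 1.4 times faster than the safe one.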

jimblandy

Looks good!

I think we should do better with feature handling; see comments.

.github/workflows/pipeline.yml

jimblandy · 2022-02-17T06:35:18Z

benches/criterion.rs

+        .unwrap();
+    for file_entry in read_dir {
+        match file_entry {
+            Ok(entry) => match entry.path().extension() {


Couldn't this just be:

Ok(entry) => if entry.path().extension() == extension { ... }

?

That would require some shenanigans to go from OsStr to str in order to compare them.
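
For reference, a minimal sketch of how such a comparison can be written with only the standard library, assuming the wanted extension is available as a `&str`. The helper names `has_extension` and `has_extension_utf8` are hypothetical and are not taken from the PR.

```rust
use std::ffi::OsStr;
use std::path::Path;

/// Hypothetical helper: true when `path` has exactly the extension `wanted`
/// (given without the leading dot). Compares as `OsStr`, so no UTF-8
/// conversion of the path is needed.
fn has_extension(path: &Path, wanted: &str) -> bool {
    path.extension() == Some(OsStr::new(wanted))
}

/// Alternative going the other way: convert the extension to `&str` first.
/// A non-UTF-8 extension simply never matches.
fn has_extension_utf8(path: &Path, wanted: &str) -> bool {
    path.extension().and_then(OsStr::to_str) == Some(wanted)
}
```

Both variants treat a path without an extension as a non-match, because `Path::extension` returns `None` in that case.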

benches/criterion.rs

jimblandy · 2022-02-17T07:48:18Z

benches/criterion.rs

+#[cfg(not(feature = "wgsl-in"))]
+fn gather_modules() -> Vec<naga::Module> {
+    Vec::new()
+}


It's a little confusing that when necessary features aren't available, we do still run benchmarks and get meaningless numbers. The loop over the modules is inside the b.iter call, not outside it. It would be clearer to somehow just not run benchmarks that require features we don't have.

That's a bit of an edge case. We don't really care about the numbers when features aren't enabled. What I did was the minimal thing to make it compile in these cases.
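
One possible shape for the follow-up, sketched with the standard library only: skip the measurement when the input list is empty instead of timing a loop over nothing. The helper `bench_if_nonempty` is hypothetical, and the closure stands in for the real Criterion call, which is not reproduced here.

```rust
/// Hypothetical helper: invokes `bench` only when there is input to measure.
/// Returns true if the benchmark body ran, false if it was skipped.
fn bench_if_nonempty<T>(name: &str, inputs: &[T], mut bench: impl FnMut(&[T])) -> bool {
    if inputs.is_empty() {
        // An empty list most likely means a required feature is disabled,
        // so any timing produced here would be meaningless.
        eprintln!("skipping {}: no inputs (is the required feature enabled?)", name);
        return false;
    }
    bench(inputs);
    true
}
```

In the benchmark file, the closure would wrap the existing measurement body, so a build without the relevant feature reports a skip instead of a number.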

jimblandy · 2022-02-17T07:50:20Z

benches/criterion.rs

+            .collect::<Vec<_>>()
+    };
+    #[cfg(not(feature = "validate"))]
+    let inputs = Vec::<(naga::Module, naga::valid::ModuleInfo)>::new();


Again, we shouldn't run benchmarks on empty input lists because a feature was turned off.

Similar reasoning to the last one: this is the minimal change to make it compile.

src/front/wgsl/mod.rs

kvark · 2022-02-17T14:47:17Z

Thanks for the review! Some of the notes are addressed; others are left for follow-ups.

kvark requested a review from jimblandy February 16, 2022 02:48

jimblandy approved these changes Feb 17, 2022


kvark added 7 commits February 17, 2022 09:45

Bincode support for naga-cli

5fbb4a5

bench: WGSL parsing

24278d4

bench: glsl, spirv, and bincode inputs

131f5a2

bench: validation

6f076b3

bench: backends

045e558

Don't test on MSRV, just check

20b93e4

Address Jim's notes

b84338e

kvark force-pushed the binary branch from 97168a9 to b84338e Compare February 17, 2022 14:46

kvark enabled auto-merge (rebase) February 17, 2022 14:47

kvark merged commit 8fd8e7d into gfx-rs:master Feb 17, 2022
