SPMI: Improve speed significantly for large diffs #76238

jakobbotsch · 2022-09-27T11:57:07Z

This starts communicating more information about diffs back from
superpmi and starts using it in the driver. The current information is
the base size, diff size, base instructions, diff instructions and
context size.

The driver uses the base size/diff size to pick a small number of
interesting contexts and only creates disassembly for these contexts.
jit-analyze is then also invoked on only these contexts, but the
contexts are picked so that they overlap with what jit-analyze would
display. Additionally, we also pick the 20 smallest contexts (in
terms of context size) -- I frequently use the smallest contexts with
diffs to investigate my changes.

The new behavior is only enabled when no custom metrics are specified.
If custom metrics are specified we fall back to producing all
disassembly files and leave it up to jit-analyze to extract and analyze
the metrics specified.

Also, the retainTopFilesOnly option is no longer necessary since our CI
pipeline will at most produce 100 .dasm files now. Another benefit is that
this means that all contexts mentioned in the jit-analyze output will now be
part of the artifacts.

The net result is that we can now get SPMI diffs for changes with even
hundreds of thousands of diffs in the same time as it takes to get diffs
for a small change.

Fix #76178

This starts communicating more information about diffs back from superpmi and starts using it in the driver. The current information is the base size, diff size, base instructions, diff instructions and context size. The driver uses the base size/diff size to pick a small number of interesting contexts and only creates disassembly for these contexts. jit-analyze is then also invoked on only these contexts, but the contexts are picked so that they overlap with what jit-analyze would display. Additionally, we also pick the top 20 smallest contexts (in terms of context size) -- I frequently use the smallest contexts with diffs to investigate my change. The new behavior is only enabled when no custom metrics are specified. If custom metrics are specified we fall back to producing all disassembly files and leave it up to jit-analyze to extract and analyze the metrics specified. The net result is that we can now get SPMI diffs for changes with even hundreds of thousands of diffs in the same time as it takes to get diffs for a small change. Fix dotnet#76178

This is no longer necessary since we avoid creating these disassembly files in the first place. Another benefit is that all contexts mentioned by jit-analyze output will now be part of the artifacts.

ghost · 2022-09-27T11:57:25Z

Tagging subscribers to this area: @JulieLeeMSFT, @jakobbotsch
See info in area-owners.md if you want to be subscribed.

Issue Details

This starts communicating more information about diffs back from
superpmi and starts using it in the driver. The current information is
the base size, diff size, base instructions, diff instructions and
context size.

The driver uses the base size/diff size to pick a small number of
interesting contexts and only creates disassembly for these contexts.
jit-analyze is then also invoked on only these contexts, but the
contexts are picked so that they overlap with what jit-analyze would
display. Additionally, we also pick the 20 smallest contexts (in
terms of context size) -- I frequently use the smallest contexts with
diffs to investigate my changes.

The new behavior is only enabled when no custom metrics are specified.
If custom metrics are specified we fall back to producing all
disassembly files and leave it up to jit-analyze to extract and analyze
the metrics specified.

The net result is that we can now get SPMI diffs for changes with even
hundreds of thousands of diffs in the same time as it takes to get diffs
for a small change.

Fix #76178

Author:	jakobbotsch
Assignees:	-
Labels:	`area-CodeGen-coreclr`
Milestone:	-

jakobbotsch · 2022-09-27T12:00:16Z

src/coreclr/tools/superpmi/superpmi/superpmi.cpp

@@ -171,7 +181,8 @@ int __cdecl main(int argc, char* argv[])
 #endif

    bool   collectThroughput = false;
-    MCList failingToReplayMCL, diffMCL;
+    MCList failingToReplayMCL;


It will likely be useful to give the failing list the same treatment so that we can have superpmi.py replay print the smallest contexts with failures, instead of printing everything. I will leave this for a future PR however.

Coercing integer to pointer in constexpr context is invalid. Clang happily miscompiles the 'invalid handle check' to a null pointer check.

kunalspathak · 2022-09-27T20:15:00Z

Very nice to see it.

jakobbotsch · 2022-09-27T21:06:40Z

cc @dotnet/jit-contrib PTAL @AndyAyersMS
This should be ready for review barring any unforeseen failures in the related pipelines. The morph.cpp change is just to see it work, I will revert it before merging.

Hopefully I can get to see diffs for #76017 and #76185 with this change.

jakobbotsch · 2022-09-28T09:32:23Z

The linux-arm64 job seems to have hung during the .mch download. The superpmi replay is failing on arm32 due to the morph change, seems like a bug in either lowering or LSRA:

[22:19:58] ISSUE: <ASSERT> #271154 D:\a\_work\1\s\src\coreclr\jit\lsra.cpp (11934) - Assertion failed 'refPosition->RegOptional()' in 'System.Numerics.Tests.ComplexTests:Abs(double,double)' during 'LSRA build intervals' (IL size 22; hash 0xbf3bc3d1; FullOpts)

I'll try a different change with large diffs.

jakobbotsch · 2022-09-28T14:31:51Z

Clean run for a JIT change that has diffs in around 100k contexts: https://dev.azure.com/dnceng-public/public/_build/results?buildId=33036&view=ms.vss-build-web.run-extensions-tab

kunalspathak · 2022-09-28T20:53:31Z

seems like a bug in either lowering or LSRA

Can we have an issue for this?

jakobbotsch · 2022-09-29T14:08:33Z

seems like a bug in either lowering or LSRA

Can we have an issue for this?

Opened #76382.

jakobbotsch · 2022-09-29T16:51:02Z

Ping @AndyAyersMS (or maybe @kunalspathak / @BruceForstall want to take a look?)

AndyAyersMS

Overall this looks good.

I usually use IL size to prioritize simple repro cases, but I imagine MC size likely ends up being even a better choice.

AndyAyersMS · 2022-09-29T17:09:31Z

src/coreclr/tools/superpmi/superpmi/superpmi.cpp

+
+                            // This is a difference in ASM outputs from Jit1 & Jit2 and not a playback failure
+                            // We will add this MC to the diffs info if there is one.
+                            // Otherwise this will end up in failingMCList


I find this a bit puzzling. Can you say more about why we would add this to the failing MCL?

This is just preserving the existing behavior (it was inside InvokeNearDiffer before).
I'm not sure why we are regarding diffs as failures when -diffMCList (-diffsInfo after this change) is unspecified.

BruceForstall · 2022-10-01T04:21:00Z

This is great work; thanks for doing this.

BruceForstall · 2022-10-01T04:24:03Z

src/coreclr/scripts/superpmi.py

                    subproc_helper.run_to_completion(create_replay_artifacts, self, mch_file, asm_complus_vars_full_env, text_differences, base_asm_location, diff_asm_location, ".dasm")

                    if self.coreclr_args.diff_jit_dump:
                        logging.info("Creating JitDump files: %s %s", base_dump_location, diff_dump_location)
                        subproc_helper.run_to_completion(create_replay_artifacts, self, mch_file, jit_dump_complus_vars_full_env, jit_dump_differences, base_dump_location, diff_dump_location, ".txt")

-                    logging.info("Differences found. To replay SuperPMI use:")
+                    logging.info("Differences found. To replay SuperPMI use:".format(len(diffs)))
+


What is this ".format..."? Shouldn't it use the logging '%s' format? And I don't see any replacement token.

logging uses the same conventions as the old % formatting operator. This is the newer str.format, which I think would normally be preferred... except that logging probably isn't going to format the string unless the particular logging level has been specified.

(and yes, it is missing {} or {0})

Whoops yes, at one point I was printing the number of diffs as part of this line, but I decided to print it on a new line at the end of each collection diff so that it gets printed last instead, and you can see it while waiting for the next diffs.

I'll keep in mind to remove this with my next change.

It looks like logging can be switched to the new formatter: https://docs.python.org/3/howto/logging-cookbook.html#using-particular-formatting-styles-throughout-your-application

That would maybe be a nice cleanup at some point.

Ah, looks like 'style' came with 3.2. Not sure what version of Python is on the CI systems, but presumably better than that.

Yes, I (and some others) have been using new style formatting in various scripts for a while now without issues.

jakobbotsch added 3 commits September 27, 2022 13:46

Remove --retainOnlyTopFiles

6d78190

This is no longer necessary since we avoid creating these disassembly files in the first place. Another benefit is that all contexts mentioned by jit-analyze output will now be part of the artifacts.

Test change with tons of diffs

c96e35c

dotnet-issue-labeler bot added the area-CodeGen-coreclr CLR JIT compiler in src/coreclr/src/jit and related components such as SuperPMI label Sep 27, 2022

ghost assigned jakobbotsch Sep 27, 2022

jakobbotsch commented Sep 27, 2022

View reviewed changes

jakobbotsch added 9 commits September 27, 2022 14:22

DRY it up

f079301

Fix bad refactoring

67aca1a

Include utility for std::move

95583e3

Fix Clang/GCC compilation

be172a6

Coercing integer to pointer in constexpr context is invalid. Clang happily miscompiles the 'invalid handle check' to a null pointer check.

Fix build

22640fc

Proper move constructor for clr_std::vector, delete copy constructors

4a77ff5

Fix

c17deea

Fix superpmi_diffs.py

504d368

Run jit-format and force new CI run

30c0a8d

jakobbotsch mentioned this pull request Sep 27, 2022

Add --is-diffs-only and --is-subset-of-diffs dotnet/jitutils#356

Merged

jakobbotsch added 2 commits September 27, 2022 22:29

Minor fixes

ccb37d8

Nitpicking order..

55ae448

jakobbotsch marked this pull request as ready for review September 27, 2022 21:02

jakobbotsch requested a review from AndyAyersMS September 27, 2022 21:06

jakobbotsch added 6 commits September 28, 2022 11:33

Revert morph change

1f3d9f0

JIT: Smarter ordering of late args based on register uses

c36637d

Fix build

a76ef14

Fix arg reg check

472749f

Generalize

f283a60

Revert JIT changes

5536609

AndyAyersMS approved these changes Sep 29, 2022

View reviewed changes

jakobbotsch merged commit cc934c7 into dotnet:main Sep 29, 2022

jakobbotsch deleted the jit-analyze-spmi branch September 29, 2022 17:24

BruceForstall reviewed Oct 1, 2022

View reviewed changes

This was referenced Oct 3, 2022

[JIT] Cleanup to lsra inspired by #73424 #76481

Merged

JIT: Preference locals away from PUTARG_REG killed registers #76671

Merged

JIT: Slightly relax "const vector propagation" heuristics #76788

Closed

ghost locked as resolved and limited conversation to collaborators Oct 31, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

SPMI: Improve speed significantly for large diffs #76238

SPMI: Improve speed significantly for large diffs #76238

jakobbotsch commented Sep 27, 2022 •

edited

Loading

ghost commented Sep 27, 2022

jakobbotsch Sep 27, 2022

kunalspathak commented Sep 27, 2022

jakobbotsch commented Sep 27, 2022

jakobbotsch commented Sep 28, 2022

jakobbotsch commented Sep 28, 2022

kunalspathak commented Sep 28, 2022

jakobbotsch commented Sep 29, 2022

jakobbotsch commented Sep 29, 2022 •

edited

Loading

AndyAyersMS left a comment

AndyAyersMS Sep 29, 2022

jakobbotsch Sep 29, 2022

BruceForstall commented Oct 1, 2022

BruceForstall Oct 1, 2022

markples Oct 1, 2022

jakobbotsch Oct 1, 2022

jakobbotsch Oct 1, 2022 •

edited

Loading

BruceForstall Oct 1, 2022

jakobbotsch Oct 1, 2022

SPMI: Improve speed significantly for large diffs #76238

SPMI: Improve speed significantly for large diffs #76238

Conversation

jakobbotsch commented Sep 27, 2022 • edited Loading

ghost commented Sep 27, 2022

Choose a reason for hiding this comment

kunalspathak commented Sep 27, 2022

jakobbotsch commented Sep 27, 2022

jakobbotsch commented Sep 28, 2022

jakobbotsch commented Sep 28, 2022

kunalspathak commented Sep 28, 2022

jakobbotsch commented Sep 29, 2022

jakobbotsch commented Sep 29, 2022 • edited Loading

AndyAyersMS left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

BruceForstall commented Oct 1, 2022

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

jakobbotsch Oct 1, 2022 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

jakobbotsch commented Sep 27, 2022 •

edited

Loading

jakobbotsch commented Sep 29, 2022 •

edited

Loading

jakobbotsch Oct 1, 2022 •

edited

Loading