JIT: Return multiple likely classes in getLikelyClass (for better GDV) #58984

EgorBo · 2021-09-11T14:55:33Z

This PR extends getLikelyClass to return multiple likely classes like for the following sample:

using System;
using System.Runtime.CompilerServices;
using System.Threading;

class ClassA
{
    public virtual int DoWork() => 42;
}

class ClassB : ClassA { }
class ClassC : ClassB { }

public class Program
{
    public static void Main()
    {
        for (int i = 0; i < 1000; i++)
        {
            if (i % 5 == 0)
                Test(new ClassC());
            else if (i % 3 == 0)
                Test(new ClassB());
            else
                Test(new ClassA());
            Thread.Sleep(5);
        }

        Console.ReadKey();
    }

    [MethodImpl(MethodImplOptions.NoInlining)]
    static int Test(ClassA? a) => a?.DoWork() ?? 0;
}

When it tries to devirtualize a.DoWork() it now prints:

Considering guarded devirtualization at IL offset 6 (0x6)
Likely classes for 00007FFBE5DC2C10 (ClassA):
  1) 00007FFBE5DC2C10 (ClassA) [likelihood:62%]
  2) 00007FFBE5DC3580 (ClassC) [likelihood:25%]
  3) 00007FFBE5DC3438 (ClassB) [likelihood:12%]
virtual call would invoke method DoWork
Marking call [000005] as guarded devirtualization candidate; will guess for class ClassA

I'm locally playing with it to do multiple type checks for GDV like:

if (obj is A) 
    tmp = (A)a.DoWork();
else if (obj is B)
    tmp = (B)b.DoWork();
else if (obj is C)
    tmp = (C)c.DoWork(); 
else
    obj.DoWork(); // non-devirtualized fallback

or merge multiple checks if they point to the same method (it means DoWork is not overridden in B)

if (obj is A || obj is B)
    tmp = (A)a.DoWork();
else
    obj.DoWork(); // non-devirtualized fallback

Btw, here is some statistics I've harvested on AvaloniaILSpy application in FullPGO mode:

(So only 57% of virtual calls with class-probes data are 100% monomorphic in this app)

/cc @dotnet/jit-contrib @AndyAyersMS @jakobbotsch

ghost · 2021-09-11T14:55:41Z

Tagging subscribers to this area: @JulieLeeMSFT
See info in area-owners.md if you want to be subscribed.

Issue Details

This PR extends getLikelyClass to return multiple likely classes like for the following sample:

using System;
using System.Runtime.CompilerServices;
using System.Threading;

class ClassA
{
    public virtual int DoWork() => 42;
}

class ClassB : ClassA { }
class ClassC : ClassB { }

public class Program
{
    public static void Main()
    {
        for (int i = 0; i < 1000; i++)
        {
            if (i % 5 == 0)
                Test(new ClassC());
            else if (i % 3 == 0)
                Test(new ClassB());
            else
                Test(new ClassA());
            Thread.Sleep(5);
        }

        Console.ReadKey();
    }

    [MethodImpl(MethodImplOptions.NoInlining)]
    static int Test(ClassA? a) => a?.DoWork() ?? 0;
}

When it tries to devirtualize a.DoWork() it now prints:

Considering guarded devirtualization at IL offset 6 (0x6)
Likely classes for 00007FFBE5DC2C10 (ClassA):
  1) 00007FFBE5DC2C10 (ClassA) [likelihood:62%]
  2) 00007FFBE5DC3580 (ClassC) [likelihood:25%]
  3) 00007FFBE5DC3438 (ClassB) [likelihood:12%]
virtual call would invoke method DoWork
Marking call [000005] as guarded devirtualization candidate; will guess for class ClassA

I'm locally playing with it to do multiple type checks for GDV like:

if (obj is A) 
    tmp = (A)a.DoWork();
else if (obj is B)
    tmp = (B)b.DoWork();
else if (obj is C)
    tmp = (C)c.DoWork(); 
else
    obj.DoWork(); // non-devirtualized fallback

or merge multiple checks if they point to the same method (it means DoWork is not overridden in B)

if (obj is A || obj is B)
    tmp = (A)a.DoWork();
else
    obj.DoWork(); // non-devirtualized fallback

Btw, here is some statistics I've harvested on AvaloniaILSpy application in FullPGO mode:

so non-monomorphic callsites aren't that rare as I thought.

/cc @dotnet/jit-contrib @AndyAyersMS @jakobbotsch

Author:	EgorBo
Assignees:	-
Labels:	`area-CodeGen-coreclr`
Milestone:	-

EgorBo · 2021-09-11T15:59:07Z

hm.. where should I rename getLikelyClass for clrjitilc, is it in dotnet/runtimelab?

AndyAyersMS · 2021-09-11T16:37:38Z

It will be interesting to see how this plays out.

Couple of notes:

We currently only have 8 slots in the class histogram table, so the non-dominant class probabilities are likely subject to greater relative error (e.g. the smallest non-zero likelihood you will see is 12%). I wonder if we should increase the size here so we get a more accurate picture? I didn't do this originally because there ends up being a lot of wasted table space (since most sites are monomorphic) and there's no simple way to size the tables dynamically.
The table compression done for static PGO will need revision, either allowing for MAX_LIKELY_CLASSES or a family of records that can describe more info -- we want to ensure static PGO's optimization ability stays on more or less even footing with dynamic.
I had originally though the total number of unique table entries seen would play into the heuristics; it doesn't currently, but it would still perhaps be nice to know that number, instead of capping it at MAX_LIKELY_CLASSES.
For virtual calls there's typically a 3-stage lookup: (1) method table, (2) method table chunk; (3) method table slot. In some cases one can check for equality at a later stage and cut down on the number of comparisons, when the likely cases all share a chunk or method. That is, if they share a chunk, instead of needing two method table compares as in (mt == MT1) || (mt == MT2) one can do one chunk compare (mtc = mt[...]; mtc == MT12). Not clear if this is always better, but if there are two likely classes with similar likelihoods, perhaps it would be.
Vtable shape (say whether it is chunked) can vary by type and slot and may be something we rethink someday. So the strategy needs to adapt to the types in question.
The slot entry comparison may be tricky to engineer as the optimized callee entry point address may not be known. We should think carefully about how to handle this.
When looking at secondary entries I've often thought we'd "renormalize" the histogram. Say we have 50, 20, 10, (20) as the most likely probabilities + tail. Then after we've checked for the 50 case, we divide this out, and the conditional probabilities become 40, 20, (40). And if we then check for the 40 case, we end up with 33, (66). At each level we use something like our current threshold. But to make this work well we probably need more confidence in the tail probabilities.
Not sure how this should interact with chained GDV... thoughts?
Likewise, this has the potential to greatly increase code size.... how should we think about that?

EgorBo · 2021-09-12T09:55:50Z

Thanks @AndyAyersMS these are useful notes!

instead of capping it at MAX_LIKELY_CLASSES.

I'm going to change MAX_LIKELY_CLASSES to some big value like 128 just to still allow getLikelyUsers to use a stack-allocated array as an output

Not sure how this should interact with chained GDV... thoughts?
Likewise, this has the potential to greatly increase code size.... how should we think about that?

So for now this PR doesn't change anything other than prints more info in JitDump. I'd like to start from the case where the second most popular type uses the same method as the first one. From what I see there are ~2000 such virtual calls in AvaloniaUI app. I hope it won't cause any problem for things like "chained GDV" since I want it to emit something like:

single pMT check:

JTRUE
   EQ
     IND(LCL) ;; pMT
     CNS

two pMT checks:

JTRUE
  NE
    OR
      EQ
        IND(LCL)
        CNS ;; Type1
      EQ
        IND(LCL)
        CNS ;; Type2
    CNS 0

so it won't produce any new basic block and should be still thread-jump friendly. I hope it won't affect binary-size/memory footprint much or we can always make inliner a bit less aggressive if overall performance will be better 🙂

src/coreclr/jit/likelyclass.cpp

Co-authored-by: Jakob Botsch Nielsen <Jakob.botsch.nielsen@gmail.com>

EgorBo · 2021-09-12T10:52:41Z

@jakobbotsch btw, any idea what clrjitilc is?

jakobbotsch · 2021-09-12T10:59:09Z

@jakobbotsch btw, any idea what clrjitilc is?

I think it's just the internal name used by crossgen2 for the JIT native library. It redirects to the proper dll here:

runtime/src/coreclr/tools/Common/JitInterface/JitConfigProvider.cs

Lines 44 to 63 in f29484a

    
                       NativeLibrary.SetDllImportResolver(typeof(CorInfoImpl).Assembly, (libName, assembly, searchPath) => 
        
                       { 
        
                           IntPtr libHandle = IntPtr.Zero; 
        
                           if (libName == CorInfoImpl.JitLibrary) 
        
                           { 
        
                               if (!string.IsNullOrEmpty(jitPath)) 
        
                               { 
        
                                   libHandle = NativeLibrary.Load(jitPath); 
        
                               } 
        
                               else 
        
                               { 
        
                                   libHandle = NativeLibrary.Load("clrjit_" + GetTargetSpec(target), assembly, searchPath); 
        
                               } 
        
                           } 
        
                           if (libName == CorInfoImpl.JitSupportLibrary) 
        
                           { 
        
                               libHandle = NativeLibrary.Load("jitinterface_" + RuntimeInformation.ProcessArchitecture.ToString().ToLowerInvariant(), assembly, searchPath); 
        
                           } 
        
                           return libHandle; 
        
                       });

So since you already renamed the export and pinvoke name I think it's fine.

…into gdv-improvement

src/coreclr/jit/likelyclass.cpp

Co-authored-by: Jakob Botsch Nielsen <Jakob.botsch.nielsen@gmail.com>

EgorBo · 2021-09-13T19:08:37Z

So since you already renamed the export and pinvoke name I think it's fine.

For some reason CI still screams it can't find that entrypoint in clrjitilc while locally it builds just fine :|

jakobbotsch · 2021-09-13T20:25:35Z

For some reason CI still screams it can't find that entrypoint in clrjitilc while locally it builds just fine :|

That's weird, I wonder if it's somehow using an outdated JIT dll (maybe from the sdk)? It looks like it's only happening on non-Windows builds. @davidwrighton do you have an idea what could be going on here?

src/coreclr/jit/likelyclass.cpp

davidwrighton · 2021-09-16T23:40:29Z

@EgorBo can you describe the format change to the PgoSchema data. In particular, as this is part of the static pgo format, we need to be certain that the new parser can read the old data.

Also, I think I found the spot where you failed to mark the symbol as a public symbol. See my comment above.

Co-authored-by: Bruce Forstall <brucefo@microsoft.com>

…rovement

EgorBo · 2021-09-17T09:42:30Z

@EgorBo can you describe the format change to the PgoSchema data. In particular, as this is part of the static pgo format, we need to be certain that the new parser can read the old data.

Also, I think I found the spot where you failed to mark the symbol as a public symbol. See my comment above.

From my understanding, it doesn't change the format anyhow. ComputeLikelyClass will still save only a single (most popular) class under PgoInstrumentationKind.GetLikelyClass - I left it this way because storing multiple classes there will increase profile size (this PR doesn't use that data anyhow other than printing via JITDUMP yet).

src/coreclr/tools/Common/JitInterface/CorInfoImpl.cs

src/coreclr/jit/importer.cpp

src/coreclr/jit/likelyclass.cpp

Co-authored-by: Andy Ayers <andya@microsoft.com>

EgorBo · 2021-09-17T19:24:50Z

@AndyAyersMS Thanks, addressed.

AndyAyersMS

Looks good overall. Just a few nits.

I don't suppose we can look at diffs easily. Make sure you spot-check a few cases by hand I guess.

src/coreclr/tools/Common/JitInterface/CorInfoImpl.cs

src/coreclr/jit/importer.cpp

AndyAyersMS · 2021-09-19T16:58:45Z

I'm confused why you are seeing null handles.

The LikelyClassHistogramconstructor should be filtering them out, and we should never be creating a GetLikelyClass schema record with a null handle.

EgorBo · 2021-09-19T17:57:34Z

I'm confused why you are seeing null handles.

The LikelyClassHistogramconstructor should be filtering them out, and we should never be creating a GetLikelyClass schema record with a null handle.

Thanks! So if a histogram contains a few (or even a single) nulls - does it mean such a callsite is super rare and is not worth devirtualizing? (However, I guess its block will be cold and we'll ignore it anyway)

AndyAyersMS · 2021-09-19T19:34:43Z

The runtime helpers for class profiles perform what's known as reservoir sampling. The data we track includes a count of the number of times the call site was hit.

The histogram table can have 0..8 non-null entries with current table size. They can only all be null if the counter is zero, meaning this call site was never hit at runtime (this is fairly common).

There may be "trailing" nulls if the call site is lightly hit (count 1...7).

If the call site is more frequently hit all entries should be non-null.

The same handle may appear in more than one slot in the table, which is how we deduce its relative likelihood.

AndyAyersMS

I'm still unsure on one point.

If the observations include collectible classes we put in placeholders (potentially up to 32 different values, though we currently just always use one). And we want the presence of those placeholders to show up in the estimated number of classes seen.

However, we filter the placeholders from the output array, so there's no way the count of elements in the output array can represent the estimated number of classes.

Am I missing something here...?

…rovement

EgorBo · 2021-09-23T10:28:28Z

I'm still unsure on one point.

If the observations include collectible classes we put in placeholders (potentially up to 32 different values, though we currently just always use one). And we want the presence of those placeholders to show up in the estimated number of classes seen.

However, we filter the placeholders from the output array, so there's no way the count of elements in the output array can represent the estimated number of classes.

Am I missing something here...?

@AndyAyersMS Sorry for the delay,

so currently for numberOfClasses we return the number of only known handles but we do take them into the account when we calculate likelihood (it was like that even before my PR). I did a quick test with ALCs:

for this Test getLikelyClasses return 2 ([Class1, 50%], [Class2, 25%]) - so we took the dynamic one into the account here.

EgorBo · 2021-09-23T16:21:24Z

Failures aren't related (fail on other PRs too)

EgorBo added 3 commits September 11, 2021 15:30

Extend getLikelyClass to return multiple likely classes

51fb90e

getLikelyClass -> getLikelyClasses

baab6b0

Print all likelyclasses

210b7ad

EgorBo requested a review from MichalStrehovsky as a code owner September 11, 2021 14:55

dotnet-issue-labeler bot added the area-CodeGen-coreclr CLR JIT compiler in src/coreclr/src/jit and related components such as SuperPMI label Sep 11, 2021

EgorBo added 3 commits September 11, 2021 18:06

Clean up

6124628

fix build

5b4b7cf

bump JITEEVersionIdentifier

945875d

jakobbotsch reviewed Sep 12, 2021

View reviewed changes

src/coreclr/jit/likelyclass.cpp Outdated Show resolved Hide resolved

jakobbotsch reviewed Sep 12, 2021

View reviewed changes

src/coreclr/jit/likelyclass.cpp Outdated Show resolved Hide resolved

Update src/coreclr/jit/likelyclass.cpp

5796de6

Co-authored-by: Jakob Botsch Nielsen <Jakob.botsch.nielsen@gmail.com>

EgorBo added 2 commits September 12, 2021 17:53

Address feedback

acc6ef6

Merge branch 'gdv-improvement' of https://github.com/EgorBo/runtime-1 …

fc45ecb

…into gdv-improvement

jakobbotsch reviewed Sep 12, 2021

View reviewed changes

src/coreclr/jit/likelyclass.cpp Outdated Show resolved Hide resolved

Update src/coreclr/jit/likelyclass.cpp

2899705

Co-authored-by: Jakob Botsch Nielsen <Jakob.botsch.nielsen@gmail.com>

BruceForstall reviewed Sep 16, 2021

View reviewed changes

src/coreclr/jit/likelyclass.cpp Outdated Show resolved Hide resolved

src/coreclr/jit/likelyclass.cpp Outdated Show resolved Hide resolved

src/coreclr/jit/likelyclass.cpp Outdated Show resolved Hide resolved

davidwrighton reviewed Sep 16, 2021

View reviewed changes

src/coreclr/jit/likelyclass.cpp Outdated Show resolved Hide resolved

EgorBo and others added 4 commits September 17, 2021 11:57

Update src/coreclr/jit/likelyclass.cpp

119ba1a

Co-authored-by: Bruce Forstall <brucefo@microsoft.com>

Merge branch 'main' of https://github.com/dotnet/runtime into gdv-imp…

e58166d

…rovement

Address feedback

df1be48

fix build

1d1dda4

fix build

469858a

AndyAyersMS reviewed Sep 17, 2021

View reviewed changes

src/coreclr/tools/Common/JitInterface/CorInfoImpl.cs Outdated Show resolved Hide resolved

src/coreclr/jit/importer.cpp Outdated Show resolved Hide resolved

src/coreclr/jit/likelyclass.cpp Show resolved Hide resolved

EgorBo and others added 2 commits September 17, 2021 21:55

Update src/coreclr/tools/Common/JitInterface/CorInfoImpl.cs

07258af

Co-authored-by: Andy Ayers <andya@microsoft.com>

Address feedback

65c73a5

AndyAyersMS approved these changes Sep 17, 2021

View reviewed changes

src/coreclr/tools/Common/JitInterface/CorInfoImpl.cs Show resolved Hide resolved

src/coreclr/jit/importer.cpp Outdated Show resolved Hide resolved

EgorBo added 4 commits September 17, 2021 22:41

Address feedback

02949b5

Update likelyclass.cpp

75e9bd1

Update importer.cpp

9271fc2

clean up and ignore null handles

a8c3b94

Fix buffer overrun

32871ab

EgorBo force-pushed the gdv-improvement branch from 96646d0 to 32871ab Compare September 20, 2021 12:50

EgorBo added 2 commits September 20, 2021 17:17

Remove unnecessary nullchecks

7b80cce

Update likelyclass.cpp

769387f

AndyAyersMS reviewed Sep 20, 2021

View reviewed changes

EgorBo added 2 commits September 23, 2021 11:21

Merge branch 'main' of https://github.com/dotnet/runtime into gdv-imp…

d579d5c

…rovement

Clean up

df6292b

Clean up

15cd3d2

runfoapp bot mentioned this pull request Sep 23, 2021

System.Text.RegularExpressions.Tests.RegexMatchTests.Match_ExcessPrefix Failing in CI #59541

Closed

EgorBo merged commit 43740ce into dotnet:main Sep 24, 2021

EgorBo mentioned this pull request Sep 25, 2021

Guarded Devirt: multiple type checks #59604

Closed

9 tasks

ghost locked as resolved and limited conversation to collaborators Nov 3, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

JIT: Return multiple likely classes in getLikelyClass (for better GDV) #58984

JIT: Return multiple likely classes in getLikelyClass (for better GDV) #58984

EgorBo commented Sep 11, 2021 •

edited

Loading

ghost commented Sep 11, 2021

EgorBo commented Sep 11, 2021

AndyAyersMS commented Sep 11, 2021

EgorBo commented Sep 12, 2021

EgorBo commented Sep 12, 2021

jakobbotsch commented Sep 12, 2021

EgorBo commented Sep 13, 2021 •

edited

Loading

jakobbotsch commented Sep 13, 2021 •

edited

Loading

davidwrighton commented Sep 16, 2021

EgorBo commented Sep 17, 2021 •

edited

Loading

EgorBo commented Sep 17, 2021

AndyAyersMS left a comment

AndyAyersMS commented Sep 19, 2021

EgorBo commented Sep 19, 2021 •

edited

Loading

AndyAyersMS commented Sep 19, 2021

AndyAyersMS left a comment

EgorBo commented Sep 23, 2021

EgorBo commented Sep 23, 2021

JIT: Return multiple likely classes in getLikelyClass (for better GDV) #58984

JIT: Return multiple likely classes in getLikelyClass (for better GDV) #58984

Conversation

EgorBo commented Sep 11, 2021 • edited Loading

ghost commented Sep 11, 2021

EgorBo commented Sep 11, 2021

AndyAyersMS commented Sep 11, 2021

EgorBo commented Sep 12, 2021

EgorBo commented Sep 12, 2021

jakobbotsch commented Sep 12, 2021

EgorBo commented Sep 13, 2021 • edited Loading

jakobbotsch commented Sep 13, 2021 • edited Loading

davidwrighton commented Sep 16, 2021

EgorBo commented Sep 17, 2021 • edited Loading

EgorBo commented Sep 17, 2021

AndyAyersMS left a comment

Choose a reason for hiding this comment

AndyAyersMS commented Sep 19, 2021

EgorBo commented Sep 19, 2021 • edited Loading

AndyAyersMS commented Sep 19, 2021

AndyAyersMS left a comment

Choose a reason for hiding this comment

EgorBo commented Sep 23, 2021

EgorBo commented Sep 23, 2021

EgorBo commented Sep 11, 2021 •

edited

Loading

EgorBo commented Sep 13, 2021 •

edited

Loading

jakobbotsch commented Sep 13, 2021 •

edited

Loading

EgorBo commented Sep 17, 2021 •

edited

Loading

EgorBo commented Sep 19, 2021 •

edited

Loading