Examples where heavy intrinsics usage runs into internal jit limits on optimization #11905

AndyAyersMS · 2019-01-27T19:13:02Z

Tracking issue for cases where heavy intrinsics usage leads to poor optimization because methods hit various internal jit limits.

Vector128/256<T>: Too many calls to "ThrowIfUnsupportedType" prevents inlining #11744 inlining stops because the inlining budget was exceeded (partially addressed by JIT: modify inline budget update to use estimated imported IL size coreclr#21893)
JIT fails to inline methods called from a large/complex outer method #11903 inlining stops after hitting "too many locals" limit
Use less stack for HttpResponseHeaders.CopyToFast aspnetcore#7724 inlining stops after hitting "too many locals" limit (and no/few hw intrinsics)

category:cq
theme:inlining
skill-level:expert
cost:medium

saucecontrol · 2019-01-28T20:29:50Z

I closed #11903 because it's being addressed in a different way. However, absent the regression caused by the HWIntrinsics API change, that example was still very close to the JIT throttling limits without being absurdly complex. I wanted to bring over @AndyAyersMS comment from over there so it doesn't get lost, as it would be a good compromise solution for these cases.

The limits are there to prevent jit algorithms from taking up too much memory, too much time, or both. Perhaps we could tie increasing the limits into AggressiveOptimization so we have a better idea that the performance of a method is deemed critical and so optimizing it is worth the extra jit time and memory.

benaadams · 2020-07-30T01:25:47Z

@AndyAyersMS will this have become more problematic now Arm paths are being added, or are the .IsSupported paths dropped early?

AndyAyersMS · 2020-07-30T03:04:49Z

I think we're ok. Early pruning helps. Also, the jit will create temps for inlinee args and locals lazily as it is importing the inlinee, so increasing the number of locals in a method (say because C# now sees much more code) should not be problem, provided only a subset of them can be reached from any particular architecture.

@kunalspathak did some checking to make sure that adding arm specialization to methods that already has xarch specialization didn't cause any changes in the xarch code.

SingleAccretion · 2021-02-24T04:40:12Z

So that this doesn't get lost. From #48669:

We may want to revaluate this limit. Last time we looked (~5 years ago) there were very few methods that came near. But perhaps things have changed.

I have collected some quick data from the PMI diffs of the shared framework (for win-x64). It looks like the situations is still that most methods have a relatively small number of locals.

Locals        Methods 
0    - 100  : 352956 : 99.230%
100  - 200  : 2136   : 00.601%
200  - 300  : 382    : 00.107%
300  - 400  : 138    : 00.039%
400  - 500  : 31     : 00.009%
500  - 2334 : 51     : 00.014%

AndyAyersMS self-assigned this Mar 6, 2019

msftgits transferred this issue from dotnet/coreclr Jan 31, 2020

msftgits added this to the Future milestone Jan 31, 2020

AndyAyersMS mentioned this issue Feb 7, 2020

JIT: review and document the various cases where the jit circuit-breakers will kick in #31942

Open

BruceForstall added the JitUntriaged CLR JIT issues needing additional triage label Oct 28, 2020

saucecontrol mentioned this issue Feb 24, 2021

Forward Span-based MemoryExtensions overloads to ReadOnlySpan overloads #48669

Merged

BruceForstall removed the JitUntriaged CLR JIT issues needing additional triage label Jan 24, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Examples where heavy intrinsics usage runs into internal jit limits on optimization #11905

Examples where heavy intrinsics usage runs into internal jit limits on optimization #11905

AndyAyersMS commented Jan 27, 2019

saucecontrol commented Jan 28, 2019

benaadams commented Jul 30, 2020

AndyAyersMS commented Jul 30, 2020

SingleAccretion commented Feb 24, 2021

Examples where heavy intrinsics usage runs into internal jit limits on optimization #11905

Examples where heavy intrinsics usage runs into internal jit limits on optimization #11905

Comments

AndyAyersMS commented Jan 27, 2019

saucecontrol commented Jan 28, 2019

benaadams commented Jul 30, 2020

AndyAyersMS commented Jul 30, 2020

SingleAccretion commented Feb 24, 2021