Roll back ipow changes due to register pressure. #15242

pmattione-nvidia · 2024-03-06T19:22:11Z

The addition of an array of integers in this function placed too much register pressure on our code base. This function is used by the fixed_point constructor and cast operators, so it potentially affects every kernel. Too many unrelated kernels were impacted and suffered performance degradations to justify this change. This reverts the algorithm introduced in #15110 to what it was previously, with some very minor tweaks.

Checklist

I am familiar with the Contributing Guidelines.
New or existing tests cover these changes.
The documentation is up to date with these changes.

copy-pr-bot · 2024-03-06T19:22:14Z

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

GregoryKimball · 2024-03-06T21:53:32Z

Adding @shrshi and @PointKernel as reviewers based on their previous analysis of #15110

PointKernel · 2024-03-06T22:01:45Z

@pmattione-nvidia I recall you mentioned that using a smaller lookup table can get about the same performance without introducing too much register pressure. How did that go?

pmattione-nvidia · 2024-03-06T22:28:35Z

I tried that and everything I could think of, but several tests were still just too sensitive. It's not clear to me that they were even using decimal types at all, but they were slowing down significantly regardless (a few tests were still 5-10% slower).

PointKernel · 2024-03-07T02:27:16Z

LGTM

I pinned the original PR in the PR description for easy backtrace. We can revisit the recursive solution once the mixed join has been refactored with the new cuco set (which is supposed to relieve the register pressure a bit). Also, I've sent you the instructions for setting up signed commits via slack. The CI will automatically once it's done.

pmattione-nvidia · 2024-03-08T17:01:41Z

/ok to test

mythrocks

LGTM. This branch might need an upmerge.

pmattione-nvidia · 2024-03-11T21:04:41Z

/merge

Roll back ipow changes due to register pressure.

dda4b5c

pmattione-nvidia added libcudf Affects libcudf (C++/CUDA) code. Performance Performance related issue improvement Improvement / enhancement to an existing function non-breaking Non-breaking change labels Mar 6, 2024

pmattione-nvidia requested a review from a team as a code owner March 6, 2024 19:22

pmattione-nvidia requested review from hyperbolic2346 and mythrocks March 6, 2024 19:22

GregoryKimball requested review from PointKernel and shrshi March 6, 2024 21:52

PointKernel approved these changes Mar 7, 2024

View reviewed changes

hyperbolic2346 approved these changes Mar 7, 2024

View reviewed changes

GregoryKimball assigned pmattione-nvidia Mar 7, 2024

shrshi approved these changes Mar 7, 2024

View reviewed changes

mythrocks approved these changes Mar 11, 2024

View reviewed changes

rapids-bot bot merged commit 63c9ed7 into rapidsai:branch-24.04 Mar 11, 2024
74 checks passed

pmattione-nvidia mentioned this pull request Apr 3, 2024

For powers of 10, replace ipow with switch #15353

Merged

3 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Roll back ipow changes due to register pressure. #15242

Roll back ipow changes due to register pressure. #15242

pmattione-nvidia commented Mar 6, 2024 •

edited by PointKernel

Loading

copy-pr-bot bot commented Mar 6, 2024

GregoryKimball commented Mar 6, 2024

PointKernel commented Mar 6, 2024

pmattione-nvidia commented Mar 6, 2024

PointKernel commented Mar 7, 2024

pmattione-nvidia commented Mar 8, 2024

mythrocks left a comment

pmattione-nvidia commented Mar 11, 2024

Roll back ipow changes due to register pressure. #15242

Roll back ipow changes due to register pressure. #15242

Conversation

pmattione-nvidia commented Mar 6, 2024 • edited by PointKernel Loading

Checklist

copy-pr-bot bot commented Mar 6, 2024

GregoryKimball commented Mar 6, 2024

PointKernel commented Mar 6, 2024

pmattione-nvidia commented Mar 6, 2024

PointKernel commented Mar 7, 2024

pmattione-nvidia commented Mar 8, 2024

mythrocks left a comment

Choose a reason for hiding this comment

pmattione-nvidia commented Mar 11, 2024

pmattione-nvidia commented Mar 6, 2024 •

edited by PointKernel

Loading