Optimize integer arithmetic #7553

bjorng · 2023-08-11T11:05:40Z

This pull request optimizes some of the arithmetic operations for integers.

The most noticeable improvement is in multiplication of large integers. Consider this benchmark:

bench() ->
    Size = 1_262_000,
    String = binary:copy(<<"9">>, Size),
    {Time, _Val} = timer:tc(erlang, binary_to_integer, [String]),
    io:format("Size: ~p, seconds: ~p\n", [Size, Time / 1_000_000]).

Its running time is dominated by multiplication of large integers. This PR reduces the running time from about 0.45 seconds down to about 0.32 seconds on my computer (an M1 MacBook Pro).

github-actions · 2023-08-11T11:09:43Z

CT Test Results

      3 files   133 suites 48m 53s ⏱️
1 559 tests 1 503 ✔️ 54 💤 2 ❌
1 980 runs 1 905 ✔️ 73 💤 2 ❌

For more details on these failures, see this check.

Results for commit 4ca352e.

♻️ This comment has been updated with latest results.

To speed up review, make sure that you have read Contributing to Erlang/OTP and that all checks pass.

See the TESTING and DEVELOPMENT HowTo guides for details about how to run test locally.

Artifacts

// Erlang/OTP Github Action Bot

jhogberg

LGTM

erts/emulator/beam/jit/arm/beam_asm.hpp

erts/emulator/beam/jit/arm/instr_arith.cpp

erts/emulator/beam/jit/x86/instr_arith.cpp

erts/emulator/beam/jit/arm/instr_arith.cpp

Fuse a multiplication operator followed by an addition operator. That will generally reduce the number of instructions compared to having separate operators.

We used to replace division by a power of two with a right shift only when the dividend was known to be a positive integer. Extend the implementation to do right shift when the range of the dividend is unknown.

Inline the code for right shift a small operand any number steps. We used to call a helper routine when the shift count exceeded the number of bits in a small.

The routine for squaring a big integer did not have all optimizations that the multiplication routine had.

This commit implements the Karatsuba algorithm in a way that reduces the number of additions, resulting in a measureable performance improvement for multiplication of large integers.

bjorng added team:VM Assigned to OTP team VM enhancement testing currently being tested, tag is used by OTP internal CI labels Aug 11, 2023

bjorng requested review from sverker and jhogberg August 11, 2023 11:05

bjorng self-assigned this Aug 11, 2023

jhogberg approved these changes Aug 11, 2023

View reviewed changes

bjorng force-pushed the bjorn/jit/integer-arithmetic branch 2 times, most recently from 36c0ff3 to be5196d Compare August 14, 2023 09:31

jhogberg reviewed Aug 14, 2023

View reviewed changes

erts/emulator/beam/jit/arm/instr_arith.cpp Outdated Show resolved Hide resolved

bjorng force-pushed the bjorn/jit/integer-arithmetic branch 3 times, most recently from e1e28c3 to 0da399f Compare August 15, 2023 13:07

bjorng removed the testing currently being tested, tag is used by OTP internal CI label Aug 18, 2023

bjorng added 7 commits August 18, 2023 12:55

small_SUITE: Unload generated test modules

a64f45d

Fuse multiplication with addition

6a02b04

Fuse a multiplication operator followed by an addition operator. That will generally reduce the number of instructions compared to having separate operators.

Optimize division by powers of two

16dacd0

We used to replace division by a power of two with a right shift only when the dividend was known to be a positive integer. Extend the implementation to do right shift when the range of the dividend is unknown.

Optimize bsr for small operands

55813dd

Inline the code for right shift a small operand any number steps. We used to call a helper routine when the shift count exceeded the number of bits in a small.

Optimize helper macro used for bignum arithmetic

8cab1c6

Optimize squaring of big integers

4253b86

The routine for squaring a big integer did not have all optimizations that the multiplication routine had.

Optimize the implementation of the Karatsuba algorithm

4ca352e

This commit implements the Karatsuba algorithm in a way that reduces the number of additions, resulting in a measureable performance improvement for multiplication of large integers.

bjorng force-pushed the bjorn/jit/integer-arithmetic branch from 0da399f to 4ca352e Compare August 18, 2023 11:25

bjorng added the testing currently being tested, tag is used by OTP internal CI label Aug 18, 2023

bjorng merged commit 272e678 into erlang:master Aug 21, 2023
14 of 16 checks passed

bjorng deleted the bjorn/jit/integer-arithmetic branch August 21, 2023 08:18

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Optimize integer arithmetic #7553

Optimize integer arithmetic #7553

bjorng commented Aug 11, 2023

github-actions bot commented Aug 11, 2023 •

edited

Loading

jhogberg left a comment

Optimize integer arithmetic #7553

Optimize integer arithmetic #7553

Conversation

bjorng commented Aug 11, 2023

github-actions bot commented Aug 11, 2023 • edited Loading

CT Test Results

Artifacts

jhogberg left a comment

Choose a reason for hiding this comment

github-actions bot commented Aug 11, 2023 •

edited

Loading