Performance improvements for factor #1525

nbraud · 2020-05-24T14:03:46Z

First pass at fixing #1456: after those changes, factor is 4× faster, and
“only” 48× slower than the GNU implementation (when factoring all numbers from 2
to 10⁶).

Replaced the Vec of factors with a datatype that only stores each prime once
Moved each algorithm (table-based, Pollard's rho, and Miller-Rabin) to its own module, and decoupled them.
Removed unecessary calls to is_prime (~50% of the perf. gain)
Replaced iterated division by 2 with u64::trailing_zeros
Refactored and optimized the Miller-Rabin primality test, extracted dividers from the result (the other half of the performance gain)

It is clearer to see what is going on, as opposed to passing around an unmarked `Vec<u64>`, and there is a single place to add invariants checks. This is also a more compact memory representation: each prime factor is represented only once, with an additional byte for multiplicity. The performance impact is however not significant.

Also decoupled the factorisation methods; now factor::factor contains the logic that chains the different algorithms and aggregates results. As a side-effect, rho::factor now performs extraneous allocations (as each recursive step creates a new `Factors` value, which is then aggregated into the previous one) but there is no significant performance impact.

No significant performance impact (most of the time is spent elsewhere), but an easy and satisfying fix nevertheless.

When the remainder is smaller than the max. entry in the table, it is guaranteed to be prime.

50% performance improvement on factoring all numbers between 2 and 10⁶.

Another 6.97% runtime improvement

Replace iterated division with u64::trailing_zeros, hoist the selection of `mul` out of the loop, another cool 49.5% runtime improvement.

Another 36% improvement.

nbraud · 2020-05-24T14:18:20Z

I am going to have a deeper look at the implementation of Pollard's rho, but I suspect further improvements will require switching to a wholly-different factoring algorithm.

Factoring Small to Medium Size Integers: An Experimental Comparison (2010) suggests that Shanks's SQUFOF (square forms factorization) is the fastest algorithm for integers up to ~60 bits.

A possible alternative I might investigate is Hart's “one-line factoring algorithm”, which is said to be competitive with SQUFOF and simpler to implement.

PS: This would presumably be the subject of a second PR.

sylvestre · 2020-05-24T15:02:26Z

Thanks for the PR. There are some rust format and clippy issues, could you please fix them?

nbraud · 2020-05-24T15:08:11Z

Thanks for the PR. There are some rust format and clippy issues, could you please fix them?

Done; I was trying first to recall how to deal with not having NLLs (as the build is currently failing on obsolete versions of Rust).

nbraud · 2020-05-24T15:08:43Z

I didn't see the format issues, though (or at least, running cargo fmt didn't introduce any change)

rivy

I didn't see the format issues, though (or at least, running cargo fmt didn't introduce any change)

It looks like it's just cargo clippy issues, specifically large numbers of single character variable names, unneeded returns, long literals lacking separators, ....

If you look at the "Checks" tabs and pull up the CICD section, it lists all the warnings.

It is acceptable to use an #[allow(...)] for the naming issue if you have reasons.

Thanks for your work!

nbraud · 2020-05-24T15:22:00Z

It looks like it's just cargo clippy issues, specifically about a large number of single character variable names.

Yes, I already dealt with it (by using 1 fewer variable >_>')

Thanks for your work!

You are welcome <3

src/uu/factor/src/factor.rs

Instead of computing a^r and a^(n-1) = a^(r 2ⁱ) separately, compute the latter by repeatedly squaring the former. 33.6% performance improvement

Co-authored-by: Roy Ivy III <rivy.dev@gmail.com>

nbraud · 2020-05-24T17:16:54Z

This is “merely” 37× slower than GNU factor, now, and I still haven't implemented any fancier number theory (aside from extracting dividers from the M-H primality test)

src/uu/factor/src/miller_rabin.rs

nbraud · 2020-05-24T21:50:03Z

@rivy @sylvestre I found another 2× speedup with a loop exchange (and essentially inlining a batch version of pow inside rabin_miller::test) but I'd rather postpone it to another PR:
I prototyped it using lazy_static and nalgebra, but those dependencies aren't really needed — all I need is all, any, and map for [u64; 7] — so I'd rather rewrite it more cleanly, but I'm not doing that tonight.

rivy · 2020-05-24T21:55:09Z

@nbraud , thanks! Sounds like your having fun! Mush on at your leisure.

nbraud · 2020-05-24T21:57:17Z

@rivy Thanks a bunch for reviewing and merging so quickly :O

nbraud · 2020-05-24T21:57:39Z

And yes, golfing this to be ~93% faster was fun :3

nbraud added 10 commits May 24, 2020 15:41

factor::factor: Use u64::trailing_zero instead of iterated division

bc11e57

No significant performance impact (most of the time is spent elsewhere), but an easy and satisfying fix nevertheless.

factor::factor: Short-circuit the fallback to Pollard's rho

418fd61

When the remainder is smaller than the max. entry in the table, it is guaranteed to be prime.

factor::table: Remove extraneous calls to the primality test

1697406

50% performance improvement on factoring all numbers between 2 and 10⁶.

factor::table: Remove obsolete, commented code

e1a6dbe

factor::factor: Remove extraneous call to the primality test

74054fe

Another 6.97% runtime improvement

factor: Move the Miller-Rabin primality test to its own module.

e3ecc81

factor::miller_rabbin: Refactor before extracting dividers

6b9585b

Replace iterated division with u64::trailing_zeros, hoist the selection of `mul` out of the loop, another cool 49.5% runtime improvement.

factor::miller_rabin: Extract dividers from the primality test

8241037

Another 36% improvement.

format: Make clippy happy

29eb8fd

rivy self-requested a review May 24, 2020 15:17

rivy requested changes May 24, 2020

View reviewed changes

nbraud requested a review from rivy May 24, 2020 15:27

factor::numeric: Replace lose functions with an Arithmetic trait

30fd6a0

nbraud commented May 24, 2020

View reviewed changes

src/uu/factor/src/factor.rs Outdated Show resolved Hide resolved

nbraud and others added 3 commits May 24, 2020 19:10

factor::rho: Small refactor

543c7b9

factor::miller_rabin: Avoid unecessary exponentiation

36a2948

Instead of computing a^r and a^(n-1) = a^(r 2ⁱ) separately, compute the latter by repeatedly squaring the former. 33.6% performance improvement

factor::Factors::add: Split up to work without NLL

4c3682a

Co-authored-by: Roy Ivy III <rivy.dev@gmail.com>

nbraud commented May 24, 2020

View reviewed changes

src/uu/factor/src/miller_rabin.rs Show resolved Hide resolved

rivy approved these changes May 24, 2020

View reviewed changes

rivy merged commit 09abcf8 into uutils:master May 24, 2020

nbraud deleted the factor/faster branch May 24, 2020 21:57

This was referenced Jun 18, 2020

factor: Add/update copyright notices as necessary #1546

Merged

factor: Fix very-rare bug in ρ #1549

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Performance improvements for factor #1525

Performance improvements for factor #1525

nbraud commented May 24, 2020 •

edited

Loading

nbraud commented May 24, 2020 •

edited

Loading

sylvestre commented May 24, 2020

nbraud commented May 24, 2020

nbraud commented May 24, 2020

rivy left a comment •

edited

Loading

nbraud commented May 24, 2020 •

edited

Loading

nbraud commented May 24, 2020

nbraud commented May 24, 2020

rivy commented May 24, 2020

nbraud commented May 24, 2020

nbraud commented May 24, 2020

Performance improvements for factor #1525

Performance improvements for factor #1525

Conversation

nbraud commented May 24, 2020 • edited Loading

nbraud commented May 24, 2020 • edited Loading

sylvestre commented May 24, 2020

nbraud commented May 24, 2020

nbraud commented May 24, 2020

rivy left a comment • edited Loading

Choose a reason for hiding this comment

nbraud commented May 24, 2020 • edited Loading

nbraud commented May 24, 2020

nbraud commented May 24, 2020

rivy commented May 24, 2020

nbraud commented May 24, 2020

nbraud commented May 24, 2020

nbraud commented May 24, 2020 •

edited

Loading

nbraud commented May 24, 2020 •

edited

Loading

rivy left a comment •

edited

Loading

nbraud commented May 24, 2020 •

edited

Loading