Optimized curl transform formulas #97
Comments
This uses the Quine-McCluskey algorithm to optimize the transform function. While SSE
AVX
@wusyong, can you manually disable the AVX2-specific execution paths and compare again?
@jserv The results remain the same. After testing AVX2 on devorg, the results show no change and are sometimes worse. I even just removed operator for AVX2.
Original:
Optimized:
Extra parameters
IMHO, applying the Quine-McCluskey algorithm for micro-optimization is reasonable, and we can rework the AVX backend later. Please send pull request(s) along with an appropriate explanation.
Reference: https://software.intel.com/sites/default/files/managed/b4/3a/319433-024.pdf
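For anyone who wants to sanity-check a minimized formula before sending a PR, here is a minimal sketch of an exhaustive equivalence test. It rests on assumptions: `step_ref` is the commonly cited bit-sliced form of one Curl step and is not copied from this repository, `step_cand` is only a De Morgan rewrite used as a stand-in for a real minimized formula, and every function name is hypothetical.

```c
#include <assert.h>
#include <stdint.h>

/* Truth-table inputs: bit i of each constant holds that variable's value
 * in row i, so the low 16 bits cover all 2^4 input combinations at once. */
#define TT_A UINT64_C(0xAAAA)
#define TT_B UINT64_C(0xCCCC)
#define TT_C UINT64_C(0xF0F0)
#define TT_D UINT64_C(0xFF00)

/* Reference step (assumed form, see the note above). */
static void step_ref(uint64_t a, uint64_t b, uint64_t c, uint64_t d,
                     uint64_t *lo, uint64_t *hi)
{
    uint64_t delta = (a | ~d) & (c ^ b);
    *lo = ~delta;
    *hi = (a ^ d) | delta;
}

/* Candidate step: lo is rewritten with De Morgan's law. This is an
 * illustrative rewrite, not the formula proposed in this thread. */
static void step_cand(uint64_t a, uint64_t b, uint64_t c, uint64_t d,
                      uint64_t *lo, uint64_t *hi)
{
    uint64_t x = c ^ b;
    *lo = (~a & d) | ~x;
    *hi = (a ^ d) | ((a | ~d) & x);
}

/* Returns 1 when both steps agree on every one of the 16 input rows. */
static int formulas_agree(void)
{
    uint64_t lo_r, hi_r, lo_c, hi_c;
    step_ref(TT_A, TT_B, TT_C, TT_D, &lo_r, &hi_r);
    step_cand(TT_A, TT_B, TT_C, TT_D, &lo_c, &hi_c);
    uint64_t diff = (lo_r ^ lo_c) | (hi_r ^ hi_c);
    return (diff & UINT64_C(0xFFFF)) == 0;
}
```

Calling `formulas_agree()` from a unit test and asserting that it returns 1 covers the whole truth table in a single pass, because each bit lane is evaluated independently. If the minimization relies on don't-care input combinations, the mask `0xFFFF` would have to be narrowed to the rows that can actually occur.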
Utilize the Quine-McCluskey algorithm to optimize the transform function in each pow implementation file. SSE improves by ~10%, while AVX and AVX2 show no significant change. The following benchmarks were run on node0.

SSE
Original: Hash rate average value: 7997.251 kH/sec, with the range +- 109.604 kH/sec including 95% of the hash rate values.
Optimized: Hash rate average value: 9075.665 kH/sec, with the range +- 112.197 kH/sec including 95% of the hash rate values.

AVX
Original: Hash rates are around 94XX kH/s across multiple tests. Hash rate average value: 9452.758 kH/sec, with the range +- 394.444 kH/sec including 95% of the hash rate values.
Optimized: Hash rates are around 95XX kH/s across multiple tests. Hash rate average value: 9578.751 kH/sec, with the range +- 341.911 kH/sec including 95% of the hash rate values.

Resolve DLTcollab#97
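As a rough way to read the numbers above, the following sketch computes the relative gain between two averages and checks whether the reported 95% ranges overlap. The helper names are hypothetical, and treating overlapping ranges as "no significant change" is a simplifying assumption rather than a formal statistical test.

```c
#include <stdio.h>

/* Relative gain of `after` over `before`, in percent. */
static double relative_gain(double before, double after)
{
    return (after - before) / before * 100.0;
}

/* Returns 1 when [m1 - r1, m1 + r1] and [m2 - r2, m2 + r2] overlap. */
static int ranges_overlap(double m1, double r1, double m2, double r2)
{
    return (m1 - r1) <= (m2 + r2) && (m2 - r2) <= (m1 + r1);
}

/* Prints one summary line for a backend, using averages and ranges
 * in kH/sec as reported by the benchmark. */
static void report(const char *name, double m1, double r1,
                   double m2, double r2)
{
    printf("%s: gain %.2f%%, ranges %s\n", name,
           relative_gain(m1, m2),
           ranges_overlap(m1, r1, m2, r2) ? "overlap" : "do not overlap");
}
```

With the averages quoted above, `report("SSE", 7997.251, 109.604, 9075.665, 112.197)` gives a gain of roughly 13.5% with non-overlapping ranges, while `report("AVX", 9452.758, 394.444, 9578.751, 341.911)` gives roughly 1.3% with overlapping ranges, which fits the "no significant change" reading for AVX.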
You are welcome ;)
The powsrv.io team optimized the curl transform formulas, claiming a 7% speedup. dcurl should benefit from the changes proposed in entangled PR #803.
Reference: Curl improvement in IRI.