-
Notifications
You must be signed in to change notification settings - Fork 596
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Revision, after realizing that the memcpy trick + icc will fail to
vectorize loops containing it: * Restore the intrinsics, just for icc, just for the float<->int/uint varietes (still axe the ones involving doubles). * Remove the __builtin_bitcast clauses, it seems to provide no benefit for the compilers that support it (icc doesn't, and that's the one that seems to need the exra hints). * Add comments explaining why this is all the case and also reminding us NOT to switch to C++20 std::bit_cast in the future without also testing whether using it prevents auto-vectorization. Signed-off-by: Larry Gritz <lg@larrygritz.com>
- Loading branch information
Showing
1 changed file
with
33 additions
and
14 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters