riscv64: Implement a few SIMD arithmetic ops #6268

afonso360 · 2023-04-23T11:33:31Z

👋 Hey,

This PR implements a few more arithmetic ops. I didn't want to implement too many, since I want to get the wasmtime testsuite working to get better test coverage.

Similarly to #6266, I've also switched the scalar rules to match only scalars, since most of them used fits_in_64 which also matches vectors.

I had also accidentally switched the order of the registers in VecAluRRR, which I only noticed when implementing isub.
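As an illustration of why the swap only surfaced with isub, here is a minimal lane-wise sketch. The function names and the fixed four-lane `i32` arrays are assumptions made for this example and are not the backend's code; the operand names follow the RVV convention in which `vsub.vv vd, vs2, vs1` computes `vs2 - vs1`.

```rust
// Hypothetical lane-wise models of two three-register vector ops.
// Addition is commutative, so swapping the sources goes unnoticed;
// subtraction is not, so a swapped operand order changes the result.
fn vadd(vs2: [i32; 4], vs1: [i32; 4]) -> [i32; 4] {
    std::array::from_fn(|i| vs2[i].wrapping_add(vs1[i]))
}

fn vsub(vs2: [i32; 4], vs1: [i32; 4]) -> [i32; 4] {
    std::array::from_fn(|i| vs2[i].wrapping_sub(vs1[i]))
}
```

With any two distinct inputs `a` and `b`, `vadd(a, b)` equals `vadd(b, a)`, while `vsub(a, b)` and `vsub(b, a)` differ in sign.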

Implemented ops are:

isub
imul
smulhi
umulhi
band
bor
bxor
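For reference, the two high-half multiplies in the list above can be modeled per lane as below. This is a scalar sketch with hypothetical helper names, assuming 32-bit lanes; it shows the semantics only, not how the backend lowers them.

```rust
// Hypothetical per-lane models: widen to 64 bits, multiply, and keep
// the upper 32 bits of the product.
fn smulhi_i32(x: i32, y: i32) -> i32 {
    // Signed: sign-extend both operands before multiplying.
    ((x as i64 * y as i64) >> 32) as i32
}

fn umulhi_i32(x: u32, y: u32) -> u32 {
    // Unsigned: zero-extend both operands before multiplying.
    ((x as u64 * y as u64) >> 32) as u32
}
```

The same bit pattern gives different results in the two ops: `smulhi_i32(-1, 1)` is `-1`, whereas `umulhi_i32(u32::MAX, 1)` is `0`.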

This PR is based on #6266; I'll rebase when that lands.


alexcrichton · 2023-04-25T15:39:13Z

cranelift/codegen/src/isa/riscv64/inst/vector.rs

            VecAluOpRRR::Vadd => write!(f, "vadd.vv"),
+            VecAluOpRRR::Vsub => write!(f, "vsub.vv"),
+            VecAluOpRRR::Vmul => write!(f, "vmul.vv"),
+            VecAluOpRRR::Vmulh => write!(f, "vmulh.vv"),
+            VecAluOpRRR::Vmulhu => write!(f, "vmulhu.vv"),
+            VecAluOpRRR::Vand => write!(f, "vand.vv"),
+            VecAluOpRRR::Vor => write!(f, "vor.vv"),
+            VecAluOpRRR::Vxor => write!(f, "vxor.vv"),


If these all follow the same pattern, one thing I found helpful for AVX was to do something like:

    let mut s = format!("{self:?}");
    s.make_ascii_lowercase();
    s.push_str(".vv");
    f.write_str(&s)

which can cut down on codegen times and additionally make this a bit easier to maintain. If the instructions have all sorts of different names, though, this may not work out well.

We also have another variant for all of these, which is v*.vx. However, it should still be fairly easy to fit into that scheme when we do need it.

Edit: Hmm, it might not be so easy, because I was planning on renaming these enum arms into VaddVV and VaddVX, which would then break that. But I don't mind doing it this way for now and figuring it out later.
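The Debug-based formatting discussed in this thread can be sketched as a self-contained example. The enum below is a stand-in that borrows only the variant names from the diff above; the real definition of VecAluOpRRR and where its formatting lives are not shown in this thread, so treat those details as assumptions.

```rust
use std::fmt;

// Hypothetical stand-in for the backend's enum; only the variant names
// are taken from the diff in this thread.
#[derive(Debug, Clone, Copy)]
enum VecAluOpRRR {
    Vadd,
    Vsub,
    Vmul,
    Vmulh,
    Vmulhu,
    Vand,
    Vor,
    Vxor,
}

impl fmt::Display for VecAluOpRRR {
    fn fmt(&self, f: &mut fmt::Formatter<'_>) -> fmt::Result {
        // Reuse the derived Debug name, lowercase it, and append the
        // `.vv` suffix instead of writing one match arm per variant.
        let mut s = format!("{self:?}");
        s.make_ascii_lowercase();
        s.push_str(".vv");
        f.write_str(&s)
    }
}
```

Under these assumptions, `VecAluOpRRR::Vmulhu.to_string()` yields `vmulhu.vv`. The approach relies on each variant name matching its mnemonic exactly, which is why a rename to names such as VaddVV would require a different scheme.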

alexcrichton · 2023-04-25T15:41:47Z

cranelift/filetests/filetests/runtests/simd-band.clif

+target aarch64
+target s390x
+set enable_simd
+target x86_64


Mind copying over the target x86_64 has_sse41=false line as well from the original test? (maybe lost through a rebase by accident)

This also makes me idly think that perhaps there should be a ; skip: riscv directive for filetests or similar, but not something to be added in this PR of course.

* riscv64: Swap order of `VecAluRRR` source registers

  These were accidentally reversed from what we declare in the isle emit helper

* riscv64: Add SIMD `isub`

* riscv64: Add SIMD `imul`

* riscv64: Add `{u,s}mulhi`

* riscv64: Add `b{and,or,xor}`

* cranelift: Move `imul.i8x16` runtest to separate file

  Looks like x86 does not implement it

* riscv64: Better formatting for `VecAluOpRRR`

* cranelift: Enable x86 SIMD tests with `has_sse41=false`

afonso360 requested a review from a team as a code owner April 23, 2023 11:33

afonso360 requested review from jameysharp and removed request for a team April 23, 2023 11:33

github-actions bot added the cranelift and cranelift:meta labels Apr 23, 2023

alexcrichton reviewed Apr 24, 2023


cranelift/codegen/src/isa/riscv64/lower/isle.rs (outdated, resolved)

cranelift/codegen/meta/src/isa/riscv64.rs (outdated, resolved)

alexcrichton mentioned this pull request Apr 24, 2023

riscv64: Support non 128bit vector sizes #6266

Merged

afonso360 added 6 commits April 25, 2023 16:31

riscv64: Swap order of VecAluRRR source registers

e2ed70d

These were accidentally reversed from what we declare in the isle emit helper

riscv64: Add SIMD isub

2f032eb

riscv64: Add SIMD imul

8dee30e

riscv64: Add {u,s}mulhi

142ce48

riscv64: Add b{and,or,xor}

3e1fc1a

cranelift: Move imul.i8x16 runtest to separate file

09447be

Looks like x86 does not implement it

afonso360 force-pushed the riscv-vec-arithmetic branch from c6eeba1 to 09447be April 25, 2023 15:31

alexcrichton approved these changes Apr 25, 2023


afonso360 added 2 commits April 25, 2023 16:55

riscv64: Better formatting for VecAluOpRRR

86bfb0b

cranelift: Enable x86 SIMD tests with has_sse41=false

2334775

afonso360 enabled auto-merge April 25, 2023 16:29

afonso360 added this pull request to the merge queue Apr 25, 2023

Merged via the queue into bytecodealliance:main with commit 62cbb50 Apr 25, 2023

afonso360 deleted the riscv-vec-arithmetic branch April 25, 2023 17:16
