Winch: Float conversion instructions #15

jeffcharles · 2023-12-15T21:34:16Z

No description provided.

jeffcharles · 2024-01-03T22:43:35Z

Fuzzing has been running successfully for around 4 hours

saulecabrera · 2024-01-11T12:38:43Z

winch/codegen/src/masm.rs

+    /// Truncate a float into an integer.
+    /// In x64, this will emit multiple instructions.


Could we generalize these comments so that they omit ISA-specific details? Something like:

/// Emits one or more instructions to truncate a float into an integer.

I think this would be less error-prone (and potentially less tedious) than having to update the comment every time a new backend is added.

saulecabrera · 2024-01-11T13:32:59Z

winch/codegen/src/isa/x64/masm.rs

+    fn truncate(
+        &mut self,
+        context: &mut CodeGenContext,
+        src_size: OperandSize,
+        dst_size: OperandSize,
+        kind: TruncateKind,
+    ) {


I'll add this comment here, but it applies to all the new methods that take a &mut CodeGenContext as a parameter.

I'm trying to understand why passing the &mut CodeGenContext is needed instead of passing the already-resolved register operands. The convention I'm trying to keep at the MacroAssembler level is that passing the CodeGenContext is a last resort, used only when the lowering of a particular instruction is not generic enough and needs ISA-specific details resolved. One example is the div instruction, which on x64 requires very specific registers. Looking at the conversion and truncation methods, there doesn't seem to be anything ISA-specific here, so most of the logic can move to the visitor, which would be my preference. Another advantage of that approach is that keeping the MacroAssembler layer as thin as possible means less repetition when working on other backends.

Let me know if there's anything that I'm missing here.

Yeah, I can move the register part of the logic into the visitor. I wasn't entirely sure what your preference would be, so I went with one approach, but I agree that since it's not ISA-specific, having it in the visitor makes more sense.

Actually for truncate, I need an additional temporary XMM register but only when performing an unsigned truncation. I could change how we model truncation so we have a signed_truncate and an unsigned_truncate at the MacroAssembler level where the unsigned_truncate method would take an additional temporary XMM parameter.

The same pattern holds for convert where I need an additional temporary GPR but only when performing an unsigned convert. Again I could split that method at the MacroAssembler level into a signed_convert and an unsigned_convert where the unsigned_convert takes an additional temporary GPR as a parameter.

Yeah, I didn't mention it explicitly in my original comment, but I think that splitting those operations (<signed|unsigned>_convert) and performing the dispatch from the visitor makes sense.
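
For illustration, here is a minimal sketch of the split discussed above, with the dispatch done on the visitor side. It is not the PR's code: `Reg`, `TruncKind`, `Masm`, `RecordingMasm`, and `visit_truncate` are hypothetical stand-ins, and the "instructions" are only recorded as strings. The point is that the assembler trait receives resolved registers only, and the temporary is allocated only on the unsigned path.

```rust
// Hypothetical stand-ins for illustration; these are not Winch's real types.
#[derive(Clone, Copy, Debug, PartialEq)]
enum Reg {
    Gpr(u8),
    Xmm(u8),
}

#[derive(Clone, Copy, Debug, PartialEq)]
enum TruncKind {
    Signed,
    Unsigned,
}

/// A thin assembler interface that only sees already-resolved registers.
trait Masm {
    fn signed_truncate(&mut self, src: Reg, dst: Reg);
    /// The unsigned variant takes an extra temporary float register.
    fn unsigned_truncate(&mut self, src: Reg, dst: Reg, tmp_fpr: Reg);
}

/// Records what would be emitted instead of generating machine code.
#[derive(Default)]
struct RecordingMasm {
    log: Vec<String>,
}

impl Masm for RecordingMasm {
    fn signed_truncate(&mut self, src: Reg, dst: Reg) {
        self.log.push(format!("strunc {:?} -> {:?}", src, dst));
    }

    fn unsigned_truncate(&mut self, src: Reg, dst: Reg, tmp_fpr: Reg) {
        self.log
            .push(format!("utrunc {:?} -> {:?} (tmp {:?})", src, dst, tmp_fpr));
    }
}

/// Visitor-side dispatch: `alloc_tmp` is only called for the unsigned case.
fn visit_truncate<M: Masm>(
    masm: &mut M,
    kind: TruncKind,
    src: Reg,
    dst: Reg,
    alloc_tmp: impl FnOnce() -> Reg,
) {
    match kind {
        TruncKind::Signed => masm.signed_truncate(src, dst),
        TruncKind::Unsigned => {
            let tmp = alloc_tmp();
            masm.unsigned_truncate(src, dst, tmp);
        }
    }
}
```

The same shape would apply to `signed_convert` / `unsigned_convert`, with a temporary GPR in place of the temporary float register.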

Sorry for rambling on this, it's been a few weeks since I worked on this so I'm just remembering I did this as I go through the exercise again. I don't know if we would need the extra temporary XMM register or GPR for other ISAs when performing unsigned truncation or conversion. Maybe that's just an x86 detail? But I haven't tried making an AArch64 implementation yet so maybe it would also be necessary on that ISA. So does it make sense to bake in needing additional temporary registers for unsigned truncation and conversion at the MacroAssembler trait level given it may just be an x86 requirement?

I think it makes sense to do that. When implementing the aarch64 backend we'll know for sure, and we can adjust then. If the extra register turns out to be an issue, there are other ways to deal with it IMO. For example, we could pass a smallvec of registers to work with, where each backend defines how many it needs; the list could be empty if no extra registers are needed.
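
A rough sketch of the "list of scratch registers" alternative mentioned above, under stated assumptions: `Backend`, `X64Like`, `NoScratch`, and `alloc_scratch` are hypothetical names, and a plain `Vec` stands in for a smallvec to keep the example dependency-free. `NoScratch` is just a placeholder for a backend that needs no temporaries; it makes no claim about what aarch64 actually requires.

```rust
// Hypothetical stand-ins for illustration; these are not Winch's real types.
#[derive(Clone, Copy, Debug, PartialEq)]
enum RegClass {
    Int,
    Float,
}

#[derive(Clone, Copy, Debug, PartialEq)]
struct Reg {
    class: RegClass,
    index: u8,
}

trait Backend {
    /// Classes of the scratch registers this backend needs for an unsigned
    /// truncation. An empty slice means no extra registers are needed.
    fn unsigned_truncate_scratch(&self) -> &'static [RegClass];
}

/// A backend that, like the x64 case in this thread, wants one float temporary.
struct X64Like;

/// A placeholder backend that needs no temporaries.
struct NoScratch;

impl Backend for X64Like {
    fn unsigned_truncate_scratch(&self) -> &'static [RegClass] {
        &[RegClass::Float]
    }
}

impl Backend for NoScratch {
    fn unsigned_truncate_scratch(&self) -> &'static [RegClass] {
        &[]
    }
}

/// Generic (visitor-side) code allocates whatever the backend asks for.
/// `Vec` is used here in place of a smallvec.
fn alloc_scratch<B: Backend>(backend: &B, mut alloc: impl FnMut(RegClass) -> Reg) -> Vec<Reg> {
    backend
        .unsigned_truncate_scratch()
        .iter()
        .map(|&class| alloc(class))
        .collect()
}
```

With this shape the trait signature stays the same across backends, and only the length of the returned list varies.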

jeffcharles · 2024-01-11T22:50:43Z

Fuzzing has been running for a little over two hours successfully

saulecabrera

Looks great, thanks for the refactoring. Left one last comment, and I think that after that this should be ready to land.

saulecabrera · 2024-01-12T12:27:12Z

winch/codegen/src/codegen/context.rs

+    /// Prepares arguments for emitting a convert operation with a temporary
+    /// GPR.
+    pub fn convert_op_with_tmp_gpr<F, M>(&mut self, masm: &mut M, dst_ty: WasmType, mut emit: F)
+    where
+        F: FnMut(&mut M, Reg, Reg, Reg, OperandSize),
+        M: MacroAssembler,
+    {
+        let tmp_gpr = self.any_gpr(masm);
+        self.convert_op(masm, dst_ty, |masm, dst, src, dst_size| {
+            emit(masm, dst, src, tmp_gpr, dst_size);
+        });
+        self.free_reg(tmp_gpr);
+    }
+
+    /// Prepares arguments for emitting a convert operation with a temporary
+    /// floating point register.
+    pub fn convert_op_with_tmp_fpr<F, M>(&mut self, masm: &mut M, dst_ty: WasmType, mut emit: F)
+    where
+        F: FnMut(&mut M, Reg, Reg, Reg, OperandSize),
+        M: MacroAssembler,
+    {
+        let tmp_fpr = self.reg_for_class(RegClass::Float, masm);
+        self.convert_op(masm, dst_ty, |masm, dst, src, dst_size| {
+            emit(masm, dst, src, tmp_fpr, dst_size);
+        });
+        self.free_reg(tmp_fpr);
+    }
+


I think it's possible to merge these two functions into one that takes an extra RegClass param to signal the class of the temporary register?

Yes. I didn't do that initially because I thought it wouldn't make sense to expose that to the visitor, but I'm fine with making that change.
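
As a sketch of what the merged helper could look like, assuming a toy register allocator in place of the real CodeGenContext: `Context`, its fields, the simplified `convert_op`, and the name `convert_op_with_tmp_reg` are all hypothetical here. Only the idea (one function parameterized by the temporary's `RegClass`) comes from the suggestion above.

```rust
// Hypothetical stand-ins for illustration; these are not Winch's real types.
#[derive(Clone, Copy, Debug, PartialEq)]
enum RegClass {
    Int,
    Float,
}

#[derive(Clone, Copy, Debug, PartialEq)]
struct Reg {
    class: RegClass,
    index: u8,
}

/// Toy context: hands out fresh registers and tracks which ones are live.
#[derive(Default)]
struct Context {
    next: u8,
    live: Vec<Reg>,
}

impl Context {
    fn reg_for_class(&mut self, class: RegClass) -> Reg {
        let reg = Reg { class, index: self.next };
        self.next += 1;
        self.live.push(reg);
        reg
    }

    fn free_reg(&mut self, reg: Reg) {
        self.live.retain(|r| *r != reg);
    }

    /// Simplified convert: the source is of the opposite class to the
    /// destination, and the destination stays live as the result.
    fn convert_op<F: FnMut(Reg, Reg)>(&mut self, dst_class: RegClass, mut emit: F) {
        let src_class = match dst_class {
            RegClass::Int => RegClass::Float,
            RegClass::Float => RegClass::Int,
        };
        let src = self.reg_for_class(src_class); // stands in for popping an operand
        let dst = self.reg_for_class(dst_class);
        emit(dst, src);
        self.free_reg(src);
    }

    /// One helper for both cases: the caller picks the temporary's class.
    fn convert_op_with_tmp_reg<F: FnMut(Reg, Reg, Reg)>(
        &mut self,
        dst_class: RegClass,
        tmp_class: RegClass,
        mut emit: F,
    ) {
        let tmp = self.reg_for_class(tmp_class);
        self.convert_op(dst_class, |dst, src| emit(dst, src, tmp));
        self.free_reg(tmp);
    }
}
```

The trade-off noted in the reply still applies: the visitor now has to know which class of temporary each operation needs.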

jeffcharles · 2024-01-12T15:11:26Z

I'll squash the commits and open a PR on the main repo

jeffcharles · 2024-01-12T15:19:46Z

bytecodealliance#7773

jeffcharles force-pushed the winch-float-conversion-instructions branch from bc02d7d to 1c0607c Compare December 15, 2023 22:00

jeffcharles force-pushed the winch-float-conversion-instructions branch 2 times, most recently from c33ab59 to 39c585d Compare January 3, 2024 16:40

saulecabrera reviewed Jan 11, 2024


jeffcharles requested a review from saulecabrera January 11, 2024 22:50

saulecabrera reviewed Jan 12, 2024


Winch: Float conversion instructions

5d783c6

jeffcharles force-pushed the winch-float-conversion-instructions branch from d441b02 to 5d783c6 Compare January 12, 2024 15:12

jeffcharles closed this Jan 12, 2024
