Implement support for utf16+latin1 in the component model #4309

alexcrichton · 2022-06-23T21:46:26Z

Currently support for utf16+latin1 is not implemented in Wasmtime, but we'll need to finish this and test it before the component model is considered done.

In general I'd expect that this would use the encoding_rs crate for the internal details of latin1 to avoid open-coding that in Wasmtime itself.

Lowering

Lowering a string into wasm is currently unimplemented. I think that this is required to implement the store_string_to_latin1_or_utf16 function in the canonical ABI explainer. My current understanding is that even if we could implement something more optimal in Rust we can't do that because the semantics of lowering are already specified.

I believe the pseudo-code there does most of the fiddly bits but some small helpers in encoding_rs are probably going to be required.

Lifting

Calculation of the byte length and actually getting the string are unimplemented. I think that we're free to use encoding_rs here however we see fit. Probably the decode_latin1 function will be useful here.

Other notes

I am personally unfamilar with latin1 as an encoding. I don't know if an arbitrary list of types are guaranteed to be valid latin1 or not. (the infallibility of decode_latin1 seems odd to me).

Using encoding_rs may be a better option for utf16 decoding we currently do (and maybe even utf8 since encoding_rs can probably do simd things that the standard library can't). If someone's intrepid it might be interesting to try to benchmark this and see if it's beneficial to use encoding_rs for almost everything.

The text was updated successfully, but these errors were encountered:

alexcrichton added the wasm-proposal:component-model Issues related to the WebAssembly Component Model proposal label Jun 23, 2022

alexcrichton mentioned this issue Jun 23, 2022

Tracking issue for implementing the component model #4185

Closed

42 tasks

alexcrichton mentioned this issue Aug 5, 2022

Implement strings in adapter modules #4623

Merged

alexcrichton closed this as completed in #4623 Aug 8, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Implement support for utf16+latin1 in the component model #4309

Implement support for utf16+latin1 in the component model #4309

alexcrichton commented Jun 23, 2022

Implement support for utf16+latin1 in the component model #4309

Implement support for utf16+latin1 in the component model #4309

Comments

alexcrichton commented Jun 23, 2022

Lowering

Lifting

Other notes