Cross-platform `win32-input-mode` usage #8343

o-sdn-o · 2020-11-20T11:21:26Z

Description of the new feature/enhancement

By creating this issue, I would like to start a public discussion about what could be changed in the current design of the win32-input-mode protocol before it becomes widespread and cross-platform.

The Considerations section states:

terminal/doc/specs/#4999 - Improved keyboard handling in Conpty.md

Lines 60 to 62 in 612b00c

    
           The goal we're trying to achieve is communicating `INPUT_RECORD`s from the 
        
           terminal to the client app via conpty. This isn't supposed to be a \*nix 
        
           terminal compatible communication, it's supposed to be fundamentally Win32-like.

At the same time, the section Future Considerations allows cross-platform use:

terminal/doc/specs/#4999 - Improved keyboard handling in Conpty.md

Lines 411 to 416 in 612b00c

    
           * Client applications that want to be able to read full Win32 keyboard input 
        
             from `conhost` _using VT_ will also be able to use `^[[?9001h` to do this. If 
        
             they emit `^[[?9001h`, then conhost will switch itself into 
        
             `win32-input-mode`, and the client will read `win32-input-mode` encoded 
        
             sequences as input. This could enable other cross-platform applications to 
        
             also use win32-like input in the future.

Improvements

List of possible improvements, issues and use cases not covered by this keyboard protocol:

~~No way to restore the normal mode of the terminal when the connection is lost or the application crashes.~~
Cross-platform win32-input-mode usage #8343 (comment)
Support win32-input-mode request acknowledgment with the current state of keyboard modifiers.
- Req: ^[[?9001h
- Ack: ^[[0;0;0;0;CtrlState;0_
Support for LeftShift and RightShift modifiers along with Shift.
Support for Unicode codepoint decimal value (for BaseCharacter) instead of UnicodeChar (Uc), which currently represents one UTF-16 code unit (wchar_t).
^[[Vk;Sc;Uc;Kd;Cs;Rc_ => ^[[Vk;Sc;BaseCharacter;Kd;Cs;Rc_
Support for passing/receiving an entire grapheme cluster as one keyboard event in order to get it by a single ReadKey call, e.g. getting an accented letter á represented as a sequence of codepoints. (GC = BaseCharacter + Codepoint1 + Cp2 + … + CpN).
Support multi-code-point characters in TerminalInput::HandleKey #2853
Cross-platform win32-input-mode usage #8343 (comment)
Cross-platform win32-input-mode usage #8343 (comment)

Possible solution:
- Add continuing codepoints as optional arguments to the event sequence:
  ^[[Vk;Sc;BaseCharacter;Kd;Cs;Rc [ ; Codepoint1 ; Cp2 ; … ; CpN ] _
Sending clipboard data without additional generation of keystroke sequence, since this data must already be enclosed (mandatory bracketed paste mode).

The text was updated successfully, but these errors were encountered:

j4james · 2023-06-11T15:55:00Z

No way to restore the normal mode of the terminal when the connection is lost or the application crashes.

This is a general problem that can apply to a lot of terminal states. Some of them you could potentially recover from by blindly typing reset in a WSL shell, or echoing an appropriate escape sequence, but that's not a realistic solution for most users. We need something like a Reset Terminal action that'll allow the user to easily recover from any messed up state - not just this one particular keyboard protocol.

This is a fairly common feature in terminal emulators, and I could have sworn we already had an issue for it, but I can't find it now. It's possible it was just something that was discussed in an unrelated issue, but we definitely should open an issue for it if there isn't one.

No way to get an entire grapheme cluster via single read key call, i.e. get accented a-letter á that represented as a codepoint sequence.

But the win32-input-mode is a direct representation of the key events we receive. If there was keyboard that allowed you to enter grapheme clusters, that would have to be in the form of multiple key events, and this is something that applications would already be expected to handle. Same goes for any characters outside the BMP for that matter - they're going to appear as two separate key events to represent the surrogate pair. We can't report a UTF-8 representation for combined events that we haven't received yet.

o-sdn-o · 2023-06-11T18:48:59Z

I think that it is necessary to support grapheme clusters in this keyboard protocol for the following reason.
The application must be able to compose codepoints into clusters. At the same time, it should mark how current cluster was composed. In the general case, it makes sense to distinguish a grapheme cluster assembled with several keystrokes from a grapheme cluster generated with one keystroke, since when deleting it with Backspace, in the first case, the codepoints will have to be deleted one-by-one, in the second, the whole cluster at a time. By using the Del key in both cases, we can delete the entire cluster.

For example, a user can compose a cluster by typing codepoint by codepoint using Alt+Num, and in the process correct errors using Backspace. Also, the user can use IME with one keystroke to get a complex cluster, and delete it with Backspace also in one keystroke.

Currently, we can only get fragmented keyboard events/clusters. Tomorrow we may be able to get entire grapheme cluster from Windows subsystems per call, and the keyboard protocol should support this.

o-sdn-o · 2023-06-13T07:15:43Z

Closing this issue in favor of vt-input-mode protocol.

j4james · 2023-06-13T09:10:55Z

For example, a user can compose a cluster by typing codepoint by codepoint using Alt+Num, and in the process correct errors using Backspace. Also, the user can use IME with one keystroke to get a complex cluster, and delete it with Backspace also in one keystroke.

Assuming this was possible, what happens when your editor saves these two different clusters to disk and later reloads them? Now there's no way to distinguish between the two, and a backspace would work in exactly the same way for both of them. To a user, the backspace behavior would just seem random.

I really cannot imagine an editor going to all that effort, when it's just going to cause confusion. If anything, it might make sense to backspace over the entire cluster with one keystroke in all cases, regardless of how the cluster was entered. That at least would be consistent.

o-sdn-o · 2023-06-13T09:27:04Z

It would be important for me here to clarify that we are talking only about the current open cluster in which code points are currently being added.

After the cluster is closed, it is atomic.

j4james · 2023-06-13T10:12:47Z

Then I don't understand why you'd need to tell the difference between a cluster entered as a single keystroke and one entered with multiple keystrokes. If the keyboard protocol splits a cluster into multiple keystrokes, the final keystroke would assumedly close the cluster, so it's still going to immediately become atomic isn't it?

o-sdn-o · 2023-06-13T10:18:28Z

The last entered cluster must be kept open, since it is not known whether an additional joiner will arrive or not, for example, with a skin tone modifier.

o-sdn-o · 2023-06-13T10:39:18Z

User can manually construct a grapheme cluster of any length they want by stringing together multiple zero-width joiners.

In the case of receiving Backspace and before that there was a cluster containing more than one codepoint received in one transaction, then it will delete this cluster entirely. If codepoints were received separately, only the last codepoint will be deleted.

Case 1

ReadKey call: BaseChar1
ReadKey call: BaseChar2
ReadKey call: Continuing Codepoint1
ReadKey call: Continuing Codepoint2
ReadKey call: Continuing Codepoint3
ReadKey call: Backspace
---------------
Result: BaseChar1 + BaseChar2+Codepoint1+Codepoint2

Case 2

ReadKey call: BaseChar1
ReadKey call: BaseChar2+Codepoint1+Codepoint2+Codepoint3
ReadKey call: Backspace
---------------
Result: BaseChar1

Case 3

ReadKey call: BaseChar1
ReadKey call: BaseChar2+Codepoint1+Codepoint2
ReadKey call: Continuing Codepoint3
ReadKey call: Backspace
ReadKey call: Backspace
---------------
Result: BaseChar1 + BaseChar2+Codepoint1

Note: Grapheme cluster segmentation is based on the codepoint properties from the Unicode Character Database and Grapheme Cluster Boundary Rules.

j4james · 2023-06-13T11:51:14Z

We seem to be going in circles here. I refer you back to my comment here: #8343 (comment)

I understand what you're suggesting - I just don't think it's a very good idea. That's just my opinion.

o-sdn-o · 2023-06-13T12:06:24Z

it might make sense to backspace over the entire cluster with one keystroke in all cases, regardless of how the cluster was entered.

For example GitHub web interface allows you to ZWJ+codepoint-by-ZWJ+codepoint backspace a complex cluster. Try to backspace the following cluster 🏴‍☠️. I am using FF as my desktop browser. It may behave differently in other browsers. Windows Notepad and Notepad++ allows you to backspace codepoint-by-codepoint.

o-sdn-o · 2023-06-13T12:16:15Z

And it's annoying when you insert a complex cluster with a single keystroke, and to remove it you have to press the backspace a dozen times. This does not apply to rare emoji. It seems to me when using a Devanagari-like writing system with complex clusters in each syllable, this is critical.

In other hand if the whole cluster will be deleted in one backspace, then in case of a typo, you need to retype the entire cluster from the beginning.

o-sdn-o · 2023-06-13T12:48:57Z

Probably a compromise here will be to completely delete either with the Del key or the backspace, but the whole cluster that has just arrived. In all other cases, the backspace deletes codepoint-by-codepoint.

j4james · 2023-06-13T12:49:10Z

This seems like something I'd want to configure in my editor though. Sometimes I might prefer backspacing over individual parts of a cluster, and sometimes I might prefer deleting the entire thing with a single backspace. That should be my choice to make, so I know it's going to work exactly the way I expect. I definitely wouldn't want the behavior randomly changing depending on which IME I happened to use, and then potentially changing again when I reload the content from disk.

o-sdn-o · 2023-06-13T14:18:58Z

I agree that disputes about what atomic or piecewise grapheme clusters should be is a matter of settings. But that's not the point.

I am based on the assumption that a certain operating system can output an entire cluster at a single keystroke, and I consider it fundamentally wrong to split it into a sequence of distinct keypresses and releases, as required by the current win32-input-mode implementation.

j4james · 2023-06-13T14:50:33Z

I consider it fundamentally wrong to split it into a sequence of distinct keypresses and releases

I sympathise with that viewpoint, but from my understanding of your proposed protocol, you're just creating more work for app devs. They already have to accept a cluster split into distinct keypresses, but now they also have to check for additional parameters at the end of the sequence that might be continuing codepoints. So unless there is some value to them in distinguishing between the two, it's just bunch of extra work for nothing.

Also, if an app has already been designed to work with win32-input-mode, they're not going to be expecting those continuing codepoints, so they'd lose half of the cluster. That is not what I would consider backwards compatible. So if you want to extend the protocol like this, I'd encourage you to use a different mode number.

The additional viewport and mouse stuff is an even bigger issue, but I don't want to get into a discussion about protocol design. As long as it's a separate mode that an application has specifically requested, you can do whatever you want.

o-sdn-o added the Issue-Feature Complex enough to require an in depth planning process and actual budgeted, scheduled work. label Nov 20, 2020

ghost added Needs-Triage It's a new issue that the core contributor team needs to triage at the next triage meeting Needs-Tag-Fix Doesn't match tag requirements labels Nov 20, 2020

o-sdn-o changed the title ~~[Win32 input] Generalization of win32-input-mode encoded sequences~~ Cross-platform win32-input-mode usage Dec 1, 2020

o-sdn-o closed this as completed Jun 24, 2021

o-sdn-o closed this as not planned Won't fix, can't repro, duplicate, stale Jun 4, 2022

o-sdn-o mentioned this issue Jun 10, 2023

Soft line breaks option when pasting from clipboard #15508

Closed

o-sdn-o reopened this Jun 11, 2023

o-sdn-o closed this as not planned Won't fix, can't repro, duplicate, stale Jun 13, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Cross-platform `win32-input-mode` usage #8343

Cross-platform `win32-input-mode` usage #8343

o-sdn-o commented Nov 20, 2020 •

edited

Loading

j4james commented Jun 11, 2023

o-sdn-o commented Jun 11, 2023 •

edited

Loading

o-sdn-o commented Jun 13, 2023 •

edited

Loading

j4james commented Jun 13, 2023

o-sdn-o commented Jun 13, 2023 •

edited

Loading

j4james commented Jun 13, 2023

o-sdn-o commented Jun 13, 2023

o-sdn-o commented Jun 13, 2023 •

edited

Loading

j4james commented Jun 13, 2023

o-sdn-o commented Jun 13, 2023

o-sdn-o commented Jun 13, 2023 •

edited

Loading

o-sdn-o commented Jun 13, 2023

j4james commented Jun 13, 2023

o-sdn-o commented Jun 13, 2023

j4james commented Jun 13, 2023

Cross-platform win32-input-mode usage #8343

Cross-platform win32-input-mode usage #8343

Comments

o-sdn-o commented Nov 20, 2020 • edited Loading

Description of the new feature/enhancement

Improvements

j4james commented Jun 11, 2023

o-sdn-o commented Jun 11, 2023 • edited Loading

o-sdn-o commented Jun 13, 2023 • edited Loading

j4james commented Jun 13, 2023

o-sdn-o commented Jun 13, 2023 • edited Loading

j4james commented Jun 13, 2023

o-sdn-o commented Jun 13, 2023

o-sdn-o commented Jun 13, 2023 • edited Loading

j4james commented Jun 13, 2023

o-sdn-o commented Jun 13, 2023

o-sdn-o commented Jun 13, 2023 • edited Loading

o-sdn-o commented Jun 13, 2023

j4james commented Jun 13, 2023

o-sdn-o commented Jun 13, 2023

j4james commented Jun 13, 2023

Cross-platform `win32-input-mode` usage #8343

Cross-platform `win32-input-mode` usage #8343

o-sdn-o commented Nov 20, 2020 •

edited

Loading

o-sdn-o commented Jun 11, 2023 •

edited

Loading

o-sdn-o commented Jun 13, 2023 •

edited

Loading

o-sdn-o commented Jun 13, 2023 •

edited

Loading

o-sdn-o commented Jun 13, 2023 •

edited

Loading

o-sdn-o commented Jun 13, 2023 •

edited

Loading