Tokenizer 1.26.1
Fixes and improvements
- Fix application of the BPE vocabulary when using preserve_segmented_tokens and a subword appears without joiner in the vocabulary
- Fix compilation with ICU versions older than 60