Skip to content

Releases: aallam/ktoken

0.4.0

07 Jun 15:11
af845c3
Compare
Choose a tag to compare

Added

  • O200K encoding

0.3.0

14 Oct 10:56
55d3485
Compare
Choose a tag to compare

Focused on internal refactoring and API stabilization to enhance usability and facilitate future development.

Breaking Changes

  • Rename Enconding to EncodingConfig.
  • Rework encodings to enhance extensibility:
    • Add a new Encoding interface.
    • Introduce default implementations of Encoding: CL100KBase, P50KBase, R50KBase, and P50KEdit.
  • Rename Tokenizer.encoding and Tokenizer.encodingForModel to Tokenizer.of with overrides.

0.2.0

13 Oct 12:28
fe44ce6
Compare
Choose a tag to compare

Changed

  • JVM: use local PBE loader by default
  • Encoding: enable custom encoding

0.1.0

08 Oct 19:42
4c649b1
Compare
Choose a tag to compare

Initial release.

Added

  • Encodings: CL100K_BASE , R50K_BASE, P50K_BASE and P50K_EDIT
  • Custom encoding support