Releases: PyThaiNLP/pythainlp

PyThaiNLP v4.1.0-beta4

05 Sep 04:30
Pre-release

Docs: https://pythainlp.github.io/dev-docs/
Report bug: https://github.com/PyThaiNLP/pythainlp/issues

Install: pip install --pre pythainlp

See 4.1 Milestone.

What's Changed

New Contributors

Full Changelog: v4.1.0-beta3...v4.1.0-beta4

PyThaiNLP v4.1.0-beta3

04 Aug 06:02
Pre-release

What's Changed

Full Changelog: v4.1.0-beta2...v4.1.0-beta3

PyThaiNLP v4.1.0-beta2

27 Jul 13:54
Pre-release

What's Changed

Full Changelog: v4.1.0-beta1...v4.1.0-beta2

PyThaiNLP v4.1.0-beta1

24 Jul 05:02
da86fe2
Pre-release

Schedule

  • First Beta release: 24 July 2023

Docs: https://pythainlp.github.io/dev-docs/
Report bug: https://github.com/PyThaiNLP/pythainlp/issues

Install: pip install --pre pythainlp

See 4.1 Milestone.

What is new?

Deprecation and other API changes

  • 5e97e7c Change the default NER to thainer-v2

New API

  • Add pythainlp.coref for Thai coreference resolution #802
  • Add wtpsplit to sentence segmentation & paragraph segmentation #804 and add paragraph_threshold into paragraph_tokenize function #806
  • Add word approximation to pythainlp.soundex.sound by @wannaphong in #809
  • Add pythainlp.wsd for Thai Word Sense Disambiguation by @wannaphong in #818
  • Add pythainlp.chat and WangChanGLM to pythainlp.generate by @wannaphong in #819
  • Add a param-free classification model (pythainlp.cls) by @c4n in #821
  • Add pythainlp.el by @wannaphong in #822
  • Add pythainlp.util.abbreviation_to_full_text by @wannaphong in #826

Tokenizer

  • Add wtpsplit engine to sentence_tokenize #804
  • New paragraph_tokenize function to split Thai text into paragraphs. #804
  • Add paragraph_threshold to the paragraph_tokenize function by @pavaris-pm in #806
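
A minimal usage sketch of the tokenizer additions listed above. The function names and the paragraph_threshold parameter come from these notes; the engine names "wtp" and "wtp-mini", the default threshold of 0.5, and the helper name split_thai_text are assumptions to verify against the documentation for your installed version. Running it needs pythainlp plus the optional wtpsplit dependency.

```python
def split_thai_text(text: str, threshold: float = 0.5):
    """Split Thai text into sentences and paragraphs with the wtpsplit engine.

    Hypothetical helper for illustration. The engine names below are
    assumptions; check the PyThaiNLP docs for the names your version accepts.
    """
    # Imported lazily so this module still loads when pythainlp
    # (or its optional wtpsplit dependency) is not installed.
    from pythainlp.tokenize import paragraph_tokenize, sentence_tokenize

    sentences = sentence_tokenize(text, engine="wtp")
    paragraphs = paragraph_tokenize(
        text,
        engine="wtp-mini",
        paragraph_threshold=threshold,  # parameter added in #806
    )
    return sentences, paragraphs
```

How the threshold affects the split is not described in these notes, so it is worth trying a few values on your own text.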

Translate

Corpus

Util

New Contributors

Full Changelog: v4.0.0...v4.1.0-beta1

PyThaiNLP v4.0.2 Released!

30 May 17:12

PyThaiNLP v4.0.2 is a bug fix release of PyThaiNLP v4.0.

Upgrade: pip install -U pythainlp

Documentation: https://pythainlp.github.io/docs/4.0

Report bug: https://github.com/PyThaiNLP/pythainlp/issues

See PyThaiNLP 4.0 Change Log

What's Changed

Full Changelog: v4.0.1...v4.0.2

Contributors

Thanks to all the contributors.

If you want to contribute to PyThaiNLP, you can read Contributing to PyThaiNLP.

PyThaiNLP v4.0.1 Released!

03 May 09:05

PyThaiNLP v4.0.1 is a bug fix release of PyThaiNLP v4.0.

Upgrade: pip install -U pythainlp

Documentation: https://pythainlp.github.io/docs/4.0

Report bug: https://github.com/PyThaiNLP/pythainlp/issues

See PyThaiNLP 4.0 Change Log

What's Changed

  • Fix mishandling Karun in Kavee Matra Checker by @HRNPH in #793
  • adding tonemark removal to fix mattra checking by @HRNPH in #795
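
As the title of #795 suggests, tone marks are removed before the matra check. The following is a minimal plain-Python sketch of tone-mark removal, not the code from that pull request; strip_tone_marks is a hypothetical name, and the sketch relies only on the Unicode code points of the four Thai tone marks.

```python
# The four Thai tone marks: mai ek, mai tho, mai tri, mai chattawa
# (U+0E48 to U+0E4B).
THAI_TONE_MARKS = "\u0e48\u0e49\u0e4a\u0e4b"

# Translation table that deletes every tone mark.
_STRIP_TONES = str.maketrans("", "", THAI_TONE_MARKS)


def strip_tone_marks(text: str) -> str:
    """Return text with Thai tone marks removed (hypothetical helper)."""
    return text.translate(_STRIP_TONES)


# "น้ำ" (water) without its mai tho becomes "นำ".
print(strip_tone_marks("\u0e19\u0e49\u0e33"))
```

The library ships its own helper for this, pythainlp.util.remove_tonemark, which these notes mention as the replacement for delete_tone.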

Full Changelog: v4.0.0...v4.0.1

Contributors

Thanks to all the contributors.

If you want to contribute to PyThaiNLP, you can read Contributing to PyThaiNLP.

PyThaiNLP 4.0 Released!

14 Apr 12:24
462a83e

PyThaiNLP published its first version, 0.0.4, to PyPI six years ago, so PyThaiNLP 4.0 has a special codename: PyThaiNLP 4.0 (Real).

See 4.0 Milestone.

Documentation: https://pythainlp.github.io/docs/4.0

Report bug: https://github.com/PyThaiNLP/pythainlp/issues

See PyThaiNLP 4.0 Change Log

If you want to contribute to PyThaiNLP, you can read Contributing to PyThaiNLP.

What is new?

Deprecation and other API changes

  • Delete all LST20 model #728
  • 947c7be Change pythainlp.tools.misspell to pythainlp.tools.misspell.misspell

Improve

  • Reduce import time #719
  • Fix/broken numeric data format (#652) #723

Tokenizer

  • Add blackboard cls #732
  • Add rule to TCC and Change TCC rule for newmm #741

Tag

  • Add blackboard pos_tag #733
  • Add ThaiNER 2.0 #781

Util

  • Add pythainlp.util.count_thai_chars #748
  • Add thai_strptime and convert_years #767
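
Year conversion of the kind convert_years covers rests on a fixed offset between the Buddhist Era (BE) used in Thailand and the Common Era (CE). A minimal sketch of that arithmetic, assuming only the 543-year offset; be_to_ce and ce_to_be are hypothetical names, not the library's API, and the sketch ignores the other calendars the library function may handle.

```python
BE_OFFSET = 543  # Buddhist Era year = Common Era year + 543


def ce_to_be(year_ce: int) -> int:
    """Convert a Common Era year to the Thai Buddhist Era year."""
    return year_ce + BE_OFFSET


def be_to_ce(year_be: int) -> int:
    """Convert a Thai Buddhist Era year to the Common Era year."""
    return year_be - BE_OFFSET


print(ce_to_be(2023))  # 2566
print(be_to_ce(2566))  # 2023
```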

Transliterate

  • Add Thai2Rom ONNX model #743

Khavee

  • add khavee to pythainlp #777
  • add aek/too checker function to khavee #779

Parse

  • Add ud_goeswith #757

Corpus

  • Add new science word #763

Full Changelog: v3.1.0...v4.0.0

Contributors

Thanks to all the contributors.

If you want to contribute to PyThaiNLP, you can read Contributing to PyThaiNLP.

New Contributors

PyThaiNLP v4.0.0-beta1

01 Apr 16:28
b881672
Pre-release

This post gives the change log for PyThaiNLP 4.0. PyThaiNLP published its first version, 0.0.4, to PyPI six years ago, so PyThaiNLP 4.0 has a special codename: PyThaiNLP 4.0 (Real).

This release is the first beta release of PyThaiNLP 4.0.

Schedule

  • Beta release: 1 April 2023
  • Production release: 14 April 2023

See 4.0 Milestone.

What is new?

Deprecation and other API changes

  • Delete all LST20 model #728
  • 947c7be Change pythainlp.tools.misspell to pythainlp.tools.misspell.misspell

Improve

  • Reduce import time #719
  • Fix/broken numeric data format (#652) #723

Tokenizer

  • Add blackboard cls #732
  • Add rule to TCC and Change TCC rule for newmm #741

Tag

  • Add blackboard pos_tag #733
  • Add ThaiNER 2.0 #781

Util

  • Add pythainlp.util.count_thai_chars #748
  • Add thai_strptime and convert_years #767

Transliterate

  • Add Thai2Rom ONNX model #743

Khavee

  • add khavee to pythainlp #777
  • add aek/too checker function to khavee #779

Parse

  • Add ud_goeswith #757

Corpus

  • Add new science word #763

What's Changed

New Contributors

Full Changelog: v3.1.0...v4.0.0-beta1

PyThaiNLP v3.1.1 Released!

30 Oct 18:14
7803adc

PyThaiNLP v3.1.1 is an update release for PyThaiNLP v3.1.0.

What's Changed

  • pythainlp.tools.misspell changed to pythainlp.tools.misspell.misspell.
  • Add Reduce import time #719 to PyThaiNLP 3.1.1 #753
  • Doc: Lst20 deprecation warning for 3.1.1 (#749) #752 (Thank you @noppayut)

Full Changelog: v3.1.0...v3.1.1

You can install or upgrade with pip install pythainlp==3.1.1.

Documentation: https://pythainlp.github.io/docs/3.1

Report bug: https://github.com/PyThaiNLP/pythainlp/issues

See PyThaiNLP 3.1 change log

See 3.1 Milestone.

Contributors

Thanks to all the contributors.

PyThaiNLP v3.1.0 Released!

24 Sep 15:33
fc60cd3

This is the release version of PyThaiNLP v3.1.0.

You can install it with pip install pythainlp==3.1.0.

Documentation: https://pythainlp.github.io/docs/3.1

Report bug: https://github.com/PyThaiNLP/pythainlp/issues

See PyThaiNLP 3.1 change log

See 3.1 Milestone.

What is new?

Deprecation and other API changes

#687 Remove deprecated functions:

  • pythainlp.word_vector: doesnt_match, get_model, most_similar_cosmul, sentence_vectorizer, similarity. Use the WordVector class instead.
  • pythainlp.util.delete_tone. Use pythainlp.util.remove_tonemark instead.
  • pythainlp.util.time_time. Use pythainlp.util.time_to_thaiword instead.
  • pythainlp.tokenize.syllable_tokenize. Use pythainlp.tokenize.subword_tokenize instead.
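
When upgrading a codebase, the removals above can be checked mechanically. The sketch below encodes the old-to-new names exactly as listed in this release note and scans a source string for the removed ones. find_removed_names is a hypothetical helper; it only catches fully qualified dotted names, so aliased imports (for example from pythainlp.util import delete_tone) would need a separate check.

```python
import re
from typing import Dict

# Old name -> suggested replacement, copied from the list above.
REPLACEMENTS: Dict[str, str] = {
    "pythainlp.util.delete_tone": "pythainlp.util.remove_tonemark",
    # Name as spelled in the release note.
    "pythainlp.util.time_time": "pythainlp.util.time_to_thaiword",
    "pythainlp.tokenize.syllable_tokenize": "pythainlp.tokenize.subword_tokenize",
    "pythainlp.word_vector.doesnt_match": "WordVector class",
    "pythainlp.word_vector.get_model": "WordVector class",
    "pythainlp.word_vector.most_similar_cosmul": "WordVector class",
    "pythainlp.word_vector.sentence_vectorizer": "WordVector class",
    "pythainlp.word_vector.similarity": "WordVector class",
}


def find_removed_names(source: str) -> Dict[str, str]:
    """Return {old_name: replacement} for each removed name found in source."""
    return {
        old: new
        for old, new in REPLACEMENTS.items()
        if re.search(rf"\b{re.escape(old)}\b", source)
    }


print(find_removed_names("clean = pythainlp.util.delete_tone(text)"))
```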

Dependency Parsing

  • PyThaiNLP now supports dependency parsing 🎉 Add pythainlp.parse.dependency_parsing #706

Named Entity Tagging

  • #665 Add Thai-NNER pythainlp.tag.NNER
  • #658 Add LST20NER ONNX model. It is the LST20NER model, fine-tuned from WangchanBERTa, converted to ONNX.

Transliteration

  • #659 Add ISO 11940 transliteration
  • #660 Add Thai W2P v0.2
  • #686 Add wunsen
  • #694 Wunsen Mandarin and Japanese update

PyThaiNLP Corpus downloader

  • #656 Add support for zip/tar.gz archives when downloading a corpus

Text normalization

  • #673 Add a normalising rule for Lakkhangyao ๅ

Translate

  • #674 add gpu option

Text summarize

  • #679 Add mt5 cpe kmutt thai sentence sum

Util

  • #682 Add live-dead syllable classification
  • #684 Add live dead syllable classify
  • #690 Add tone detector

Soundex

  • #699 Add Thai-English Cross-Language Transliterated Word Retrieval using Soundex Technique

Other

  • #689 map NG tag to PART
  • #691 Remove TinyDB as a dependency
  • #692 Fix notifications that newer versions of corpora are available
  • Add warning about LST20 license

Contributors

New Contributors

Full Changelog: v3.0.10...v3.1.0

All Contributors

Thanks to all the contributors.
