diff --git a/CHANGELOG.md b/CHANGELOG.md index bc6a260..ebc8f03 100644 --- a/CHANGELOG.md +++ b/CHANGELOG.md @@ -13,6 +13,11 @@ and this project adheres to [Semantic Versioning](https://semver.org/). ### Fixed ### Security +## [3.2.1] - 2021-03-21 + +### Fixed +* Fixed character issues in the built-in HKCanCor data: 𠮩𠹌, 𠻗 + ## [3.2.0] - 2021-03-20 Note: The underlying CHAT parser, the PyLangAcq package, has been bumped to v0.13.0. diff --git a/docs/.buildinfo b/docs/.buildinfo index 6b129ee..756f522 100644 --- a/docs/.buildinfo +++ b/docs/.buildinfo @@ -1,4 +1,4 @@ # Sphinx build info version 1 # This file hashes the configuration used when building these files. When it is not found, a full rebuild will be done. -config: f9cf05d75d303d74889bec845d2bf729 +config: 1b2c69900a20a5e6d404547532dec06c tags: 645f666f9bcd5a90fca523b33c5a78b7 diff --git a/docs/_modules/index.html b/docs/_modules/index.html index e9b1113..76d757a 100644 --- a/docs/_modules/index.html +++ b/docs/_modules/index.html @@ -7,7 +7,7 @@ -
- © Copyright 2014-2021, Jackson L. Lee | Documentation last updated on March 20, 2021 + © Copyright 2014-2021, Jackson L. Lee | Documentation last updated on March 21, 2021
- © Copyright 2014-2021, Jackson L. Lee | Documentation last updated on March 20, 2021
+ © Copyright 2014-2021, Jackson L. Lee | Documentation last updated on March 21, 2021
diff --git a/docs/_modules/pycantonese/jyutping/characters.html b/docs/_modules/pycantonese/jyutping/characters.html
index 113cdec..1a97900 100644
--- a/docs/_modules/pycantonese/jyutping/characters.html
+++ b/docs/_modules/pycantonese/jyutping/characters.html
@@ -7,7 +7,7 @@
- pycantonese.jyutping.characters — PyCantonese 3.2.0 documentation
+ pycantonese.jyutping.characters — PyCantonese 3.2.1 documentation
@@ -64,7 +64,7 @@
- 3.2.0
+ 3.2.1
@@ -170,61 +170,65 @@
- © Copyright 2014-2021, Jackson L. Lee | Documentation last updated on March 20, 2021 + © Copyright 2014-2021, Jackson L. Lee | Documentation last updated on March 21, 2021
- © Copyright 2014-2021, Jackson L. Lee | Documentation last updated on March 20, 2021 + © Copyright 2014-2021, Jackson L. Lee | Documentation last updated on March 21, 2021
- © Copyright 2014-2021, Jackson L. Lee | Documentation last updated on March 20, 2021
+ © Copyright 2014-2021, Jackson L. Lee | Documentation last updated on March 21, 2021
diff --git a/docs/_modules/pycantonese/jyutping/yale.html b/docs/_modules/pycantonese/jyutping/yale.html
index 36787ef..1d5fa37 100644
--- a/docs/_modules/pycantonese/jyutping/yale.html
+++ b/docs/_modules/pycantonese/jyutping/yale.html
@@ -7,7 +7,7 @@
- pycantonese.jyutping.yale — PyCantonese 3.2.0 documentation
+ pycantonese.jyutping.yale — PyCantonese 3.2.1 documentation
@@ -64,7 +64,7 @@
- 3.2.0
+ 3.2.1
@@ -170,61 +170,65 @@
- © Copyright 2014-2021, Jackson L. Lee | Documentation last updated on March 20, 2021
+ © Copyright 2014-2021, Jackson L. Lee | Documentation last updated on March 21, 2021
diff --git a/docs/_modules/pycantonese/pos_tagging/hkcancor_to_ud.html b/docs/_modules/pycantonese/pos_tagging/hkcancor_to_ud.html
index 78b4349..23c23e1 100644
--- a/docs/_modules/pycantonese/pos_tagging/hkcancor_to_ud.html
+++ b/docs/_modules/pycantonese/pos_tagging/hkcancor_to_ud.html
@@ -7,7 +7,7 @@
- pycantonese.pos_tagging.hkcancor_to_ud — PyCantonese 3.2.0 documentation
+ pycantonese.pos_tagging.hkcancor_to_ud — PyCantonese 3.2.1 documentation
@@ -64,7 +64,7 @@
- 3.2.0
+ 3.2.1
@@ -170,61 +170,65 @@
- © Copyright 2014-2021, Jackson L. Lee | Documentation last updated on March 20, 2021 + © Copyright 2014-2021, Jackson L. Lee | Documentation last updated on March 21, 2021
except: # noqa raise EnvironmentError( f"A file is detected at {path}, but it cannot be read as a " - "a tagger model. The likely cause is that you do not have " - "Git LFS installed on your system -- please install it " - "(https://git-lfs.github.com/) and re-install pycantonese " - "with this command: " - "pip install git+https://github.com/jacksonllee/pycantonese.git@master#egg=pycantonese" # noqa: E501 + "a tagger model. " + "Either the tagger model file object is corrupted for some reason, " + "or - perhaps more likely - you're running pycantonese from a local " + "git repo (e.g., when you are doing dev work) and that you do not have " + "Git LFS installed on your system. " + "In the latter case, please install Git LFS " + "(https://git-lfs.github.com/) and re-install pycantonese." ) self.model.weights, self.tagdict, self.classes = w_td_c self.model.classes = self.classes @@ -654,7 +660,7 @@
- © Copyright 2014-2021, Jackson L. Lee | Documentation last updated on March 20, 2021 + © Copyright 2014-2021, Jackson L. Lee | Documentation last updated on March 21, 2021
- © Copyright 2014-2021, Jackson L. Lee | Documentation last updated on March 20, 2021
+ © Copyright 2014-2021, Jackson L. Lee | Documentation last updated on March 21, 2021
diff --git a/docs/_modules/pycantonese/word_segmentation.html b/docs/_modules/pycantonese/word_segmentation.html
index 3e9b95d..10353fb 100644
--- a/docs/_modules/pycantonese/word_segmentation.html
+++ b/docs/_modules/pycantonese/word_segmentation.html
@@ -7,7 +7,7 @@
- pycantonese.word_segmentation — PyCantonese 3.2.0 documentation
+ pycantonese.word_segmentation — PyCantonese 3.2.1 documentation
@@ -64,7 +64,7 @@
- 3.2.0
+ 3.2.1
@@ -170,61 +170,65 @@
- © Copyright 2014-2021, Jackson L. Lee | Documentation last updated on March 20, 2021 + © Copyright 2014-2021, Jackson L. Lee | Documentation last updated on March 21, 2021
- © Copyright 2014-2021, Jackson L. Lee | Documentation last updated on March 20, 2021
+ © Copyright 2014-2021, Jackson L. Lee | Documentation last updated on March 21, 2021
diff --git a/docs/_sources/changelog.rst.txt b/docs/_sources/changelog.rst.txt
index 36b314c..66891b0 100644
--- a/docs/_sources/changelog.rst.txt
+++ b/docs/_sources/changelog.rst.txt
@@ -24,6 +24,15 @@ Fixed
Security
^^^^^^^^
+[3.2.1] - 2021-03-21
+--------------------
+
+Fixed
+^^^^^
+
+
+* Fixed character issues in the built-in HKCanCor data: 𠮩𠹌, 𠻗
+
[3.2.0] - 2021-03-20
--------------------
diff --git a/docs/_sources/data.rst.txt b/docs/_sources/data.rst.txt
index 48d1181..a180e5d 100644
--- a/docs/_sources/data.rst.txt
+++ b/docs/_sources/data.rst.txt
@@ -44,7 +44,7 @@ and part-of-speech tags.
The original HKCanCor source files are in an XML format.
They have been converted to CHAT for incorporation into PyCantonese.
On the format conversion, please consult this
-`readme `_.
+`readme `_.
CHILDES and TalkBank Data
diff --git a/docs/_static/documentation_options.js b/docs/_static/documentation_options.js
index a9749b3..e2359c7 100644
--- a/docs/_static/documentation_options.js
+++ b/docs/_static/documentation_options.js
@@ -1,6 +1,6 @@
var DOCUMENTATION_OPTIONS = {
URL_ROOT: document.getElementById("documentation_options").getAttribute('data-url_root'),
- VERSION: '3.2.0',
+ VERSION: '3.2.1',
LANGUAGE: 'None',
COLLAPSE_INDEX: false,
BUILDER: 'html',
diff --git a/docs/api.html b/docs/api.html
index e1f7f83..1d61373 100644
--- a/docs/api.html
+++ b/docs/api.html
@@ -7,7 +7,7 @@
- API Reference — PyCantonese 3.2.0 documentation
+ API Reference — PyCantonese 3.2.1 documentation
@@ -66,7 +66,7 @@
- 3.2.0
+ 3.2.1
@@ -172,61 +172,65 @@
- Security
-- [3.2.0] - 2021-03-20
-- Added
-- Changed
-- Deprecated
-- Fixed
+- [3.2.1] - 2021-03-21
-- [3.1.1] - 2021-03-18
+- [3.2.0] - 2021-03-20
-- [3.1.0] - 2021-02-21
-- Added
-- Fixed
+- [3.1.1] - 2021-03-18
-- [3.0.0] - 2020-10-25
-- Added
-- Changed
-- API-breaking Changes
-- Non-API-breaking Changes
+- [3.1.0] - 2021-02-21
-- Deprecated
-- Security
+- [3.0.0] - 2020-10-25
+- Added
+- Changed
-- [2.4.1] - 2020-10-10
-- [2.4.0] - 2020-10-10
-- Added
+- [2.4.1] - 2020-10-10
-- [2.3.0] - 2020-07-24
+- [2.4.0] - 2020-10-10
-- [2.2.0] - 2018-06-30
-- Added
+- [2.3.0] - 2020-07-24
-- [2.1.0] - 2018-06-11
- Research Outputs
@@ -1597,7 +1601,7 @@
- © Copyright 2014-2021, Jackson L. Lee | Documentation last updated on March 20, 2021
+ © Copyright 2014-2021, Jackson L. Lee | Documentation last updated on March 21, 2021
Fixed character issues in the built-in HKCanCor data: 𠮩𠹌, 𠻗
Note: The underlying CHAT parser, the PyLangAcq package, has been bumped to v0.13.0. All of the updates of PyLangAcq’s CHAT reader apply to this PyCantonese release as well. The details are in PyLangAcq’s changelog for v0.13.0. The changelog entries below only document updates specific to PyCantonese.
-Defined the Jyutping
class to better represent parsed Jyutping romanization.
Bumped the PyLangAcq dependency to v0.13.0.
The function parse_jyutping
now returns a list of Jyutping
objects,
rather than tuples of strings.
The following methods in the CHATReader
class have been deprecated:
Fixed the character issues in the built-in HKCanCor data: 𠺢, 𠺝, 𡁜, 𧕴, 𥊙, 𡃓, 𠴕, 𡀔
Part-of-speech tagging:
The test suite now covers code snippets in both the docstrings and .rst
doc files.
Word segmentation:
Added support for Python 3.9.
The following x2y
functions have been deprecated in favor of their
equivalents named in the form of x_to_y
.
Turned on HTTPS for the pycantonese.org domain.
Switched to the wordseg
dependency to a PyPI source instead of a
GitHub direct link.
Added support for Python 3.7 and 3.8.
Exposed the exclude
parameter in various reader methods
for excluding specific participants. This parameter was implemented at
pylangacq v0.10.0.
Allowed “n” to be a syllabic nasal.
Fixed corpus reader not picking up the characters.
Fixed the Jyutping-Yale conversion issue with “yu”
Added number_of_words()
and number_of_characters()
for corpus access
Fixed corpus access path issues
The Hong Kong Cantonese Corpus is included in the package.
A general-purpose search()
function is defined, replacing the
element-specific search functions from version 0.1.
Basic functions available, including…
Parsing Jyutping romanization
- © Copyright 2014-2021, Jackson L. Lee | Documentation last updated on March 20, 2021 + © Copyright 2014-2021, Jackson L. Lee | Documentation last updated on March 21, 2021
- © Copyright 2014-2021, Jackson L. Lee | Documentation last updated on March 20, 2021 + © Copyright 2014-2021, Jackson L. Lee | Documentation last updated on March 21, 2021
- © Copyright 2014-2021, Jackson L. Lee | Documentation last updated on March 20, 2021 + © Copyright 2014-2021, Jackson L. Lee | Documentation last updated on March 21, 2021
- © Copyright 2014-2021, Jackson L. Lee | Documentation last updated on March 20, 2021 + © Copyright 2014-2021, Jackson L. Lee | Documentation last updated on March 21, 2021
- © Copyright 2014-2021, Jackson L. Lee | Documentation last updated on March 20, 2021 + © Copyright 2014-2021, Jackson L. Lee | Documentation last updated on March 21, 2021
- © Copyright 2014-2021, Jackson L. Lee | Documentation last updated on March 20, 2021 + © Copyright 2014-2021, Jackson L. Lee | Documentation last updated on March 21, 2021
- © Copyright 2014-2021, Jackson L. Lee | Documentation last updated on March 20, 2021 + © Copyright 2014-2021, Jackson L. Lee | Documentation last updated on March 21, 2021
- © Copyright 2014-2021, Jackson L. Lee | Documentation last updated on March 20, 2021 + © Copyright 2014-2021, Jackson L. Lee | Documentation last updated on March 21, 2021
- © Copyright 2014-2021, Jackson L. Lee | Documentation last updated on March 20, 2021 + © Copyright 2014-2021, Jackson L. Lee | Documentation last updated on March 21, 2021
- © Copyright 2014-2021, Jackson L. Lee | Documentation last updated on March 20, 2021 + © Copyright 2014-2021, Jackson L. Lee | Documentation last updated on March 21, 2021
- © Copyright 2014-2021, Jackson L. Lee | Documentation last updated on March 20, 2021 + © Copyright 2014-2021, Jackson L. Lee | Documentation last updated on March 21, 2021
- © Copyright 2014-2021, Jackson L. Lee | Documentation last updated on March 20, 2021 + © Copyright 2014-2021, Jackson L. Lee | Documentation last updated on March 21, 2021
- © Copyright 2014-2021, Jackson L. Lee | Documentation last updated on March 20, 2021 + © Copyright 2014-2021, Jackson L. Lee | Documentation last updated on March 21, 2021
- © Copyright 2014-2021, Jackson L. Lee | Documentation last updated on March 20, 2021 + © Copyright 2014-2021, Jackson L. Lee | Documentation last updated on March 21, 2021
- © Copyright 2014-2021, Jackson L. Lee | Documentation last updated on March 20, 2021 + © Copyright 2014-2021, Jackson L. Lee | Documentation last updated on March 21, 2021
- © Copyright 2014-2021, Jackson L. Lee | Documentation last updated on March 20, 2021 + © Copyright 2014-2021, Jackson L. Lee | Documentation last updated on March 21, 2021
- © Copyright 2014-2021, Jackson L. Lee | Documentation last updated on March 20, 2021 + © Copyright 2014-2021, Jackson L. Lee | Documentation last updated on March 21, 2021
- © Copyright 2014-2021, Jackson L. Lee | Documentation last updated on March 20, 2021 + © Copyright 2014-2021, Jackson L. Lee | Documentation last updated on March 21, 2021
- © Copyright 2014-2021, Jackson L. Lee | Documentation last updated on March 20, 2021 + © Copyright 2014-2021, Jackson L. Lee | Documentation last updated on March 21, 2021
- © Copyright 2014-2021, Jackson L. Lee | Documentation last updated on March 20, 2021 + © Copyright 2014-2021, Jackson L. Lee | Documentation last updated on March 21, 2021
- © Copyright 2014-2021, Jackson L. Lee | Documentation last updated on March 20, 2021 + © Copyright 2014-2021, Jackson L. Lee | Documentation last updated on March 21, 2021
- © Copyright 2014-2021, Jackson L. Lee | Documentation last updated on March 20, 2021 + © Copyright 2014-2021, Jackson L. Lee | Documentation last updated on March 21, 2021
- © Copyright 2014-2021, Jackson L. Lee | Documentation last updated on March 20, 2021 + © Copyright 2014-2021, Jackson L. Lee | Documentation last updated on March 21, 2021
- © Copyright 2014-2021, Jackson L. Lee | Documentation last updated on March 20, 2021 + © Copyright 2014-2021, Jackson L. Lee | Documentation last updated on March 21, 2021
- © Copyright 2014-2021, Jackson L. Lee | Documentation last updated on March 20, 2021 + © Copyright 2014-2021, Jackson L. Lee | Documentation last updated on March 21, 2021