Best engines to choose from #816

Closed Answered by wannaphong
Nihisil asked this question in Q&A

Hello! Thai has no official word segmentation (word tokenization) standard from the bodies that plan and regulate the language, so segmentation results depend on which standard each engine or corpus follows. If you have the resources, use a deep-learning-based engine, but be aware that it can suffer from out-of-domain problems (see "Handling Cross- and Out-of-Domain Samples in Thai Word Segmentation"). If your text is out of domain for the deep learning model and you do not have the resources to hire a Thai linguist, you can use newmm and improve its Thai dictionary.
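To illustrate why improving the dictionary helps a dictionary-based engine, here is a minimal sketch in plain Python. It is not newmm itself: it is a simple greedy longest-match segmenter, and the function name `longest_match_tokenize` and the tiny word lists are hypothetical, chosen only for this example. It shows how adding one domain word changes the output.

```python
def longest_match_tokenize(text, dictionary):
    """Greedy longest-match segmentation (an illustrative stand-in, not newmm)."""
    max_len = max((len(word) for word in dictionary), default=1)
    tokens = []
    i = 0
    while i < len(text):
        # Try the longest possible candidate first, then shrink it.
        for size in range(min(max_len, len(text) - i), 0, -1):
            candidate = text[i:i + size]
            if candidate in dictionary:
                break
        else:
            # No dictionary word starts here: emit a single character.
            candidate = text[i]
        tokens.append(candidate)
        i += len(candidate)
    return tokens


# Hypothetical base dictionary of general-purpose words.
base_words = {"ฉัน", "กิน", "ข้าว", "มัน", "ไก่"}
# Hypothetical domain term (a dish name) missing from the base dictionary.
domain_words = {"ข้าวมันไก่"}

text = "ฉันกินข้าวมันไก่"

before = longest_match_tokenize(text, base_words)
after = longest_match_tokenize(text, base_words | domain_words)

print(before)  # the dish name is split into three separate words
print(after)   # the dish name is kept as a single token
```

In PyThaiNLP the analogous step would be to extend the default word list with your domain terms and pass the result to the tokenizer as a custom dictionary; check the library documentation for the exact parameters in your installed version.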

Replies: 3 comments

Answer selected by Nihisil
Category
Q&A
2 participants