Malkitti / Corpusandcodes Public

Notifications You must be signed in to change notification settings
Fork 0
Star 0

Code and Corpus for Indian Language Computation

0 stars 0 forks Branches Tags Activity

Notifications

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
20wfrq.txt		20wfrq.txt
Malayalam_morph_analyzer.rar		Malayalam_morph_analyzer.rar
README.md		README.md
T_L.UNK.M0.LR.MRG		T_L.UNK.M0.LR.MRG
knd_morph.tgz		knd_morph.tgz
tel_morph.tgz		tel_morph.tgz

Repository files navigation

Corpusandcodes

Code and corpus for Indian language computation

This page contatins codes and corpora for morphological segmentation of Dravidian Languages. At first we have, Kannada, Malaylam, Telugu and Tamil. All the corpora is extracted from Amrita University, IIIT-H, IIIT-M Kerala implemented morphological analysers. It also contains cleaned Wikipidieda text for Kannada, Malaylam, Telugu and Tamil. As Github doesn't allow to include files that are bigger than 25 MB. We only upload the models.

For the entire corpus and codes, please contact - Arun - akallararajappan@\uoc.edu

About

Code and Corpus for Indian Language Computation

telugu

Report repository

Releases

No releases published

Packages

No packages published