LemmaGen lemmatizer module

This is a lemmatizer for Bulgarian, Czech, English, Estonian, French, German, Hungarian, Italian, Romanian, Serbian, Slovene and Spanish.

The file directory structure is as follows:

lemmagen/              - LemmaGen Python bindings
lemmagen/dictionaries/ - Binary dictionaries for the supported languages
src/                   - Lemmatizer C++ source

Contributors:

Original version from the Jožef Stefan Institute (http://lemmatise.ijs.si/)

  • Jernej Virag
  • Domen Grabec
  • Gašper Žejn