Skip to content
Change the repository type filter

All

    Repositories list

    • HTML
      MIT License
      0100Updated Oct 10, 2023Oct 10, 2023
    • A 1M+-token Hungarian named entity dataset with ~30 entity types derived from NYTK-NerKor
      0000Updated Jul 4, 2022Jul 4, 2022
    • AraSum

      Public
      Arab Summarization Corpus
      0200Updated Jun 9, 2022Jun 9, 2022
    • Nom-or-what algorithm, designed to disambiguate case endings on nouns, adjectives, numerals etc. in Hungarian.
      Python
      1001Updated Oct 20, 2021Oct 20, 2021
    • locatives

      Public
      0000Updated May 11, 2021May 11, 2021
    • 0000Updated May 11, 2021May 11, 2021
    • postp

      Public
      Data of the study on postpositions (PhD thesis, Noémi Ligeti-Nagy)
      0000Updated May 10, 2021May 10, 2021
    • purepos

      Public
      PurePos is an open source hybrid morphological tagger.
      Java
      GNU Lesser General Public License v3.0
      71521Updated Oct 13, 2020Oct 13, 2020
    • purepospy

      Public
      Python wrapper for PurePos
      Java
      GNU Lesser General Public License v3.0
      0100Updated Dec 14, 2019Dec 14, 2019
    • emmorphpy

      Public
      A wrapper, a lemmatizer and REST API implemented in Python for emMorph (Humor) Hungarian morphological analyzer
      Python
      GNU Lesser General Public License v3.0
      0300Updated Nov 6, 2019Nov 6, 2019
    • HunTag3

      Public
      A sequential tagger for NLP using Maximum Entropy Learning and Hidden Markov Models
      Lex
      GNU Lesser General Public License v3.0
      10800Updated Nov 6, 2019Nov 6, 2019
    • manocska

      Public
      Manócska -- integrált igei vonzatkeret adatbázis
      Python
      0400Updated Jun 21, 2019Jun 21, 2019
    • Python
      GNU Lesser General Public License v3.0
      0000Updated May 24, 2019May 24, 2019
    • algorithm for case-disambiguation
      Python
      0100Updated Nov 21, 2018Nov 21, 2018
    • Egy pszicholingvisztikai indíttatású elemző modell
      Python
      GNU Lesser General Public License v3.0
      0100Updated Nov 12, 2018Nov 12, 2018
    • The program used in the paper 'Less is More, More or Less... – Finding the Optimal Threshold for Lexicalisation in Chunking' by Balázs Indig
      Python
      GNU General Public License v3.0
      0100Updated Sep 21, 2018Sep 21, 2018
    • What's Wrong With My NLP? is visualizer and graphical diff for Natural Language Processing problems. We are reimplementing this program in Python 3. For more information about the original program go to http://whatswrong.googlecode.com
      Python
      GNU General Public License v3.0
      1590Updated Aug 29, 2018Aug 29, 2018
    • pywnxml

      Public
      Python3 API for WordNet XML (Hungarian WordNet / BalkaNet / VisDic format)
      Python
      GNU General Public License v2.0
      5500Updated May 15, 2018May 15, 2018
    • vframe

      Public
      A method for constraining possible verbal frames based on the preverb and the infinitival argument for Hungarian verbs
      Python
      GNU Lesser General Public License v3.0
      0000Updated Jan 22, 2018Jan 22, 2018
    • Simple Python command line tools for retrieving a list of urls and specific files in bulk
      Python
      GNU Lesser General Public License v3.0
      1100Updated Jan 19, 2018Jan 19, 2018
    • The program used in the paper 'Gut, Besser, Chunker – Selecting the best models for text chunking with voting' by Balázs Indig and István Endrédy
      Python
      GNU Lesser General Public License v3.0
      1100Updated Sep 8, 2016Sep 8, 2016
    • PurePOS rewritten in Python3
      Python
      GNU Lesser General Public License v3.0
      0370Updated Jun 8, 2016Jun 8, 2016
    • boilerplate removal test set for portals (more sites from the same domain)
      HTML
      0100Updated Mar 22, 2016Mar 22, 2016
    • SS05

      Public
      The original SS05 algorithm from Hong Shen and Anoop Sarkar used in the paper 'Voting Between Multiple Data Representations for Text Chunking'
      Perl
      GNU Lesser General Public License v3.0
      1120Updated Mar 21, 2016Mar 21, 2016
    • Results of boilerplate removal algorithms
      Python
      5800Updated Mar 8, 2016Mar 8, 2016