m3yrin/NTM

Neural Topic Modeling

Testing of Neural Topic Modeling for Japanese articles

This repository is intended to test the performance of a Neural Topic Model (NTM) and Gensim LDA on Japanese articles. It contains a PyTorch implementation of the Neural Topic Model and test code for Gensim LDA.

Dataset

livedoor ニュースコーパス / livedoor News Corpus
https://www.rondhuit.com/download.html#ldcc
CC BY-ND 2.1 JP
https://creativecommons.org/licenses/by-nd/2.1/jp/

Neural Topic Model

Open In Colab

Reference

Memo

  • Parts of yuewang-cuhk's NTM implementation are used.
  • tdual's script is used extensively.
  • The Janome tokenizer is used instead of MeCab.
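
As a rough illustration of the tokenization step mentioned above, the sketch below extracts nouns from Japanese text with a Janome-style tokenizer. The names `make_janome_tokenizer` and `extract_nouns`, and the choice to keep only nouns, are assumptions made for this example and are not taken from the notebooks in this repository. The tokenizer is passed in as an argument, so any object with a Janome-compatible `tokenize` method works.

```python
def make_janome_tokenizer():
    """Create a Janome tokenizer (requires `pip install janome`)."""
    # Imported lazily so that extract_nouns itself has no hard dependency.
    from janome.tokenizer import Tokenizer
    return Tokenizer()


def extract_nouns(text, tokenizer):
    """Return the nouns found in `text` (hypothetical helper for illustration).

    `tokenizer.tokenize(text)` must yield tokens exposing `surface`,
    `base_form`, and `part_of_speech`, as Janome tokens do.
    """
    nouns = []
    for token in tokenizer.tokenize(text):
        # Janome reports part of speech as a comma-separated string,
        # e.g. "名詞,一般,*,*"; the first field is the major category.
        major_pos = token.part_of_speech.split(",")[0]
        if major_pos != "名詞":  # keep nouns only (an assumption of this sketch)
            continue
        # Fall back to the surface form if no base form is available.
        word = token.base_form if token.base_form != "*" else token.surface
        nouns.append(word)
    return nouns
```

With Janome installed, a call might look like `extract_nouns("今日は東京でニュースを読んだ。", make_janome_tokenizer())`. Unlike MeCab, Janome is pure Python, so it can be installed with `pip` alone.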

A GPU instance is recommended. If training is too slow, check the runtime type of your Google Colaboratory instance.
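
One quick way to see whether a GPU is actually available to the notebook is to ask PyTorch directly. The following is a minimal sketch; the helper name `select_device` is hypothetical, and the fallback to `"cpu"` when PyTorch is not installed is an assumption added so that the snippet runs anywhere.

```python
import importlib.util


def select_device():
    """Return "cuda" if PyTorch can see a GPU, otherwise "cpu"."""
    if importlib.util.find_spec("torch") is None:
        # PyTorch is not installed here, so GPU training is not possible.
        return "cpu"
    import torch
    return "cuda" if torch.cuda.is_available() else "cpu"


device = select_device()
print(f"Training would run on: {device}")
```

If this prints `cpu` on Google Colaboratory, the runtime type can be changed under Runtime > Change runtime type. A PyTorch model and its input tensors can then be moved to the selected device with `.to(device)`.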

LDA

Open In Colab

Reference

http://tdual.hatenablog.com/entry/2018/04/09/133000

Memo

  • tdual's LDA script is used extensively.
  • The Janome tokenizer is used instead of MeCab.
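
For orientation, training an LDA model with Gensim on already tokenized documents can be sketched as below. This is not the notebook's code: the function name `train_lda`, the parameter values, and the tiny `SAMPLE_DOCS` corpus are made up for illustration. Real input would be token lists produced from the livedoor News Corpus.

```python
# Made-up, pre-tokenized documents used only to make the sketch runnable.
SAMPLE_DOCS = [
    ["映画", "俳優", "公開", "監督"],
    ["映画", "監督", "作品", "公開"],
    ["試合", "選手", "監督", "勝利"],
    ["選手", "試合", "得点", "勝利"],
]


def train_lda(tokenized_docs, num_topics=2, passes=10, seed=0):
    """Fit a Gensim LDA model (hypothetical helper; requires `pip install gensim`)."""
    from gensim.corpora import Dictionary
    from gensim.models import LdaModel

    # Map each distinct token to an integer id.
    dictionary = Dictionary(tokenized_docs)
    # Convert each document to a bag of words: a list of (token_id, count) pairs.
    corpus = [dictionary.doc2bow(doc) for doc in tokenized_docs]
    lda = LdaModel(
        corpus=corpus,
        id2word=dictionary,
        num_topics=num_topics,
        passes=passes,
        random_state=seed,  # fixed seed so repeated runs are comparable
    )
    return lda, dictionary
```

After `lda, dictionary = train_lda(SAMPLE_DOCS)`, calling `lda.show_topic(0, topn=5)` returns the five highest-weighted `(word, probability)` pairs for topic 0, which is a convenient way to inspect the topics by eye.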
