Skip to content

kapil-verma/NLP-on-Hindi-Text

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

24 Commits
 
 
 
 
 
 

Repository files navigation

NLP-on-Hindi-Text

It's a supoorting repository for this Kaggle notebook which is aimed for analyzing Hindi Bible text using NLP.

Contents

Hindi NLP-Data preprocessing.ipynb contains Data Cleaning of the JSON file (dataset).
Files created through the Data preprocessing are available in Results folder.

Requirements

Hindi NLP resources like Indicnlp library and Hindi SentiWordNet are required to run the Hindi NLP.ipnyb file. indic_nlp_resources can be downloded from here
indic_nlp_library can be downloded from here
Alternative: tokenizing with NLTK will also do the work
Hindi SentiWordNet can be downloaded from here

Dataset

Hindi Bible dataset in JSON format is taken from here