Text Classification From Wikipedia Data

The project consists of 2 parts: First, we extract the data from the Wikipedia dump files and second, we implement various text classification models like RNN, RCNN, Self-Attention and BERT. This last model is the only one which has been implemented using Google colab (.ipynb file).

Text Classification is one of the basic and most important task of Natural Language Processing. In this repository, we focus on one such text classification task, more in detail in the problem of: Detection of biased language.

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
data_processing		data_processing
src		src
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Text Classification From Wikipedia Data

About

Releases

Packages

Languages

GuillermoJaca/Text_Classification_From_Wikipedia_Data

Folders and files

Latest commit

History

Repository files navigation

Text Classification From Wikipedia Data

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages