LANGUAGE DETECTION (Bigram model)

DESCRIPTION:

Program inputs a corpus of text documents written in different languages. It automatically detects the language of a new given text in the form of a paragraph, sentence, word, or a few letters. The bigram letter model is used with some basics of probability.

TEST:

To test examples, run the language_detection.py and input the number of example you want to test.

DATASET:

Dataset can be found in publicDataSet/public/set folder

Name		Name	Last commit message	Last commit date
Latest commit History 51 Commits
publicDataSet/public		publicDataSet/public
README.md		README.md
language_detection.py		language_detection.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

LANGUAGE DETECTION (Bigram model)

DESCRIPTION:

TEST:

DATASET:

About

Releases

Packages

Languages

Data-Science-kosta/Language-detection-bigram-model

Folders and files

Latest commit

History

Repository files navigation

LANGUAGE DETECTION (Bigram model)

DESCRIPTION:

TEST:

DATASET:

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages