Efficient Text Localization Algorithm, Image Inversion Detection of Scanned Documents & Language Identification based on Shape Context and Traditional Computer Vision.
Updated Dec 18, 2021 - Python
Spoken language identification (LID) systems automatically detect the language of speech data. Among the many methods that can be applied to this classification task, modern machine learning and deep learning approaches have been reported as effective. A previous study approached the problem of spoken language identification in…
A Python implementation of language identification using Cavnar and Trenkle’s approach and the WiLI-2018 database
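For readers unfamiliar with the Cavnar and Trenkle approach named above, the idea is to rank each language's most frequent character n-grams and compare a document's ranking against each language profile with an "out-of-place" distance. The following is a minimal sketch of that idea, not the repository's code: the function names, the `max_n` and `top_k` defaults, and the two toy training sentences are all assumptions made for illustration.

```python
from collections import Counter


def ngram_profile(text, max_n=3, top_k=300):
    """Return the top_k most frequent character n-grams (n = 1..max_n), ranked."""
    # Pad word boundaries with "_" so n-grams capture word starts and ends.
    padded = "_" + "_".join(text.lower().split()) + "_"
    counts = Counter(
        padded[i:i + n]
        for n in range(1, max_n + 1)
        for i in range(len(padded) - n + 1)
    )
    # Sort by descending count; break ties alphabetically for determinism.
    ranked = sorted(counts.items(), key=lambda kv: (-kv[1], kv[0]))
    return [gram for gram, _ in ranked[:top_k]]


def out_of_place(doc_profile, lang_profile):
    """Sum of rank differences; n-grams missing from lang_profile get a fixed penalty."""
    rank = {gram: r for r, gram in enumerate(lang_profile)}
    max_penalty = len(lang_profile)
    return sum(
        abs(r - rank[gram]) if gram in rank else max_penalty
        for r, gram in enumerate(doc_profile)
    )


def identify(text, lang_profiles):
    """Return the language whose profile is closest to the document's profile."""
    doc = ngram_profile(text)
    return min(lang_profiles, key=lambda lang: out_of_place(doc, lang_profiles[lang]))


# Toy training data (hypothetical); a real system would use a corpus such as WiLI-2018.
samples = {
    "en": "the quick brown fox jumps over the lazy dog and then runs away",
    "de": "der schnelle braune fuchs springt über den faulen hund und läuft weg",
}
profiles = {lang: ngram_profile(text) for lang, text in samples.items()}
print(identify("the dog runs over the fox", profiles))
```

With profiles built from realistic amounts of text per language, the same `identify` call can be applied unchanged; only the construction of `profiles` would differ.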
A language identification model using pretrained FastText embeddings from HuggingFace, accurately detecting languages in text data for enhanced text classification and NLP applications.
This repository contains a CNN architecture to classify 13 Indian languages from spoken utterances.
Math125A_LanguageIdentification
Indonesian-English code-mixed Twitter dataset
A small and fast language identification model powered by fastText
Leveraging Latent Representations for Indian Language Identification
🌐 Language identification for Scandinavian languages
Several benchmarks on sentence splitting and language identification
Language Identification Python script
An Android application aimed at assisting tourists in a foreign country.
FastText PyTorch version
Language Identification using Naïve Bayes
This repository presents an approach to predict the language in which a document is written. In particular, the proposed approach transforms a text into character n-gram features and uses them to support the predictive power of a machine-learned classifier. Experimental results show that it is capable of identifying 14 languages with high accura…
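The pipeline described above (character n-gram features fed to a machine-learned classifier) can be sketched in a few lines. The description does not say which classifier the repository uses, so the multinomial naive Bayes model below is an assumption, as are the class name `NgramNaiveBayes`, the trigram size, the smoothing constant, and the toy training sentences.

```python
import math
from collections import Counter, defaultdict


def char_ngrams(text, n=3):
    """Split text into overlapping character n-grams, padded with spaces."""
    text = f" {text.lower()} "
    return [text[i:i + n] for i in range(len(text) - n + 1)]


class NgramNaiveBayes:
    """Multinomial naive Bayes over character n-grams with add-alpha smoothing."""

    def __init__(self, n=3, alpha=1.0):
        self.n = n
        self.alpha = alpha
        self.counts = defaultdict(Counter)  # label -> n-gram counts
        self.doc_counts = Counter()         # label -> number of training texts
        self.vocab = set()                  # all n-grams seen in training

    def fit(self, texts, labels):
        for text, label in zip(texts, labels):
            grams = char_ngrams(text, self.n)
            self.counts[label].update(grams)
            self.doc_counts[label] += 1
            self.vocab.update(grams)
        return self

    def predict(self, text):
        grams = char_ngrams(text, self.n)
        total_docs = sum(self.doc_counts.values())
        vocab_size = len(self.vocab)
        best_label, best_score = None, -math.inf
        for label, gram_counts in self.counts.items():
            total = sum(gram_counts.values())
            # Log prior plus the sum of smoothed log likelihoods.
            score = math.log(self.doc_counts[label] / total_docs)
            for gram in grams:
                score += math.log(
                    (gram_counts[gram] + self.alpha)
                    / (total + self.alpha * vocab_size)
                )
            if score > best_score:
                best_label, best_score = label, score
        return best_label


# Toy training set (hypothetical); real use needs far more text per language.
texts = [
    "the quick brown fox jumps over the lazy dog",
    "this is a simple english sentence",
    "el rapido zorro marron salta sobre el perro perezoso",
    "esta es una frase sencilla en espanol",
]
labels = ["en", "en", "es", "es"]
model = NgramNaiveBayes().fit(texts, labels)
print(model.predict("the dog is lazy"))
```

Naive Bayes is used here only because it needs no external libraries; any classifier that accepts n-gram count features could take its place.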
url2lang infers the language of a document from its URL
Language identification with as few characters as possible