A repository for the Modern Java Technologies Course @ FMI
Updated Oct 9, 2019 - Java
An implementation of a search engine
A program that preprocesses a collection of documents to calculate the frequency of the most common terms and identify the keywords of each document. It runs twice: first without stemming or stopword removal, then with both techniques applied.
This is a simple Spring Boot project which removes stop words from a text file.
Processing, Retrieving, and Ranking Documents in a Wikipedia collection
Lucene token filter that removes trailing stopwords from shingles.
📒 An Aho-Corasick algorithm based string-searching utility for Java. It supports tokenization, case-insensitive matching, and text replacement, so you can use it to find keywords in an article, filter sensitive words, etc.
jieba analysis plugin for Elasticsearch 7.0.0, 6.4.0, 6.0.0, 5.4.0, 5.3.0, 5.2.2, 5.2.1, 5.2, 5.1.2, 5.1.1
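Several of the projects listed here filter stopwords before counting terms. As a rough illustration of that idea only (not taken from any of these repositories), here is a minimal Java sketch. The class name `StopwordDemo`, the method `termFrequencies`, and the tiny `STOP_WORDS` set are all hypothetical; real stopword lists are much longer, and no stemming is performed.

```java
import java.util.LinkedHashMap;
import java.util.Locale;
import java.util.Map;
import java.util.Set;

class StopwordDemo {
    // Hypothetical, deliberately tiny stopword list for illustration only.
    static final Set<String> STOP_WORDS =
            Set.of("a", "an", "and", "the", "of", "to", "in", "is");

    // Counts how often each term occurs, skipping any token in stopWords.
    // Pass an empty set to count every token.
    static Map<String, Integer> termFrequencies(String text, Set<String> stopWords) {
        Map<String, Integer> counts = new LinkedHashMap<>();
        // Lowercase, then split on runs of characters that are not letters or digits.
        for (String token : text.toLowerCase(Locale.ROOT).split("[^\\p{L}\\p{N}]+")) {
            if (token.isEmpty() || stopWords.contains(token)) {
                continue;
            }
            counts.merge(token, 1, Integer::sum);
        }
        return counts;
    }

    public static void main(String[] args) {
        String text = "The cat and the hat: the cat sat in the hat.";
        // First pass: keep stopwords.
        System.out.println(termFrequencies(text, Set.of()));
        // Second pass: drop stopwords.
        System.out.println(termFrequencies(text, STOP_WORDS));
    }
}
```

Comparing the two passes shows how a frequent function word such as "the" dominates the raw counts and disappears once the stopword set is applied.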