Inverted Search Index

Abstract:

An inverted index is an index data structure storing a mapping from content, such as words or numbers, to its locations in a database file, or in a document or a set of documents. The purpose of an inverted index is to allow fast full text searches, at the cost of increased processing when a document is added to the database. The inverted file may be the database file itself, rather than its index. It is the most popular data structure used in document retrieval systems, used on a large scale for example in search engines.

Description:

Forward Indexing - It is a data structure that stores mapping from documents to words i.e. directs you from document to word.

Eg. - Table of contents in book.

Inverted Indexing - It is a data structure that stores mapping from words to documents or set of documents i.e. directs you from word to document.

Eg. - Index at the back of the book.

Requirement Specification

Implementing this search program mainly consists two important functions.

Indexing
Querying

Indexing:

By Indexing, we are creating a database file which contains the index of all words. So this can be termed as Database Creation also. All the files whose index are to be created are selected and inputed to this function. All the files are parsed and words are separated and indexed. They are arranged in sorted order. For this a sorted Linked List or Hashing is used which will store the words and the related file details. The index thus created is then stored in the file as database. This file is later used in Querying. While the files are removed or added this index file is updated.

Searching:

Once the Indexing is over we have the Querying or Searching. The text to be searched is inputed which is parsed into words and those words are searched in the index file. To avoid the overhead of reading the file again, the file is converted back to a linkedList or hashing program, in which the words are searched. The information about the files which contain the words are collected. The ones with more matches are filtered and produced as the result.

Operations:

Create Database
Display Database
Search Database
Update Database
Save Database

Printing Pattern:

Display the index number and details as follows -> [index_no] <file_count> <word_count>

Saving Pattern:

Store the contents in given pattern:

#<index_no>;
;<file_count>;<file_name>;<word_count>#

Name		Name	Last commit message	Last commit date
Latest commit History 16 Commits
.gitignore		.gitignore
LL_operations.c		LL_operations.c
README.md		README.md
colors.h		colors.h
common.c		common.c
create_database.c		create_database.c
diplay_database.c		diplay_database.c
inverted_index.h		inverted_index.h
main.c		main.c
makefile		makefile
save_database.c		save_database.c
search_database.c		search_database.c
update_database.c		update_database.c

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Inverted Search Index

Abstract:

Description:

Forward Indexing - It is a data structure that stores mapping from documents to words i.e. directs you from document to word.

Eg. - Table of contents in book.

Inverted Indexing - It is a data structure that stores mapping from words to documents or set of documents i.e. directs you from word to document.

Eg. - Index at the back of the book.

Requirement Specification

Indexing:

Searching:

Operations:

Printing Pattern:

Saving Pattern:

References:

About

Releases

Packages

Languages

pksh-23514/Inverted-Search-Index

Folders and files

Latest commit

History

Repository files navigation

Inverted Search Index

Abstract:

Description:

Forward Indexing - It is a data structure that stores mapping from documents to words i.e. directs you from document to word.

Eg. - Table of contents in book.

Inverted Indexing - It is a data structure that stores mapping from words to documents or set of documents i.e. directs you from word to document.

Eg. - Index at the back of the book.

Requirement Specification

Indexing:

Searching:

Operations:

Printing Pattern:

Saving Pattern:

References:

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages