Skip to content

A Python implementation of a tf-idf based search engine for a subset of the English Wikipedia

License

Notifications You must be signed in to change notification settings

Giuseppe-Della-Corte/WikiSearchEngine

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

WikiSearchEngine

A Python implementation of a tf-idf based search engine for a subset of the English Wikipedia.

To run the program

You have to run both commands below. All files must be placed in the same folder.

  1. 'python generator.py parsed_xml'
  2. 'python query.py '

Information

The English Wikipedia sub-set has been reduced to make the upload easier.

Authors

WikiSearchEngine was developed as a group project for the Information Retrieval course, Master in Language Technology, Uppsala University.

Authors: Giuseppe Della Corte, Ugo Loobuyck

Course: Information Retrieval

Master of Language Technology, Uppsala University

About

A Python implementation of a tf-idf based search engine for a subset of the English Wikipedia

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages