
foenpeng/multithreaded-web-scraper-of-Indeed.com

 
 


Multi-threaded Web Scraping Indeed.com for Most-Wanted Data Science Skills

Introduction

This is a multi-threaded web scraper for Indeed.com. It collects data science job ads for a user-defined location (for example, Pittsburgh, PA) and produces a bar chart of the most-wanted data science skills in that location's job market.

  • The program is written in Python 3.
  • A single-threaded scraper, which was my first version of this program, is also provided.
  • The multi-threaded scraper is 862.79% faster than the single-threaded scraper in the Pittsburgh case shown below.
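The speed-up comes from overlapping the time spent waiting on network responses. The sketch below illustrates the general idea with Python's standard `concurrent.futures` module; it is not the repository's actual code. The function `fetch_job_ad`, the example URLs, and the worker count are hypothetical, and the download is simulated with a short sleep so the example runs without network access.

```python
import time
from concurrent.futures import ThreadPoolExecutor


def fetch_job_ad(url):
    """Hypothetical stand-in for downloading one job ad.

    A real scraper would issue an HTTP request here; the sleep
    only mimics network latency.
    """
    time.sleep(0.01)
    return f"text of {url}"


def scrape_sequential(urls):
    # Single-threaded version: each download waits for the previous one.
    return [fetch_job_ad(url) for url in urls]


def scrape_threaded(urls, max_workers=8):
    # Multi-threaded version: up to max_workers downloads are in flight
    # at once. pool.map returns results in the same order as urls.
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        return list(pool.map(fetch_job_ad, urls))


if __name__ == "__main__":
    urls = [f"https://example.com/job/{i}" for i in range(16)]

    start = time.perf_counter()
    scrape_sequential(urls)
    sequential_time = time.perf_counter() - start

    start = time.perf_counter()
    scrape_threaded(urls)
    threaded_time = time.perf_counter() - start

    print(f"sequential: {sequential_time:.3f}s, threaded: {threaded_time:.3f}s")
```

Threads suit this workload because scraping is I/O-bound: while one thread waits for a response, others can proceed. The actual gain depends on network latency, the number of workers, and any rate limiting on the server side.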

Visualization and Results

[Figure: pittsburgh_081316, bar chart of the most-wanted data science skills in Pittsburgh, PA]

  • These results were generated on August 12, 2016.
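As a rough illustration of how such a chart could be derived from scraped ads, the sketch below counts the percentage of ads that mention each skill. The skill list, the tokenizing rule, and the function names are assumptions made for this example; the repository's own keyword list and matching logic may differ.

```python
import re
from collections import Counter

# Hypothetical skill keywords; the real scraper may track a different set.
SKILLS = ["python", "r", "sql", "hadoop", "spark"]


def skills_in_ad(text, skills=SKILLS):
    """Return the set of skills mentioned at least once in one ad."""
    # Lowercase and split into word-like tokens so that "r" only matches
    # the standalone word, not the letter inside other words.
    words = set(re.findall(r"[a-z+#]+", text.lower()))
    return {skill for skill in skills if skill in words}


def skill_percentages(ads, skills=SKILLS):
    """Return {skill: percentage of ads that mention it}."""
    if not ads:
        return {skill: 0.0 for skill in skills}
    counts = Counter()
    for ad in ads:
        # Each ad contributes at most one count per skill.
        counts.update(skills_in_ad(ad, skills))
    return {skill: 100.0 * counts[skill] / len(ads) for skill in skills}
```

The resulting dictionary can be passed to any plotting library to draw the bar chart, for example sorted by percentage in descending order.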

Installation and Usage

For example, to run the scraper for Seattle, WA:

python3 run_scraper.py --city Seattle --state WA
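For readers curious how the `--city` and `--state` flags might be handled, here is a minimal sketch using the standard `argparse` module. Only the two flag names come from the command above; the function names, help text, and the way the location string is assembled are assumptions, not the contents of `run_scraper.py`.

```python
import argparse


def parse_args(argv=None):
    """Parse the --city and --state flags shown in the command above."""
    parser = argparse.ArgumentParser(
        description="Scrape data science job ads for one location."
    )
    parser.add_argument("--city", required=True, help="city name, e.g. Seattle")
    parser.add_argument("--state", required=True, help="state code, e.g. WA")
    # argv=None makes argparse read sys.argv[1:], as on a normal run.
    return parser.parse_args(argv)


def location_query(city, state):
    """Build a 'City, ST' string (hypothetical helper for the search query)."""
    return f"{city.strip()}, {state.strip().upper()}"
```

With this sketch, `parse_args(["--city", "Seattle", "--state", "WA"])` yields an object whose `city` and `state` attributes can be passed to `location_query`.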
