Skip to content

Latest commit

 

History

History
10 lines (8 loc) · 645 Bytes

README.md

File metadata and controls

10 lines (8 loc) · 645 Bytes

How it works

  1. Understand how the maximum number of jobs, number of pages and page indices relate to its corresponding result page url.
    • Loop page indices and query for the associated result page.
  2. For each result page, use HTML parser to extract all listing URLs.
  3. Loop and query each listing URL for its job listing page
  4. For each listing page, use HTML parser to extract desired information.