Example of parsing(crawling) visitnsw.com site
Installation:
- Download the latest version of Geckodriver
- Unzip and put geckodriver into project directory
- Go to the visitnsw.com and choose your category
- Copy current url
- Change domain (in main.py) to current url
- Save main.py
- type pip3 install -r requirements.txt
- python3 main.py and wait :)
- Save extracted data into result.xls file in the project directory
name | site | tel | |||
---|---|---|---|---|---|
MIRAGE: DISUSED PUBLIC PROPERTY IN TAIWAN (PHOTOGRAPHY EXHIBITION) - ALBURY WODONGA | http://www.mamalbury.com.au/see-and-do/exhibitions/yao-jui-chung-lost-society-document-sandy-hsiu-chih-lo-mirage | mama@alburycity.nsw.gov.au | (02) 0000 0000 | https://www.facebook.com/MAMAlbury/ | https://twitter.com/MAMAlbury |