Python PDF Web Scraper

A simple Python script that scrapes web pages for PDF files and downloads them to a local directory.

Getting Started

Clone this repository.
Install Python.
Install Pip.
Install pip installl beautifulsoup4 and pip install urllib3 in your terminal.
Place the web page URL and output file location in the main.py file here:

# Define your URL
url = "https://yourWebsiteURL"

#If there is no such folder, the script will create one automatically
folder_location = r'/YOUR/OUTPUT/FILE/PATH'

Run the script: python main.py
PDF files will be downloaded to your local directory.

Resources

License

This project is released under the terms of The Unlicense, which allows you to use, modify, and distribute the code as you see fit.

The Unlicense removes traditional copyright restrictions, giving you the freedom to use the code in any way you choose.
For more details, see the LICENSE file in this repository.

Credits

Author: Scott Grivner
Email: scott.grivner@gmail.com
Website: scottgrivner.dev
Reference: Main Branch

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
docs/images		docs/images
.gitignore		.gitignore
LICENSE		LICENSE
PRG.md		PRG.md
README.md		README.md
main.py		main.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Python PDF Web Scraper

Table of Contents

Getting Started

Resources

License

Credits

About

Releases

Packages

Languages

License

scottgriv/python-pdf_web_scraper

Folders and files

Latest commit

History

Repository files navigation

Python PDF Web Scraper

Table of Contents

Getting Started

Resources

License

Credits

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages