-
Notifications
You must be signed in to change notification settings - Fork 25
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Multiple URLs almost works #60
Comments
So I have just confirmed this bug:
This crawl checks the visible link ( This crawl checks links on the public page, as well as visits |
The problem is that the links in |
This should fix it: https://github.com/dantleech/fink/pull/72/files |
Allow the inclusion of additional links
Thanks! It now works like this:
|
I spoke a little too soon ... It does work and checks external links, which is great. But it seems to only do this on the first page, but doesn't follow paginated links, like It seems like it transfers the querystring to the base URL in the crawl process, resulting in something like this: Where |
UPDATE: It works! I changed the pager (this is in Drupal) from |
It's possible to check multiple URLs with this:
php fink.phar https://example.org https://example.org/hiddenpage --max-external-distance=1
External links on the second URL are checked, but the crawler doesn't seem to follow internal links. My use case is using
example.org/hiddenpage
as a list of internal links, to guide the crawler to specific pages, but fink doesn't seem to follow these links from thehiddenpage
, but only checks if the links work, and returning astatus: 200
.The text was updated successfully, but these errors were encountered: