Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

503 error (access denied) for Google Scholar, i.e. how to deal with "Please show you're not a robot" #113

Open
GerHobbelt opened this issue Oct 18, 2019 · 12 comments
Labels
📖documentation Improvements or additions to documentation 🕵investigate Needs further analysis to find the root cause.

Comments

@GerHobbelt
Copy link
Collaborator

Possibly relevant info from another tool which uses Scholar as well: ckreibich/scholar.py#66

Related issue: #2

@GerHobbelt GerHobbelt added 🐛bug Something isn't working 🦸‍♀️enhancement🦸‍♂️ New feature or request 🕵investigate Needs further analysis to find the root cause. labels Oct 18, 2019
@GerHobbelt GerHobbelt added 📖documentation Improvements or additions to documentation and removed 🐛bug Something isn't working 🦸‍♀️enhancement🦸‍♂️ New feature or request labels Nov 3, 2019
@GerHobbelt
Copy link
Collaborator Author

GerHobbelt commented Nov 3, 2019

Related material from others / same type of trouble for Zotero et al:

@GerHobbelt
Copy link
Collaborator Author

Analysis for Qiqqa: this depends on #2, where we should change to using Chrome+CefSharp so that the Google "I am not a bot" Captchas will be working again.

@GerHobbelt
Copy link
Collaborator Author

There's an additional work-around that I didn't know before: in Scholar, click on the "cite" link (often shown as a very large double-quote icon below the item, next to links to "related articles", etc.) and then a popup is shown in the page where you can view a few different citation formats **and below those you'll see a couple of links to 'download' the citation info in various formats, the first of which is: BibTeX. 🥳 yay!

Source: https://texblog.org/2014/04/22/using-google-scholar-to-download-bibtex-citations/

@Mocabl3nd
Copy link

I'm also experiencing this issue. I cant still get pass through the captcha :(

@GerHobbelt
Copy link
Collaborator Author

This makes it a tough nut as it's dependent on #2, which is still some way away (= adapting Qiqqa to use a totally different embedded browser than the old fireFox (a.k.a. XULrunner) in there)

Have you tried running one of the latest experimental v82* releases? I cannot guarantee that the captcha problem will be gone (I get it myself at the oddest times), but at least I don't suffer from it most of the time; automatic BibTeX access is a severe problem on my own machine (for which the work-around posted above is some help at least), so I wonder what the difference in our setups really is that makes Google even more aggressive on your box.

Please check Qiqqa version and if it's not a recent v82, fetch the latest and try that one for a while. If you're not satisfied, you can always "downgrade" back to your old version by re-installing that one: any Qiqqa installer will replace the Qiqqa software already on your machine, while user configuration will be kept intact.

@Mocabl3nd
Copy link

Mocabl3nd commented Apr 21, 2020 via email

@Leepee
Copy link

Leepee commented Jul 20, 2020

As another workaround for this, you can login to the browser, and Scholar allows you a few more cracks at it, to get some more docs cited. If you have more issues, you can also "star" the document, and then go to "My Library" and cite from there. That's what I did and managed to cite 50 something articles just now.

@Mocabl3nd
Copy link

Mocabl3nd commented Jul 28, 2020 via email

@SimonDedman
Copy link

SimonDedman commented Oct 21, 2020

I've found the VPN (tunnelbear is good since it's free but has a low allowance but you hardly need any for this purpose) + cite (quotation marks) links, works quite well, and used to for original qiqqa. However it only worked for about 50 papers (3 successful VPN nation changes) before giving up. I'll try again tomorrow.

Edit: confirmed working. Bit of a grind but I've completed my ~150 paper backlog using this method. I found better results VPN-ing to western countries (I'm based in the US and it lasted longer today with Canada, UK, Ireland, than yesterday when I included Mexico & Brazil, but that might be random variance). In any case: this is a viable approach. Though like I say, I manually check each one, I don't know if others somehow try to let it work automatically for all at once somehow.

@m0rxy
Copy link

m0rxy commented Feb 1, 2021

Hi,
just wanted to let you know that this is still an issue as one cannot complete the CAPTCHA in the sniffer browser.

Keep up the great works people :)

@GerHobbelt
Copy link
Collaborator Author

@m0rxy: Yup, still an issue. While the Scholar captcha problem won't go away, the underlying problem that's aggravating this is #2. Which will be addressed after I've taken care of the PDF background processes upgrade (PDF rendering, text extraction, OCR). Ergo: this will take a while before it is addressed, unfortunately.

@GerHobbelt
Copy link
Collaborator Author

Additional work-around has been posted in #310.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
📖documentation Improvements or additions to documentation 🕵investigate Needs further analysis to find the root cause.
Projects
None yet
Development

No branches or pull requests

5 participants