Unsupported site with two languages (?) #58

Open
Dzhuks opened this issue Sep 9, 2024 · 1 comment
Labels
unsupported-site Sites that reject requests from Slurp or use scripts that prevent Slurp from working

Comments

Dzhuks commented Sep 9, 2024

I ran into a problem while trying to extract text from articles on a Korean university's site. I wanted to slurp an article written in English, but it failed. I guess the site has two versions, one for Korean speakers and one for English speakers, so when I slurp the page I get the Korean text. You can see an example of this in the article at this URL: Welcome message from the Head school.

I'm by no means complaining. Slurp is an amazing plugin. I just wanted to know whether there is a solution to this issue.

Obsidian: 1.6.7
OS: Windows 11
Slurp: 0.1.12

Dzhuks changed the title from "Unsupported site (?)" to "Unsupported site with two languages (?)" Sep 9, 2024
@inhumantsar
Owner

It looks like they use cookies or something similar to determine which language to load, without checking the browser's locale.

Slurp can't do much about that, since it won't have the same cookies as your browser and has no way to click the button before slurping the data. Many sites carry locale info in the URL instead, e.g. http://.../lang=en_US, for this exact reason.
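
To illustrate the URL-based approach mentioned above, here is a minimal sketch of adding a locale to a page URL before fetching it. The helper name `withLocaleParam`, the `lang` parameter name, and the `example.com` URLs are all assumptions made for illustration. This is not part of Slurp, and it only helps on sites that actually read the locale from the URL, which the site in this issue apparently does not.

```typescript
// Hypothetical helper, not part of Slurp's API.
// Returns a copy of pageUrl with a locale query parameter set.
// "lang" is an assumed parameter name; real sites differ (or ignore it).
function withLocaleParam(
  pageUrl: string,
  locale: string,
  paramName: string = "lang"
): string {
  const url = new URL(pageUrl);
  // set() replaces an existing value or appends the parameter if missing.
  url.searchParams.set(paramName, locale);
  return url.toString();
}

// Usage: the parameter is appended alongside any existing query string.
const englishUrl = withLocaleParam("https://example.com/article?id=7", "en_US");
console.log(englishUrl);
```

Because the locale travels in the URL itself, a tool without the browser's cookies would request the same language version the reader sees, provided the site supports such a parameter.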

This is something that the browser extension may be able to handle though. I'll link this issue there for future reference.

Thanks for the report!

@inhumantsar inhumantsar added the unsupported-site Sites that reject requests from Slurp or use scripts that prevent Slurp from working label Sep 9, 2024

2 participants