This repository is a Go implementation of the paper "DOM based content extraction via text density". I have tested it on Korean web pages.
gh repo clone minarc/godensity
cd godensity
go test
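
For readers new to the idea, here is a minimal sketch of the basic text density measure as I understand it from the paper: the number of characters under a node divided by the number of tags under it, with a tag count of zero treated as one. The `Node` type and the `el`, `text`, and `TextDensity` helpers are hypothetical names made up for this illustration. They are not this repository's API, and the sketch leaves out the paper's link-aware composite density and its thresholding step.

```go
package main

import (
	"fmt"
	"unicode/utf8"
)

// Node is a hypothetical, minimal DOM node used only for this sketch.
// An empty Tag marks a text node.
type Node struct {
	Tag      string
	Text     string
	Children []*Node
}

// el builds an element node; text builds a text node.
func el(tag string, children ...*Node) *Node { return &Node{Tag: tag, Children: children} }
func text(s string) *Node                    { return &Node{Text: s} }

// count returns the number of characters and the number of element tags
// below n. The node n itself is not counted as a tag.
func count(n *Node) (chars, tags int) {
	if n.Tag == "" {
		// Count runes, not bytes, so Korean text is not over-counted.
		return utf8.RuneCountInString(n.Text), 0
	}
	for _, c := range n.Children {
		cc, ct := count(c)
		chars += cc
		tags += ct
		if c.Tag != "" {
			tags++
		}
	}
	return chars, tags
}

// TextDensity returns characters per tag for the subtree rooted at n.
// A subtree with no tags is treated as having one, to avoid dividing by zero.
func TextDensity(n *Node) float64 {
	chars, tags := count(n)
	if tags == 0 {
		tags = 1
	}
	return float64(chars) / float64(tags)
}

func main() {
	article := el("div",
		el("p", text("가나다라마바사아자차")),
		el("p", text("abcdefghij")),
	)
	nav := el("ul",
		el("li", el("a", text("Home"))),
		el("li", el("a", text("News"))),
		el("li", el("a", text("Help"))),
	)
	fmt.Println("article:", TextDensity(article))
	fmt.Println("nav:", TextDensity(nav))
}
```

Content blocks tend to have much text and few tags, so they score higher than navigation or link lists, which is what the extraction relies on.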