🏭 PDF text extraction pipeline: self-hosted, local-first, Docker-based
python
pdf
machine-learning
ocr
pipeline
text-extraction
pdf-to-text
language-model
extract-text
parsr
pd3f
Updated Oct 13, 2023 - HTML