Extract text from PDF files for LLM processing
Downloads: 2.3k
Stars: 4
Versions: 1
Updated: 2026-02-24
Install
npx clawhub@latest install pdf-extract
Documentation
PDF Extract
Extract text from PDF files for LLM processing. Uses pdftotext from the poppler-utils package to convert PDF documents into plain text.
Commands
Extract all text from a PDF
pdf-extract "document.pdf"
Extract text from specific pages
pdf-extract "document.pdf" --pages 1-5
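Because the skill is described as using pdftotext, a page range such as 1-5 presumably maps onto pdftotext's -f (first page) and -l (last page) options. The sketch below shows one way that mapping could look. The function names page_flags and extract_pages are hypothetical, and the real pdf-extract wrapper may be implemented differently.

```shell
# Hypothetical helper (not part of pdf-extract): convert a page range
# such as "1-5" into pdftotext's -f/-l flags. A single page such as "3"
# yields the same number for both flags.
page_flags() {
  range="$1"
  first="${range%%-*}"   # text before the first "-"
  last="${range##*-}"    # text after the last "-"
  printf '%s' "-f $first -l $last"
}

# Hypothetical wrapper: extract the given page range to standard output.
# The trailing "-" tells pdftotext to write to stdout instead of a file.
extract_pages() {
  file="$1"
  range="$2"
  # Word splitting of the page_flags output is intentional here.
  pdftotext $(page_flags "$range") "$file" -
}
```

With these definitions, extract_pages "document.pdf" 1-5 would run pdftotext -f 1 -l 5 "document.pdf" -, assuming poppler-utils is installed.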
Install the poppler-utils dependency (dnf-based systems such as Fedora)
sudo dnf install poppler-utils
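After installing, it can help to confirm that pdftotext is actually on the PATH before running the skill. The following is a minimal sketch; have_cmd and check_pdftotext are hypothetical helper names, not part of pdf-extract.

```shell
# Hypothetical helper: succeed if the named command is on the PATH.
have_cmd() {
  command -v "$1" >/dev/null 2>&1
}

# Report whether pdftotext (shipped in poppler-utils) is available.
check_pdftotext() {
  if have_cmd pdftotext; then
    echo "pdftotext found"
  else
    echo "pdftotext missing: install poppler-utils"
  fi
}
```

Calling check_pdftotext prints a one-line status message either way, so it can be run safely before the first extraction.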