Pure TypeScript, cross-platform module for extracting text, images, and tabular data from PDFs. Run 🤗 directly in your browser or in Node.js
Updated May 4, 2026 - TypeScript
ODL-first PDF ingestion PoC with optional eSearch-OCR v5 repair and a Vue preview/export UI.