Extract annotations (highlights and scribbles) from PDF, EPUB, and notebooks marked with reMarkable tablets. Export to Markdown, PDF, PNG, SVG
-
Updated
May 26, 2024 - Python
Extract annotations (highlights and scribbles) from PDF, EPUB, and notebooks marked with reMarkable tablets. Export to Markdown, PDF, PNG, SVG
A deep-learning powered accessibility application which turns pdfs into audio files. Featuring ocr improvement and tts with inflection!
A powerful and user-friendly tool based on OCRmyPDF, offering a seamless GUI for conversion of image-based PDFs into searchable text.
A watchdog for OCRMyPDF written in go
Simple frontend for OCRmyPDF (Windows only).
Moved to codeberg.org - https://codeberg.org/DecaTec/OCRmyFiles - Bash script for adding a text layer to PDF files and converting images in PDFs (with OCR).
A streamlit based webapp to detect scanned/digital PDFs from a large corpus as well as allow the user to OCR the scanned docs
ocrmyPDF_Windows is inspired by jbarlow83's ocormtpdf. https://github.com/jbarlow83/OCRmyPDF.
Flask application for OCR and extraction of text from documents with support for repository applications
TIFF Image Convert to OCR PDF
Automated PDF translation & redaction with OCR, PyMuPDF, and AI translation. Preserves layout, font, and colors while supporting selective redaction/masking. English ↔ Hindi supported out-of-the-box. Docker-ready.
Notes during the learning of OCRmyPDF, a Tesseract based Optical Character Recognition(OCR) software
An attempt to make OCR
Add a description, image, and links to the ocrmypdf topic page so that developers can more easily learn about it.
To associate your repository with the ocrmypdf topic, visit your repo's landing page and select "manage topics."