ocrmypdf

Here are 29 public repositories matching this topic...

lucasrla / remarks

Extract annotations (highlights and scribbles) from PDF, EPUB, and notebooks marked with reMarkable tablets. Export to Markdown, PDF, PNG, SVG

markdown pdf ocr highlighting annotations pdf-converter epub zotero obsidian ocrmypdf svg-images pymupdf remarkable-tablet roamresearch

Updated May 26, 2024
Python

CypherousSkies / reading-for-listeners

Star

A deep-learning powered accessibility application which turns pdfs into audio files. Featuring ocr improvement and tts with inflection!

python nlp pdf ocr deep-learning transformers tts ocrmypdf bert mozilla-tts

Updated Feb 17, 2025
Python

soham-1 / fastapi_pdfextractor

Star

An api using fastapi for extracting the text content of pdf using pdfminer. It also supports scanned images in pdf's by using tesseract and ocrmypdf.

tesseract ocrmypdf pdfminer fastapi

Updated Jun 18, 2021
Python

Achiwilms / OCR-Wizard

Star

A powerful and user-friendly tool based on OCRmyPDF, offering a seamless GUI for conversion of image-based PDFs into searchable text.

python pdf ocrmypdf ocr-recognition pdf-ocr-extraction ocr-python searchable-pdf ocr-pdf pdf-ocr

Updated Oct 28, 2023
Python

bernmic / ocrmypdf-watchdog

Star

A watchdog for OCRMyPDF written in go

go docker golang docker-compose ocrmypdf

Updated Feb 12, 2022
Go

sjain882 / OCRmyPDF-WinGUI

Star

Simple frontend for OCRmyPDF (Windows only).

desktop-app windows pdf csharp dotnet wpf pdf-document pdf-documents ocrmypdf ocr-pdf search-pdf

Updated Jul 9, 2025
C#

DecaTec / OCRmyFiles

Star

Moved to codeberg.org - https://codeberg.org/DecaTec/OCRmyFiles - Bash script for adding a text layer to PDF files and converting images in PDFs (with OCR).

bash pdf ocr images bash-script ocrmypdf

Updated Feb 13, 2022
Shell

prateekralhan / Scanned-PDFs-checker

Sponsor

Star

A streamlit based webapp to detect scanned/digital PDFs from a large corpus as well as allow the user to OCR the scanned docs

ghostscript python3 pdf-document ocrmypdf opensourceforgood pytesseract streamlit

Updated Aug 11, 2022
Python

hansmi / baamhackl

Star

Execute command when files are moved to a directory.

cli golang ocr scanner watchman inotify ocrmypdf

Updated Oct 5, 2025
Go

lakshay1296 / ocrmyPDF_Windows

Star

ocrmyPDF_Windows is inspired by jbarlow83's ocormtpdf. https://github.com/jbarlow83/OCRmyPDF.

python windows flask ocr tesseract windows-10 python3 tesseract-ocr ocrmypdf ocr-recognition tesseract-engine ocr-python tesseract-4

Updated Jan 12, 2020
Python

DenBeke / ocrmymail

Star

OCRmyMail is an SMTP server relay that adds an OCR text layer to PDF mail attachments and sends them to the original recipient.

docker golang pdf mail ocr server smtp ocrmypdf

Updated May 14, 2021
Go

procesaur / TExASe

Star

Flask application for OCR and extraction of text from documents with support for repository applications

api flask ocr repository tika text-extraction tesseract-ocr ocrmypdf

Updated Sep 7, 2023
Python

TheComputeGuy / PDFOCRtool

Star

Add an OCR layer to *any* PDF

python pdf ocr tesseract ocrmypdf pdftopng

Updated Sep 1, 2021
Python

hansmi / dossier

Star

Extract textual information from PDF documents

golang pdf ocr extraction ocrmypdf paperless

Updated Oct 1, 2025
Go

pddd / GUI4OCRMyPDF

Star

Swift UI GUI for ocrmypdf

swift ocr ocrmypdf swiftui

Updated May 27, 2025
Swift

Rajasekaran85 / Python-TIFF-to-OCR-PDF

Star

TIFF Image Convert to OCR PDF

pdf ghostscript ocr glob pdf-converter tesseract-ocr ocrmypdf pypdf2

Updated Mar 9, 2024
Python

MohitGupta0123 / HIN_EN_PDF_Translator

Star

Automated PDF translation & redaction with OCR, PyMuPDF, and AI translation. Preserves layout, font, and colors while supporting selective redaction/masking. English ↔ Hindi supported out-of-the-box. Docker-ready.

python docker ocr tesseract ocrmypdf googletranslate fitz pymupdf huggingface pdf-translation pdf-redaction devnagari hindi-english-multilingual-translation

Updated Aug 17, 2025
Python

brlin-tw / learning-ocrmypdf

Star

Notes during the learning of OCRmyPDF, a Tesseract based Optical Character Recognition(OCR) software

linux notes optical-character-recognition ocrmypdf

Updated Apr 3, 2020

HappyBravo / PDF_to_Text

Star

An attempt to make OCR

javascript python html ocr tesseract tesseract-ocr ocrmypdf pdftotext ocr-python

Updated Jun 24, 2023
HTML

aidayang / OCRmyPDF-OneClick

Star

OCRmyPDF将PDF OCR识别转换为可搜索可复制文档，windows版免安装部署一键启动整合包

python pdf ocr image-processing tesseract ocrmypdf

Updated May 2, 2025
Python

Improve this page

Add a description, image, and links to the ocrmypdf topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the ocrmypdf topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ocrmypdf

Here are 29 public repositories matching this topic...

lucasrla / remarks

CypherousSkies / reading-for-listeners

soham-1 / fastapi_pdfextractor

Achiwilms / OCR-Wizard

bernmic / ocrmypdf-watchdog

sjain882 / OCRmyPDF-WinGUI

DecaTec / OCRmyFiles

prateekralhan / Scanned-PDFs-checker

hansmi / baamhackl

lakshay1296 / ocrmyPDF_Windows

DenBeke / ocrmymail

procesaur / TExASe

TheComputeGuy / PDFOCRtool

hansmi / dossier

pddd / GUI4OCRMyPDF

Rajasekaran85 / Python-TIFF-to-OCR-PDF

MohitGupta0123 / HIN_EN_PDF_Translator

brlin-tw / learning-ocrmypdf

HappyBravo / PDF_to_Text

aidayang / OCRmyPDF-OneClick

Improve this page

Add this topic to your repo