ocr

Projects with this topic

kit-lang / packages / kit-kreuzberg

Kreuzberg document extraction bindings for Kit

kreuzberg document extraction pdf ocr text ffi kit-lang

0

Updated May 23, 2026

0 0 0 0

PELLE / A haute voix

"A haute voix !" is an accessibility tool that extracts text from a PDF and renders it in a form that browser TTS tools can read aloud clearly. All processing is done locally: documents are handled on your own computer.

website ocr JavaScript HTML/CSS tts accessibility

0

Updated May 18, 2026

0 0 0

François Gandolfi / iiif2ocr

Python script that extracts images from a IIIF manifest, processes them with Tesseract OCR, and generates output files in various formats. It is designed for libraries, archives, and digitization projects that need raw optical character recognition of printed text.

iiif ocr

0

Updated May 15, 2026

0 0 0 0

Mayan EDMS / Mayan EDMS

Advanced enterprise Free Open Source DMS (document management system).

Django Python ocr document man... pdf indexing dms enterprise workflow business business-pro... antivirus

671

Updated Apr 29, 2026

671 312 15 172

Thippesh Mugalikatte Siddappa / clinical-nlp-pipeline

A modular Clinical NLP Pipeline built to process and analyze unstructured medical text using both traditional machine learning and transformer-based approaches.

The project combines multiple components including OCR, text preprocessing, feature engineering, classification, named entity recognition, and visualization into a single end-to-end pipeline. It supports extracting clinical insights from raw documents and predicting medical categories using both TF-IDF + SVM and BERT-based models.

The system was designed and implemented as a structured Python project, with each stage separated into independent modules for scalability and maintainability.

Key Highlights
Built an end-to-end NLP pipeline for clinical text processing.
Implemented SVM (≈51% accuracy) and BERT (≈77% accuracy) models.
Integrated OCR for extracting text from medical documents.
Performed Named Entity Recognition (NER) on clinical data.
Designed modular architecture (src/) for clean code organization.
Exported outputs for visualization and dashboard integration.

Python machine lear... data science NLP(Natural ... BERT bart ocr SVM text classif... TFIDF named entity... deep learning

0

Updated Apr 26, 2026

0 0 0 0

ALEXENDROS.me / DocuMind

DocuMind is an automatic document organization system for the Linux desktop, powered by local AI (Ollama/Llama3 or HuggingFace). It processes PDFs, images, video, audio, and code: it extracts text (including via OCR), transcribes, analyzes content, and classifies and archives files according to ISO 15489 (invoices, legal, work, personal, multimedia). It detects duplicates, records an audit trail in SQLite, and prioritizes offline privacy.

Built in Python 3.10+ with PyMuPDF, Tesseract, Vosk/Whisper, multiprocessing, and optimizations (xxHash, caching, GPU), it demonstrates expertise in local and multimodal LLM integration, parallel processing, and scalable modular architecture, and is evolving toward a PyQt6 GUI with drag-and-drop, full-text search, and RPM/Flatpak packaging.

Linux Python local-ai Document-Man... ollama ocr multimedia-p... desktop-app SQLite offline-ai automation pyqt6

0

Updated Mar 23, 2026

0 0 0 0

Ha Duy Long / INT1341 License Plate Detection

An AI-based computer vision project for automatic vehicle license plate detection and recognition using deep learning and OCR.

AI ocr yolo Python

0

Updated Mar 16, 2026

0 0 0 0

Crimson Tomato / ChanCaptcha

Solving 4chan captcha

ocr captcha 4chan computer vision firefox-exte...

0

Updated Mar 15, 2026

0 0 0 0

Tamal Ahmed / EasyOCR with FastAPI

AnalyzeWithOCR is a FastAPI-based backend service that downloads a PDF from a public URL, performs layout-aware text extraction and OCR, and returns structured, page-wise text output via a REST API.

Python fastapi ocr easyocr pdf

0

Updated Feb 18, 2026

0 0 0 0

jochre / jochre-alto-editor

Graphical browser-based Alto4 editor, for the construction of OCR training corpora.

ocr

4

Updated Jan 26, 2026

4 1 0 0

Jones Johnsson / cloudboys-portfolio

Cloud-native data engineering + ML POC: ingest Reddit images, run OCR, store results in BigQuery/Cloud Storage, and serve analytics via FastAPI + a React dashboard.

Data Enginee... gcp fastapi ocr Python React Docker Git devops Markdown

0

Updated Jan 25, 2026

0 0 0 0

YoshiRulz / RetrOCR

(Design WIP) External tool that adds a transcription (OCR) workflow to the EmuHawk (BizHawk) emulator, allowing retro games to be translated partially or fully automatically.

BizHawk ocr transcription translation C# .NET

0

Updated Jan 14, 2026

0 0 0 0

Pulga / SRS Platform

Event-driven system built on Kafka that transforms unstructured documents into complete software specifications. It extracts text with OCR, performs NER with transformers, classifies sentences, and generates SRS documents in multiple formats.

ocr NLP kafka Python transformers

0

Updated Jan 05, 2026

0 0 0 0

jochre / corpora / jochre-yiddish-corpus

Jochre OCR training corpus for Yiddish in Alto4 format

ocr yiddish corpus

1

Updated Dec 10, 2025

1 1 0 5

Paul Caillé / RapidOCR - Traitement Articles Recherche

Processing of Italian newspaper articles in C++ (via RapidOCROnnx) as part of a research thesis in history. Categorization to come.

ocr

0

Updated Dec 08, 2025

0

jochre / jochre3-ocr

Jochre3 OCR engine with default implementation for Yiddish - completely new version of https://github.com/urieli/jochre

ocr yiddish

0

Updated Dec 07, 2025

0 1 1 1

Josh / UrT OCR

Process UrT gameplay to gather distance stats for Game Life Balance: https://game-life-balance.com

ocr opencv gaming data analysis

0

Updated Nov 15, 2025

0 0 0 0

Seeneva / Seeneva - smart comic book reader

A libre, smart comic book reader for Android.

❗Note: This is a mirror. Check GitHub repository.

Android ocr ML tts comic-reader seeneva

0

Updated Sep 21, 2025

0

El Sistema / Catalogación de Partituras DMP

Document Management Platform (DMP) for preserving the musical heritage of "El Sistema", using:
Papra DMP: metadata management.
Audiveris OMR: optical music recognition for sheet music.

opensource dms Papra ocr el-sistema audiveris

0

Updated Aug 10, 2025

0 0 0 0

hendarto kurniawan / ocr-layout-newspaper-yolov8

This project focuses on developing a prototype application for extracting headlines and content from digitized newspaper images stored in the SIDAK (Sistem Informasi Database Koleksi) system of the Monumen Pers Nasional, utilizing computer vision and deep learning techniques.

The prototype aims to overcome the limitations of standard OCR tools by integrating YOLOv8 object detection to precisely identify and separate newspaper headlines and article content before text extraction.

machine lear... yolov8 object detec... ocr

0

Updated Jul 15, 2025

0 0 0 0
