A Python-based REST API for PDF OCR using AI models with PyTorch and Transformers that runs in a Docker container.
-
Updated
May 8, 2024 - Python
A Python-based REST API for PDF OCR using AI models with PyTorch and Transformers that runs in a Docker container.
Collected Alfred Workflows & Proofs of Concept
Web interface for recognizing text, proofreading OCR, and creating fully-digitized documents.
A PHP wrapper for Tesseract-OCR binary
Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.
PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.
A community-supported supercharged version of paperless: scan, index and archive all your physical documents
ddddocr rust 版本,ocr_api_server rust 版本,二进制版本,验证码识别,不依赖 opencv 库,跨平台运行,a simple OCR API server, very easy to deploy。
OCR powered screen-capture tool to capture information instead of images
Sample code for the Datalogics Java interface of the Adobe PDF Library setup to build with Maven
Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices)
EasyOCR is basically Optical Character Reading package that belongs from PyTorch. Using this texts from the images can be extracted easily, documents, texts can be scanned. For License Plate's Number Recognition, it can be applicable easily as it can extract the texts. About License Plate's Number, there are several language's character plates a…
一个简洁优雅的词典翻译 macOS App。开箱即用,支持离线 OCR 识别,支持有道词典,🍎 苹果系统词典,🍎 苹果系统翻译,OpenAI,Gemini,DeepL,Google,Bing,腾讯,百度,阿里,小牛,彩云和火山翻译。A concise and elegant Dictionary and Translator macOS App for looking up words and translating text.
A privacy-first, self-hosted, fully open source personal knowledge management software, written in typescript and golang.
TutorAI is a RAG system capable of assisting with learning academic subjects and using the curriculum and citing it. The project revolves around building an application that ingests a textbook in most formats and facilitates efficient learning of the course material.
HuixiangDou: Overcoming Group Chat Scenarios with LLM-based Technical Assistance
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
Add a description, image, and links to the ocr topic page so that developers can more easily learn about it.
To associate your repository with the ocr topic, visit your repo's landing page and select "manage topics."