A Unified Toolkit for Deep Learning Based Document Image Analysis
-
Updated
Mar 7, 2024 - Python
A Unified Toolkit for Deep Learning Based Document Image Analysis
Read and extract text and other content from PDFs in C# (port of PDFBox)
An Open-Source Python3 tool for recognizing layouts, tables, math formulas (LaTeX), and text in images, converting them into Markdown format. A free alternative to Mathpix, empowering seamless conversion of visual content into text-based representations. 80+ languages are supported.
OCR engine for all the languages
Document Layout Analysis resources repos for development with PdfPig.
A toolbox of OCR models, algorithms, and pipelines based on MindSpore
Doc2Graph transforms documents into graphs and exploit a GNN to solve several tasks.
An official implementation of paper "Paragraph2Graph: A Language-independent GNN-based framework for layout analysis"
利用java-yolov8实现版面检测(Chinese layout detection),java-yolov8 is used to detect the layout of Chinese document images
Proof of concept of training a simple Region Classifier using PdfPig and ML.NET (LightGBM). The objective is to classify each text block in a pdf document page as either title, text, list, table and image.
Trained Detectron2 object detection models for document layout analysis based on PubLayNet dataset
A more complete example of programming with PDFMiner, which continues where the default documentation stops
A Large Dataset of Historical Japanese Documents with Complex Layouts
This library builds a graph-representation of the content of PDFs. The graph is then clustered, resulting page segments are classified and returned. Tables are retrieved formatted as a CSV.
A powerful CLI tool for visualization and encoding of PAGE-XML files
[ICDAR 2023] SelfDocSeg: A self-supervised vision-based approach towards Document Segmentation (Oral)
An Open Dataset for the Recognition and Analysis of Scripts in Arabic Maghrebi (ICDAR 2021)
Trained Detectron2 object detection models for document layout analysis based on PubLayNet dataset
YOLO models trained by DocLayNet - power your Document Intelligent by Layout Analysis
A web application for PDF content and table extraction, featuring image-based visual layout analysis, indexed document search, batch processing and extraction result annotation.
Add a description, image, and links to the layout-analysis topic page so that developers can more easily learn about it.
To associate your repository with the layout-analysis topic, visit your repo's landing page and select "manage topics."