Curating a dataset of British patents
Updated May 24, 2024 - Jupyter Notebook
OCR engine for all the languages
YOLO models trained on DocLayNet to power your document intelligence with layout analysis
Read and extract text and other content from PDFs in C# (port of PDFBox)
An Open-Source Python3 tool for recognizing layouts, tables, math formulas (LaTeX), and text in images, converting them into Markdown format. A free alternative to Mathpix, empowering seamless conversion of visual content into text-based representations. 80+ languages are supported.
A toolbox of OCR models, algorithms, and pipelines based on MindSpore
HTR ground truth of the Chi-Know-Po project (Collex Persée)
PdfDet aims to simplify PDF layout detection tasks for users.
A Python package to structure files using visual and style information
A Unified Toolkit for Deep Learning Based Document Image Analysis
Nordrassil is a keyboard layout that provides an elegant and balanced typing experience by its use of a thumb-alpha, emphasis on middle fingers, and de-prioritisation of pinkies.
Official implementation of the paper "Paragraph2Graph: A Language-independent GNN-based framework for layout analysis"
[ICDAR 2023] SelfDocSeg: A self-supervised vision-based approach towards Document Segmentation (Oral)
Document layout analysis resources for development with PdfPig.
Doc2Graph transforms documents into graphs and exploits a GNN to solve several tasks.
A web application for PDF content and table extraction, featuring image-based visual layout analysis, indexed document search, batch processing and extraction result annotation.
Layout detection for Chinese document images using java-yolov8
Trained Detectron2 object detection models for document layout analysis based on PubLayNet dataset