BA-thesis in history.
-
Updated
Jul 13, 2017 - Python
BA-thesis in history.
This repository presents the code of the paper titled "Scribble Based Interactive Page Layout Segmentation Using Gabor Filter" published in ICFHR2016.
A more complete example of programming with PDFMiner, which continues where the default documentation stops
Proof of concept of training a simple Region Classifier using PdfPig and ML.NET (LightGBM). The objective is to classify each text block in a pdf document page as either title, text, list, table and image.
This library builds a graph-representation of the content of PDFs. The graph is then clustered, resulting page segments are classified and returned. Tables are retrieved formatted as a CSV.
OCR-D wrapper for page-xml-draw
A powerful CLI tool for visualization and encoding of PAGE-XML files
OCR-D compliant toolset for optical layout recognition on historical german-language documents published in Brazil
An Open Dataset for the Recognition and Analysis of Scripts in Arabic Maghrebi (ICDAR 2021)
A Large Dataset of Historical Japanese Documents with Complex Layouts
A Python + C implementation for image-based PDF page layout analysis and content extraction.
Trained Detectron2 object detection models for document layout analysis based on PubLayNet dataset
Trained Detectron2 object detection models for document layout analysis based on PubLayNet dataset
利用java-yolov8实现版面检测(Chinese layout detection),java-yolov8 is used to detect the layout of Chinese document images
A web application for PDF content and table extraction, featuring image-based visual layout analysis, indexed document search, batch processing and extraction result annotation.
Doc2Graph transforms documents into graphs and exploit a GNN to solve several tasks.
Document Layout Analysis resources repos for development with PdfPig.
[ICDAR 2023] SelfDocSeg: A self-supervised vision-based approach towards Document Segmentation (Oral)
An official implementation of paper "Paragraph2Graph: A Language-independent GNN-based framework for layout analysis"
Add a description, image, and links to the layout-analysis topic page so that developers can more easily learn about it.
To associate your repository with the layout-analysis topic, visit your repo's landing page and select "manage topics."