layout-analysis
Here are 31 public repositories matching this topic...
OCR-D wrapper for page-xml-draw
-
Updated
May 1, 2021 - Python
PdfDet aims to simplify PDF layout detect tasks for users.
-
Updated
Mar 28, 2024 - Python
A python package to structure files using visual and style informations
-
Updated
Mar 9, 2024 - Python
HTR ground truth of the Chi-Know-Po project (Collex Persée)
-
Updated
Apr 16, 2024
BA-thesis in history.
-
Updated
Jul 13, 2017 - Python
Curating a dataset of British patents
-
Updated
May 24, 2024 - Jupyter Notebook
Nordrassil is a keyboard layout that provides an elegant and balanced typing experience by its use of a thumb-alpha, emphasis on middle fingers, and de-prioritisation of pinkies.
-
Updated
Feb 28, 2024
This repository presents the code of the paper titled "Scribble Based Interactive Page Layout Segmentation Using Gabor Filter" published in ICFHR2016.
-
Updated
Feb 8, 2019 - C++
OCR-D compliant toolset for optical layout recognition on historical german-language documents published in Brazil
-
Updated
Sep 24, 2021 - Python
A Python + C implementation for image-based PDF page layout analysis and content extraction.
-
Updated
Apr 13, 2023 - C++
An Open Dataset for the Recognition and Analysis of Scripts in Arabic Maghrebi (ICDAR 2021)
-
Updated
Feb 18, 2024
Trained Detectron2 object detection models for document layout analysis based on PubLayNet dataset
-
Updated
Apr 16, 2023 - Python
YOLO models trained by DocLayNet - power your Document Intelligent by Layout Analysis
-
Updated
May 23, 2024 - Python
A web application for PDF content and table extraction, featuring image-based visual layout analysis, indexed document search, batch processing and extraction result annotation.
-
Updated
May 17, 2023 - C++
A powerful CLI tool for visualization and encoding of PAGE-XML files
-
Updated
May 19, 2021 - Python
[ICDAR 2023] SelfDocSeg: A self-supervised vision-based approach towards Document Segmentation (Oral)
-
Updated
Oct 6, 2023 - Python
This library builds a graph-representation of the content of PDFs. The graph is then clustered, resulting page segments are classified and returned. Tables are retrieved formatted as a CSV.
-
Updated
Sep 11, 2020 - Python
A more complete example of programming with PDFMiner, which continues where the default documentation stops
-
Updated
Jul 24, 2019 - Python
A Large Dataset of Historical Japanese Documents with Complex Layouts
-
Updated
Jul 22, 2022 - Jupyter Notebook
Improve this page
Add a description, image, and links to the layout-analysis topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the layout-analysis topic, visit your repo's landing page and select "manage topics."