table_structure_recognition

Table detection and table structure recognition using Yolov5/Yolov8, and you can get the same (even better) result compared with Table Transformer (TATR) with smaller models.

Dataset

You can download PubTables-1M from Microsoft Research Open Data, and uncompress PubTables-1M-Image_Page_Detection_PASCAL_VOC.tar.gz and PubTables-1M-Image_Table_Structure_PASCAL_VOC.tar.gz to the directory PubTables-1M/. Or you can download PubTables-1M and FinTabNet.c using download_data.sh.

Then, you can run the 3 notebooks (voc2coco_detection.ipynb, voc2coco_structure.ipynb) and voc2coco_structure_fintabnet.ipynb to convert VOC format to COCO format.

Train Model (Yolov5)

You can clone latest Yolov5 code from https://github.com/ultralytics/yolov5 to the directory yolov5/, and run the 2 scripts to train table detection model (train_PubTables-1M_detection.sh) and table structure recognition model (train_PubTables-1M_structure.sh). You may need to change the variable path of the yaml files according to your environment in the directory yolov5/data/.

I have trained each model using yolov5s for 10 epochs, and you can use the models in the directory yolov5/runs/ for fast try or finetune from the checkpoints.

Train Model (Yolov8)

You can change to the directory yolov8/, and run the 2 scripts to train table detection model (train_PubTables-1M_detection.sh) and table structure recognition model (train_PubTables-1M_structure.sh). You may need to change the variable path of the yaml files according to your environment in the directory yolov8/data/.

I have trained each model using yolov8s for 10 epochs, and you can use the models in the directory yolov8/runs/detect/ for fast try or finetune from the checkpoints.

Use Model

You can run the notebook table_structure_recognition.ipynb to convert a table image to an excel file. Please pay attention to the ocr function, you should use all-in-one-ai, or PaddleOCR, or any OCR service to get the ocr result.

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
FinTabNet.c		FinTabNet.c
PubTables-1M		PubTables-1M
yolov5		yolov5
yolov8		yolov8
.gitignore		.gitignore
README.md		README.md
SimSong.ttc		SimSong.ttc
Using_Table_Transformer_for_table_detection_and_table_structure_recognition.ipynb		Using_Table_Transformer_for_table_detection_and_table_structure_recognition.ipynb
download_data.sh		download_data.sh
postprocess.py		postprocess.py
table_structure_recognition.ipynb		table_structure_recognition.ipynb
zh_val_0.jpg		zh_val_0.jpg

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

FinTabNet.c

FinTabNet.c

PubTables-1M

PubTables-1M

yolov5

yolov5

yolov8

yolov8

.gitignore

.gitignore

README.md

README.md

SimSong.ttc

SimSong.ttc

Using_Table_Transformer_for_table_detection_and_table_structure_recognition.ipynb

Using_Table_Transformer_for_table_detection_and_table_structure_recognition.ipynb

download_data.sh

download_data.sh

postprocess.py

postprocess.py