Skip to content

abcpp12383/ThreeStageBinarization

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

75 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Three-stage binarization of color document images based on discrete wavelet transform and generative adversarial networks

Three-stage binarization of color document images based on discrete wavelet transform and generative adversarial networks

PWC PWC PWC PWC

Stage-1 Flowchart

Stage-2 Flowchart

Stage-3 Flowchart

Citation

If you find our paper useful in your research, please consider citing:

Conference version (accepted by PRICAI 2023)

@inproceedings{ju2023ccdwt,
  title={CCDWT-GAN: Generative Adversarial Networks Based on Color Channel Using Discrete Wavelet Transform for Document Image Binarization},
  author={Ju, Rui-Yang and Lin, Yu-Shian and Chiang, Jen-Shiun and Chen, Chih-Chia and Chen, Wei-Han and Chien, Chun-Tse},
  booktitle={Pacific Rim International Conference on Artificial Intelligence},
  pages={186--198},
  year={2023},
  organization={Springer}
}

Journal version (under review):

@article{ju2022three,
  title={Three-stage binarization of color document images based on discrete wavelet transform and generative adversarial networks},
  author={Ju, Rui-Yang and Lin, Yu-Shian and Jin, Yanlin and Chen, Chih-Chia and Chien, Chun-Tse and Chiang, Jen-Shiun},
  journal={arXiv preprint arXiv:2211.16098},
  year={2022}
}

Requirements

  • Linux (Ubuntu)
  • Python >= 3.6 (Pytorch)
  • NVIDIA GPU + CUDA CuDNN

Installation

    pip install segmentation-models-pytorch
    pip install pytesseract
  • Download tesseract data

    For Conda users, you can create a new Conda environment using conda env create -f environment.yaml

Dataset

You can download the dataset used in this experiment from Dropbox.

Usage

  • Preprocess

      python ./Base/image_to_224.py
      python ./Base/image_to_512.py
    
  • Train the model

    • Stage2
      python train_stage2.py
    
    • Before train left part of Stage3
      python predict_for_stage3.py
    
    • left part of Stage3 (need train predict_for_stage3.py first)
      python train_stage3.py
    
    • right part of Stage3 (independent training)
      python train_stage3_resize.py
    
  • Evaluation the model

      python3 eval_stage3_all.py
    

About

Three-stage binarization of color document images based on discrete wavelet transform and generative adversarial networks

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages