Multilingual Text Inversion Detection of Scanned Images

Text Localization, Image Inversion Detection of Scanned Documents & Language Identification based on Shape Context and CV

Research Paper: Multilingual Text Inversion Detection using Shape Context, presented in the IEEE TENSYMP 2021 Conference held at Grand Hyatt Jeju, the Republic of Korea on 23-25th Aug 2021.

Paper Link: https://ieeexplore.ieee.org/document/9550858

Paper Presentation https://youtu.be/zm9uaxdWMOA

Problem Definition

There can be problems in textual scanned images. The problem of inversion is one of the hardest anomaly to detect efficiently though it can be easily decipherable visually. Moreover, the scanned document can be in any language and the text can be anywhere in the image.

In this project, an algorithm to efficiently localize text has been implemented. Once the text area in an image is localized, it is passed on to language identification algorithm. Further, a mathematical descriptor is used to identify the text is inverted or not. The entire pipeline uses traditional methods in place of deep learning based methods and hence much more efficient.

How to run:

docker pull karthik199712/computer_vision:cv

To execute the pipeline with default image:

sudo docker run -it karthik199712/computer_vision:cv main.py

To execute the pipeline with an image in the dataset, give the image path and name after --image flag.

For instance, to execute with 11.png input image, command is as below:

sudo docker run -it karthik199712/computer_vision:cv main.py --image ./data/11.png

The deep learning implementation for comparison is available in VGG16_INFERENCE_BASE.ipynb.

You can find some output examples as below.

Upright English Image

sudo docker run -it karthik199712/computer_vision:cv main.py --image ./data/0.png

Inverted English Image

sudo docker run -it karthik199712/computer_vision:cv main.py --image ./data/1.png

Upright Malayalam Image

sudo docker run -it karthik199712/computer_vision:cv main.py --image ./data/mal1.png

Inverted Malayalam Image

sudo docker run -it karthik199712/computer_vision:cv main.py --image ./data/mal2.png

Upright Greek Image

sudo docker run -it karthik199712/computer_vision:cv main.py --image ./data/greek1.png

Inverted Greek Image

sudo docker run -it karthik199712/computer_vision:cv main.py --image ./data/greek2.png

Co-working Credits: Karthik K

Name		Name	Last commit message	Last commit date
Latest commit History 24 Commits
base		base
data		data
ContourBbox.py		ContourBbox.py
ContourBbox.pyc		ContourBbox.pyc
Inversion Detection using Computer Vision.ipynb		Inversion Detection using Computer Vision.ipynb
Inverted_or_upright.py		Inverted_or_upright.py
Inverted_or_upright.pyc		Inverted_or_upright.pyc
LICENSE		LICENSE
README.md		README.md
VGG16_INFERENCE_BASE.ipynb		VGG16_INFERENCE_BASE.ipynb
curve_fit.py		curve_fit.py
curve_fit.pyc		curve_fit.pyc
dilationconstant.py		dilationconstant.py
dilationconstant.pyc		dilationconstant.pyc
generated.json		generated.json
greek_inverted.png		greek_inverted.png
greek_upright.png		greek_upright.png
inverted.png		inverted.png
main.py		main.py
mal_inverted.png		mal_inverted.png
mal_upright.png		mal_upright.png
shape_context.py		shape_context.py
shape_context.pyc		shape_context.pyc
upright.png		upright.png

License

AdroitAnandAI/Multilingual-Text-Inversion-Detection-of-Scanned-Images

Folders and files

Latest commit

History

Repository files navigation

Multilingual Text Inversion Detection of Scanned Images

Text Localization, Image Inversion Detection of Scanned Documents & Language Identification based on Shape Context and CV

Problem Definition

How to run:

Upright English Image

Inverted English Image

Upright Malayalam Image

Inverted Malayalam Image

Upright Greek Image

Inverted Greek Image

About

Topics

Resources

License

Stars

Watchers

Forks

Languages