A Lightweight Visual Font Style Recognition with Quantized Convolutional Autoencoder

Font style recognition plays a vital role in computer vision, particularly in document analysis, pattern analysis, and image processing. In industry, it matters to professionals such as graphic designers, front-end developers, and UI/UX developers. Font style recognition using computer vision has made significant progress in recent years, especially for English, but very little work has addressed other languages. Moreover, existing models are computationally costly, time-consuming, and lack diversity. In this work, we propose a state-of-the-art model that recognizes Bangla fonts from images using a quantized Convolutional Autoencoder (Q-CAE). The compressed model occupies only about 58 KB of memory, which makes it suitable for low-end edge devices as well as high-end ones. Because no dedicated dataset is publicly available, we also created a synthetic dataset of 60,000 images covering 10 distinct Bangla font styles. Experiments show that the proposed method outperforms existing methods, achieving an overall accuracy of 99.95% without quantization and 99.85% after quantization.
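
To give a rough sense of why quantization shrinks a model, the sketch below applies a generic 8-bit affine quantization to a handful of made-up weights and compares the storage size with 32-bit floats. This README does not describe the exact quantization scheme used in the paper, so the function names (`quantize_uint8`, `dequantize`), the sample weights, and the scheme itself are assumptions for illustration only, not the authors' implementation.

```python
import struct


def quantize_uint8(weights):
    """Map a list of floats to 8-bit integers (illustrative affine scheme)."""
    w_min, w_max = min(weights), max(weights)
    # Guard against a constant tensor, where the value range would be zero.
    scale = (w_max - w_min) / 255.0 or 1.0
    zero_point = round(-w_min / scale)
    # Round each weight to the nearest step and clamp to the uint8 range.
    q = [max(0, min(255, round(w / scale) + zero_point)) for w in weights]
    return bytes(q), scale, zero_point


def dequantize(q_bytes, scale, zero_point):
    """Recover approximate float values from the 8-bit representation."""
    return [(q - zero_point) * scale for q in q_bytes]


# Hypothetical weights, not taken from the trained model.
weights = [-0.52, -0.1, 0.0, 0.27, 0.9]

q, scale, zero_point = quantize_uint8(weights)
restored = dequantize(q, scale, zero_point)

float32_size = len(struct.pack(f"{len(weights)}f", *weights))  # 4 bytes each
int8_size = len(q)                                             # 1 byte each
max_error = max(abs(a - b) for a, b in zip(weights, restored))

print(f"float32: {float32_size} bytes, uint8: {int8_size} bytes")
print(f"largest round-trip error: {max_error:.5f} (step size {scale:.5f})")
```

Each weight drops from four bytes to one at the cost of a small rounding error, which is consistent in spirit with the small accuracy gap reported above between the unquantized and quantized models.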

The work has been published in the IEEE Open Journal of the Computer Society, and the full article is publicly accessible on IEEE Xplore.

Dataset

The experimental data is publicly available on Mendeley Data.

Sample data instances are available in the sample_data folder of this repository.

Citation

@ARTICLE {10475431,
author={Tonmoy, Moshiur Rahman and Rakib, Abdul Fattah and Rahman, Rashik and Adnan, Md Akhtaruzzaman and Mridha, MF and Huang, Jie and Shin, Jungpil},
journal = {IEEE Open Journal of the Computer Society},
title = {A Lightweight Visual Font Style Recognition With Quantized Convolutional Autoencoder},
year = {2024},
volume = {5},
number = {01},
issn = {2644-1268},
pages = {120-130},
doi = {10.1109/OJCS.2024.3378709},
publisher = {IEEE Computer Society},
address = {Los Alamitos, CA, USA},
month = {Jan}
}
