Handwritten Bangla Symbol Recognition with DenseNet

Version: 0.0.3  
Author : Md. Nazmuddoha Ansary
Python : 3.6.8

Symbol List

'অ','আ','ই','ঈ','উ','ঊ',  
'ঋ','এ','ঐ','ও','ঔ',  
'ক','খ','গ','ঘ','ঙ',  
'চ','ছ','জ','ঝ','ঞ',  
'ট','ঠ','ড','ঢ','ণ',  
'ত','থ','দ','ধ','ন',  
'প','ফ','ব','ভ','ম',  
'য','র','ল',  
'শ','ষ','স','হ',  
'ড়','ঢ়','য়',  
'ৎ','ং','ঃ','ঁ'  
'ঁ'

'ঁ' is not printable

DenseNet

The model is based on the original paper:Densely Connected Convolutional Networks

Authors and Researchers: Gao Huang ; Zhuang Liu ; Laurens van der Maaten ; Kilian Q. Weinberger

The paper introduces Dense Blocks within the traditional convolutional neural network architechture.
The composite layers can also contain bottoleneck layers

As compared to well established CNN models (like : FractNet or ResNet) DenseNet has:
* Less number of feature vector
* Low information bottoleneck
* Better Handling Of the vanishing gradient problem

Database:

CMATERdb

CMATERdb 3.1.2: Handwritten Bangla basic-character database

Data Sample

Established Results

From:Alom et. al. 2018

Version and Requirements

Keras==2.2.5  
numpy==1.16.4  
tensorflow==1.13.1

pip3 install -r requirements.txt

Colab and TPU(Tensor Processing Unit)

TPU’s have been recently added to the Google Colab portfolio making it even more attractive for quick-and-dirty machine learning projects when your own local processing units are just not fast enough. While the Tesla K80 available in Google Colab delivers respectable 1.87 TFlops and has 12GB RAM, the TPUv2 available from within Google Colab comes with a whopping 180 TFlops, give or take. It also comes with 64 GB High Bandwidth Memory (HBM). Visit This For More Info
For this model the approx time/epoch=24s

Test data Prediction Accuracy [F1 accuracy]: 98.56666666666666

Flask App Deployement

For Deployment of the Saved Model python-flask is used.

The deployment is very simple and to be honest can be more optimized

Segmentation (incomplete)

The final goal of the segmentation script is to separate:

Words From Lines
Symbols From Words For the goal of separation, Connected Components are mapped with pixel distribution after "skeletonization" and finding an optimal rotation for both skewness and separation.

Example Image:

Connected Components:

Segmented Words Example:

NOTE: See how the word "মনেরে" and "ভাল-মন্দ" are rotated for an optimal position with respect to a straight line or "মাত্রা" as we call it in "বাংলা" but the word "যাহাই" is left as it is because the skewness is completely by chance in the optimal rotation for separation.

Implemented DenseNet Model Architechture

The implemented model architechture can be found at /info/model.png

Loading the image may take time due to speed and size

Name		Name	Last commit message	Last commit date
Latest commit History 63 Commits
Flask-app		Flask-app
info		info
segmentation		segmentation
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
data.ipynb		data.ipynb
requirements.txt		requirements.txt
train.ipynb		train.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Flask-app

Flask-app

info

info

segmentation

segmentation

.gitignore

.gitignore

LICENSE

LICENSE

README.md

README.md

data.ipynb

data.ipynb

requirements.txt

requirements.txt

train.ipynb

train.ipynb

Repository files navigation

Handwritten Bangla Symbol Recognition with DenseNet

Symbol List

DenseNet

Database:

Data Sample

Established Results

Version and Requirements

Colab and TPU(Tensor Processing Unit)

Flask App Deployement

Segmentation (incomplete)

Example Image:

Connected Components:

Segmented Words Example:

Implemented DenseNet Model Architechture

About

Releases

Packages

Languages

License

mnansary/pyHOCR

Folders and files

Latest commit

History

Repository files navigation

Handwritten Bangla Symbol Recognition with DenseNet

Symbol List

DenseNet

Database:

Data Sample

Established Results

Version and Requirements

Colab and TPU(Tensor Processing Unit)

Flask App Deployement

Segmentation (incomplete)

Example Image:

Connected Components:

Segmented Words Example:

Implemented DenseNet Model Architechture

About

Topics

Resources

License

Stars

Watchers

Forks

Languages