CRNN (CNN+RNN)

CRNN is a network that combines CNN and RNN to process images containing sequence information such as letters.

https://arxiv.org/pdf/1507.05717.pdf

It is mainly used for OCR and has the following advantages.

End-to-end learning is possible.
Sequences of arbitrary length can be processed, because the LSTM does not fix the length of its input or output sequence.
There is no need for a detector or cropping technique to find each character one by one.

You can use CRNN for OCR, license plate recognition, text recognition, and so on. What it recognizes depends on the data you train it on.

I used a slightly modified version of the original CRNN model (input size: 100x30 -> 128x64, plus more CNN layers).

Network

Convolutional Layer

Extracts features through CNN layers (e.g. VGGNet, ResNet).

Recurrent Layer

Splits the feature map into fixed-size slices and feeds them, as a sequence, into a bidirectional LSTM or GRU.
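
The slicing step above can be sketched in plain Python. This is only an illustration, not the code in Model.py: the function name `map_to_sequence` and the toy feature map are assumptions, and the real model does this with tensor reshapes rather than nested lists. Each column of the feature map becomes one time step for the recurrent layer.

```python
def map_to_sequence(feature_map):
    """Turn an H x W x C feature map (nested lists) into W vectors of length H*C."""
    height = len(feature_map)
    width = len(feature_map[0])
    sequence = []
    for x in range(width):
        column = []
        for y in range(height):
            # Concatenate the channel values of every row in this column.
            column.extend(feature_map[y][x])
        sequence.append(column)
    return sequence


# Toy feature map (hypothetical values): height 2, width 3, 2 channels.
toy = [
    [[1, 2], [3, 4], [5, 6]],
    [[7, 8], [9, 10], [11, 12]],
]
frames = map_to_sequence(toy)
print(len(frames), len(frames[0]))
```

The number of time steps equals the width of the feature map, which is why a wider input image yields a longer sequence for the LSTM or GRU.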

Transcription Layer

Converts the per-frame predictions into a label sequence using CTC (Connectionist Temporal Classification).
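
As a rough illustration of how per-frame predictions become a label, here is a minimal sketch of greedy (best-path) CTC decoding: take the best class per frame, merge consecutive repeats, then drop the blank symbol. The function name, the digit-only alphabet, and the choice of the last index as blank are assumptions made for this example; Prediction.py may decode differently.

```python
def ctc_greedy_decode(frame_scores, alphabet, blank_index):
    """Best-path CTC decoding: argmax per frame, merge repeats, drop blanks."""
    best_path = [
        max(range(len(scores)), key=scores.__getitem__)
        for scores in frame_scores
    ]
    decoded = []
    previous = None
    for index in best_path:
        # Emit a character only when the class changes and is not the blank.
        if index != previous and index != blank_index:
            decoded.append(alphabet[index])
        previous = index
    return "".join(decoded)


def one_hot(index, size):
    """Helper for the toy example: a score vector with one winning class."""
    return [1.0 if i == index else 0.0 for i in range(size)]


ALPHABET = "0123456789"   # assumed character set for this example
BLANK = len(ALPHABET)     # assumed: the blank is the last class

# Frame-wise classes: 1, 1, blank, 1, 2, 2, blank, 3
path = [1, 1, BLANK, 1, 2, 2, BLANK, 3]
scores = [one_hot(i, len(ALPHABET) + 1) for i in path]
print(ctc_greedy_decode(scores, ALPHABET, BLANK))
```

The blank between the two runs of class 1 is what lets the decoder output a doubled character; without it the repeats would be merged into one.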

License plate recognition using CRNN

I used CRNN to recognize license plates in Korea.

I trained the model on the following kinds of Korean license plates.

I updated the synthetic Korean license plate image generator for those who lack license plate pictures.

Result

CRNN works well for license plate recognition as follows.

How to train

First, you need a lot of cropped license plate images.
In my case, I encoded the license plate number in the image file name
(for example, the plate number 1234 is stored as "1234.jpg").
You can also define the labels in a txt or csv file if you prefer (e.g. 0001.jpg "1234" \n 0002.jpg "0000" ...).
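
Both labeling schemes can be read with a few lines of Python. This is a sketch under assumptions: the helper names are hypothetical, and the label-file layout (file name, whitespace, quoted label) is taken only from the example above, not from Image_Generator.py.

```python
import os


def label_from_filename(path):
    """Use the file name without its extension as the label."""
    return os.path.splitext(os.path.basename(path))[0]


def parse_label_line(line):
    """Parse one line of a label file such as: 0001.jpg "1234"."""
    filename, label = line.strip().split(maxsplit=1)
    return filename, label.strip('"')


print(label_from_filename("DB/train/1234.jpg"))
print(parse_label_line('0001.jpg "1234"\n'))
```

Encoding the label in the file name avoids a separate annotation file, but it means two images of the same plate cannot share a directory unless the names are made unique some other way.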

Since I used Korean license plates, I wrote the Korean characters on each plate using Latin letters.

(example) A18sk6897
A : 서울
sk : 나
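
To make the encoding concrete, the sketch below turns such a label back into Korean text. It is an assumption-heavy illustration: the two lookup tables contain only the mappings shown above, the plate layout (one uppercase region letter, two digits, a lowercase syllable code, four digits) is inferred from this single example, and the names `REGIONS`, `SYLLABLES`, and `to_korean` are hypothetical.

```python
import re

# Only these two mappings come from the example above; a full table would
# need every region and syllable that appears in the training data.
REGIONS = {"A": "서울"}
SYLLABLES = {"sk": "나"}

# Layout inferred from the single example "A18sk6897" (an assumption).
PLATE_PATTERN = re.compile(r"([A-Z])(\d{2})([a-z]+)(\d{4})")


def to_korean(label):
    """Convert a Latin-letter plate label back to its Korean form."""
    match = PLATE_PATTERN.fullmatch(label)
    if match is None:
        raise ValueError("unexpected plate label: " + label)
    region, front, syllable, back = match.groups()
    # Unknown codes raise KeyError, since the tables here are incomplete.
    return REGIONS[region] + front + SYLLABLES[syllable] + back


print(to_korean("A18sk6897"))
```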

After creating training data in this way, put it in the 'DB/train' directory and run training.py.

File Description

OS : Ubuntu 16.04.4 LTS

GPU : GeForce GTX 1080 (8GB)

Python : 3.5.2

TensorFlow : 1.5.0

Keras : 2.1.3

CUDA, cuDNN : 9.0, 7.0

File	Description
Model.py	Network using CNN (VGG) + Bidirectional LSTM
Model_GRU.py	Network using CNN (VGG) + Bidirectional GRU
Image_Generator.py	Image batch generator for training
parameter.py	Parameters used in CRNN
training.py	CRNN training
Prediction.py	CRNN prediction

shishpalvishnoi/CRNN-Keras
