Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

使用CTC, 识别时不限制4个字符长度,识别率如何? #65

Open
ctzhang2008 opened this issue Mar 14, 2021 · 3 comments
Open

Comments

@ctzhang2008
Copy link

请问楼主,
使用CTC,在 识别时不限制4个字符长度情况下,即设定不定长识别,识别率如何?
或者说识别出4个字符下,再统计准确率。小于、大于4,均认为识别错误。

谢谢

@ypwhs
Copy link
Owner

ypwhs commented Mar 14, 2021

我这边用 CTC 训练的模型,识别 A4 纸打印的中文字没有问题,平均每行可以有 30~40 字。

@ctzhang2008
Copy link
Author

ctzhang2008 commented Mar 14, 2021

我这边用 CTC 训练的模型,识别 A4 纸打印的中文字没有问题,平均每行可以有 30~40 字。

请问OCR识别,训练集图像如何获取的? 是自己做的:文本--》生成图像---》图像切割成行---》训练?
另外平均每行可以有30~40字,input_length设成了多少?
我也想做做。

谢谢。

@ypwhs
Copy link
Owner

ypwhs commented Mar 14, 2021

请问OCR识别,训练集图像如何获取的?

公司数据,人工标注

另外平均每行可以有30~40字,input_length设成了多少?

256

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants