Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

竖排繁体识别差,以及识别去格式问题 #63

Open
sezhai opened this issue Nov 18, 2023 · 2 comments
Open

竖排繁体识别差,以及识别去格式问题 #63

sezhai opened this issue Nov 18, 2023 · 2 comments

Comments

@sezhai
Copy link

sezhai commented Nov 18, 2023

竖排繁体识别非常差,基本是无法用的。
还有,希望增加“合并文本”的功能,就是去除所有空格与段落,方便直接粘贴使用。

@hiroi-sora
Copy link
Owner

Paddle插件中,繁体中文(v2) 的竖排识别能力是比v3要好的,可以试下。(不过,由于训练量的制约,竖排性能还是没有横排好。)

image

关于“合并文本”,是希望让OCR每次识别出的文本,都合并为单一行么

@954224685
Copy link

Paddle插件中,繁体中文(v2) 的竖排识别能力是比v3要好的,可以试下。(不过,由于训练量的制约,竖排性能还是没有横排好。)

image

关于“合并文本”,是希望让OCR每次识别出的文本,都合并为单一行么

你好,请看一下最新的问题

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants