Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

How to read the full contents of a manga page? #64

Open
ShadowAimai opened this issue Apr 12, 2024 · 2 comments
Open

How to read the full contents of a manga page? #64

ShadowAimai opened this issue Apr 12, 2024 · 2 comments

Comments

@ShadowAimai
Copy link

The library does indeed work, but only for small sections of a manga page. But is there a way to process an entire page without having to do it manually? It was not clear to me how to do it when reading the documentation.

@HighLiuk
Copy link

HighLiuk commented May 15, 2024

@ShadowAimai as I explained in #26:

if you look at Mokuro, looks like it combines Comic Text Detector which is great to detect Japanese text and group them to bubbles, then for each bubble you use MangaOcr and voilà (at least in theory)

@HighLiuk
Copy link

@ShadowAimai this should be fine using Mokuro:

python3 -m pip install mokuro
from mokuro.manga_page_ocr import MangaPageOcr
from mokuro.utils import dump_json

if __name__ == '__main__':
    mpocr = MangaPageOcr()
    result = mpocr('path/to/image.jpg')
    dump_json(result, 'path/to/result.json')

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants