Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Training model on japanese dataset #69

Open
Preethse opened this issue Jan 26, 2023 · 4 comments
Open

Training model on japanese dataset #69

Preethse opened this issue Jan 26, 2023 · 4 comments

Comments

@Preethse
Copy link

I created my own dataset of 14M images and started training on it. Weirdly the results are like this after first epoch

Screen Shot 2023-01-26 at 15 30 09

Really would appreciate feedbacks to improve my training.

@Chenxx017
Copy link

Hi, can this model be used to identify languages, such as Chinese, Japanese, etc.

@lerndeep
Copy link

https://github.com/bharatsubedi/PARseq_torch check in this repo you just update the character set of language you need you will able to train network.

@baudm
Copy link
Owner

baudm commented Mar 28, 2023

@Preethse try early stopping. Your training is collapsing after a while. You could also try tuning the learning rates. The training dynamics drastically changes if you change the training data, so the default hyperpameters might not work well for you.

Also, for experimentation, use a bigger model, like the base Transformer configuration (d=768).

@baudm
Copy link
Owner

baudm commented Mar 28, 2023

@Chenxx017 Sorry, but no. That's outside the scope of this project.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants