Training model on Japanese dataset #69

Preethse · 2023-01-26T06:33:11Z

I created my own dataset of 14M images and started training on it. Strangely, the results look like this after the first epoch.

I would really appreciate any feedback on how to improve my training.

Chenxx017 · 2023-02-03T06:08:48Z

Hi, can this model be used to identify languages, such as Chinese, Japanese, etc.?

lerndeep · 2023-03-27T05:45:35Z

Check this repo: https://github.com/bharatsubedi/PARseq_torch. You only need to update the character set for the language you need, and then you will be able to train the network.
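For anyone unsure what "updating the character set" involves, here is a minimal sketch of deriving one from your own ground-truth labels. The function name `build_charset`, the `min_count` parameter, and the sample labels are hypothetical and made up for illustration; where the resulting string goes depends on the repo's own configuration, which is not shown in this thread.

```python
from collections import Counter


def build_charset(labels, min_count=1):
    """Collect every non-whitespace character seen in the labels.

    Characters occurring fewer than `min_count` times are dropped, which
    can help filter out annotation typos in a large dataset.
    """
    counts = Counter(
        ch for label in labels for ch in label if not ch.isspace()
    )
    kept = [ch for ch, n in counts.items() if n >= min_count]
    # Sort by code point so the charset is stable between runs.
    return "".join(sorted(kept))


# Hypothetical labels standing in for a real Japanese dataset.
sample_labels = ["東京駅", "京都", "カフェ", "東口"]
charset = build_charset(sample_labels)
print(len(charset), charset)
```

With a dataset of millions of images, it is probably worth checking that the validation and test labels contain no characters missing from this set, since the model cannot predict a character it was never given.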

baudm · 2023-03-28T04:16:08Z

@Preethse, try early stopping. Your training is collapsing after a while. You could also try tuning the learning rates. The training dynamics change drastically when you change the training data, so the default hyperparameters might not work well for you.

Also, for experimentation, use a bigger model, like the base Transformer configuration (d=768).
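The early-stopping suggestion above can be sketched as a small framework-independent helper. This is only an illustration, not code from the project: the `EarlyStopping` class, its parameters, and the validation accuracies below are all hypothetical. If your training loop is built on PyTorch Lightning, its built-in `EarlyStopping` callback serves the same purpose.

```python
class EarlyStopping:
    """Stop training when a monitored validation metric stops improving."""

    def __init__(self, patience=3, min_delta=0.0, mode="max"):
        if mode not in ("min", "max"):
            raise ValueError("mode must be 'min' or 'max'")
        self.patience = patience      # epochs to wait without improvement
        self.min_delta = min_delta    # smallest change that counts as better
        self.mode = mode              # "max" for accuracy, "min" for loss
        self.best = None
        self.best_epoch = None
        self.bad_epochs = 0

    def step(self, value, epoch):
        """Record one validation result; return True if training should stop."""
        if self.best is None:
            improved = True
        elif self.mode == "max":
            improved = value > self.best + self.min_delta
        else:
            improved = value < self.best - self.min_delta

        if improved:
            self.best = value
            self.best_epoch = epoch
            self.bad_epochs = 0
        else:
            self.bad_epochs += 1
        return self.bad_epochs >= self.patience


# Made-up validation accuracies that peak and then collapse.
val_accuracy = [0.42, 0.61, 0.70, 0.68, 0.55, 0.31, 0.12]

stopper = EarlyStopping(patience=2, mode="max")
stopped_at = None
for epoch, acc in enumerate(val_accuracy):
    if stopper.step(acc, epoch):
        stopped_at = epoch
        break

print(f"stopped at epoch {stopped_at}, best epoch was {stopper.best_epoch}")
```

In practice you would save a checkpoint whenever the metric improves and restore the one from `best_epoch` after stopping, so the collapsed weights are discarded.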

baudm · 2023-03-28T04:17:00Z

@Chenxx017 Sorry, but no. That's outside the scope of this project.
