Should we update swe.training_text if new characters are added to desired_characters ? #9

aslamy · 2018-12-18T15:37:29Z

Recently I made a pull request to update the swedish desired_characters file with new characters.
Now I see swe.training_text does not contains all new added desired_characters.
Do we have to update swe.training_text and add thes new desired_characters, in order to to tesseract recognize them?

wrznr · 2019-04-16T15:39:26Z

Even if the source files (like training text and desired characters) are updated tesseract won't be able to recognize them without proper re-training. I am currently trying to find out how the training procedure for the stack models (i.e. those in the tessdata_* repos) works. Maybe the tesseract maintainers could elaborate on this...

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Should we update swe.training_text if new characters are added to desired_characters ? #9

Should we update swe.training_text if new characters are added to desired_characters ? #9

aslamy commented Dec 18, 2018

wrznr commented Apr 16, 2019

Should we update swe.training_text if new characters are added to desired_characters ? #9

Should we update swe.training_text if new characters are added to desired_characters ? #9

Comments

aslamy commented Dec 18, 2018

wrznr commented Apr 16, 2019