
Sometimes predicts long, redundant, repetitive characters, like 'parseqqqqqqqqqqqq' (ground truth is 'parseq') #56

Open
WenjunLiu6146 opened this issue Dec 8, 2022 · 5 comments

Comments

@WenjunLiu6146

Is there any chance you know the reason? Thank you for your talented work!

@baudm
Owner

baudm commented Dec 8, 2022

Hello, please provide more details such as the model and weights used as well as the exact image you're using.

@WenjunLiu6146
Author

> Hello, please provide more details such as the model and weights used as well as the exact image you're using.

I used PARSeq trained on a Chinese dataset, which contains about 6K characters, and used it for inference on the test dataset.

@baudm
Owner

baudm commented Dec 9, 2022

Sorry, but I can't help you, since:

  1. I have no access to and am not familiar with the specific model you're referring to.
  2. I have no access to and am not familiar with the data you're using.
  3. PARSeq was developed and tested with Latin characters, primarily on English text. I am not familiar with the intricacies of Chinese text.

You might want to try increasing the number of decoder layers, or using a larger version of the model since the Chinese charset is much bigger than the Latin one.
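
For concreteness, here is a rough sketch of what a deeper/wider configuration could look like. The constructor path and argument names below are assumed from strhub/models/parseq/system.py and may not match your checkout, so treat this as an illustration, not a recipe:

```python
# Rough sketch only: a wider/deeper PARSeq for a ~6K-character charset.
# Constructor path and argument names assumed from strhub/models/parseq/system.py.
from strhub.models.parseq.system import PARSeq

# Hypothetical file holding the ~6K-character Chinese charset as one string.
charset = open('chinese_charset.txt', encoding='utf-8').read().strip()

model = PARSeq(
    charset_train=charset,
    charset_test=charset,
    max_label_length=25,
    batch_size=384,
    lr=7e-4,
    warmup_pct=0.075,
    weight_decay=0.0,
    img_size=(32, 128),
    patch_size=(4, 8),
    embed_dim=512,        # wider than the default 384
    enc_num_heads=8,
    enc_mlp_ratio=4,
    enc_depth=12,
    dec_num_heads=16,
    dec_mlp_ratio=4,
    dec_depth=2,          # default is 1; extra decoder capacity for the larger charset
    perm_num=6,
    perm_forward=True,
    perm_mirrored=True,
    decode_ar=True,
    refine_iters=1,
    dropout=0.1,
)
```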

@ceyxasm

ceyxasm commented Jul 6, 2023

So I tinkered a lot, and this is perhaps due to 'label_length' in main.yaml and the image size you are giving to the model.
In my case, with the model on Hugging Face, if you input an image with a single word, parseq, followed by whitespace equivalent to 30-35 characters in total, the result is correct.

However, if we exceed this and input an image whose length is, let's say, beyond 40 characters, redundant repetition of characters is seen.

[image] gave me gatery.comminFreedom

[image] gave me gateway.................

[image] gave me gateway.
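
For anyone who wants to reproduce this, the inference snippet from the repo's README can be used as-is; only the image path below is a placeholder:

```python
import torch
from PIL import Image
from strhub.data.module import SceneTextDataModule

# Load the pretrained model and its matching image transform (per the README demo).
parseq = torch.hub.load('baudm/parseq', 'parseq', pretrained=True).eval()
img_transform = SceneTextDataModule.get_transform(parseq.hparams.img_size)

# 'gateway_wide.png' is hypothetical: one word padded with lots of whitespace.
img = img_transform(Image.open('gateway_wide.png').convert('RGB')).unsqueeze(0)

logits = parseq(img)  # shape: (1, max_label_length + 1, len(charset) + 1)
pred = logits.softmax(-1)
label, confidence = parseq.tokenizer.decode(pred)
print(label[0])
```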

It is probably due to the fact that the model was trained on 1-word images and will hallucinate for longer labels.
I trained a model with label-length set to 65 and it was able to overcome this problem.
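
For context, the repetition seems to line up with the maximum label length baked into the checkpoint. A minimal sketch to inspect it, assuming the hyperparameter is named max_label_length as in the repo's configs (the key may differ across versions):

```python
import torch

# Load the released weights and inspect the label-length cap stored in hparams.
parseq = torch.hub.load('baudm/parseq', 'parseq', pretrained=True).eval()
print(parseq.hparams.max_label_length)

# Retraining with a longer cap would be a Hydra override along these lines
# (key name assumed from configs/main.yaml):
#   ./train.py model.max_label_length=65
```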

@WenjunLiu6146
Author

Thanks. I'll try your solution.
