Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

some bengali alphabets are probably missing includes ৎ #47

Open
eedeidk opened this issue Apr 20, 2020 · 0 comments
Open

some bengali alphabets are probably missing includes ৎ #47

eedeidk opened this issue Apr 20, 2020 · 0 comments

Comments

@eedeidk
Copy link

eedeidk commented Apr 20, 2020

trn = Transliterator(source='ben', target='eng', decode='beamsearch')
eng = trn.transform(u'ৎ')

this returns ['', '', '', '', '']
eng = trn.transform(u'শরৎ')
this returns -> ['shar', 'sar', 'sha', 'ther', 'phor'] which should be sarat or sharat

also for this eng = trn.transform(u'চট্টোপাধ্যায়')
i get this return: ['chattopadya', 'chattopadhya', 'chattopadyay', 'chattopaya', 'chattopadhyay'] where the last one is to be used. How can I use that or set as preferred? Can you explain in detail about the training etc. I have gone through your blog, as I am no expert in this I could not find my way. Please help. And thanks for such a utility.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant