
Support keywords extraction for other current languages #22

Open
stephane-martin opened this issue Dec 19, 2018 · 1 comment
Labels
enhancement New feature or request

Comments

@stephane-martin

Hello,

Currently, it seems that the keyword extraction process uses a hard-coded list of English stop words. As a result, when archiving content in another language, the selected keywords are very often stop words of that language (I mainly archive content in French...).

Maybe the list of stop words could be selected dynamically, based on automatic language detection? (See https://github.com/Mimino666/langdetect for example.)
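
To make the suggestion concrete, here is a minimal sketch of per-language stop word selection. It is not the project's actual code: `STOP_WORDS`, `tokenize`, `guess_language`, and `extract_keywords` are hypothetical names, the stop word lists are tiny placeholders, and the built-in detector is a naive stand-in that only counts stop word hits.

```python
import re
from collections import Counter

# Tiny illustrative stop word lists (assumption: a real setup would load fuller lists).
STOP_WORDS = {
    "en": {"the", "a", "an", "and", "of", "to", "in", "is", "it", "that", "for", "on", "with"},
    "fr": {"le", "la", "les", "un", "une", "des", "et", "de", "du", "en", "est", "que", "pour", "dans", "sur"},
}


def tokenize(text):
    """Lowercase the text and split it into alphabetic tokens (accented letters included)."""
    return re.findall(r"[^\W\d_]+", text.lower())


def guess_language(text, default="en"):
    """Naive stand-in detector: pick the language whose stop words occur most often."""
    tokens = tokenize(text)
    scores = {lang: sum(t in words for t in tokens) for lang, words in STOP_WORDS.items()}
    best = max(scores, key=scores.get)
    return best if scores[best] > 0 else default


def extract_keywords(text, top_n=5, detect=guess_language):
    """Return the most frequent tokens that are not stop words in the detected language."""
    lang = detect(text)
    # Fall back to the English list when the detected language has no list.
    stop_words = STOP_WORDS.get(lang, STOP_WORDS["en"])
    counts = Counter(t for t in tokenize(text) if t not in stop_words and len(t) > 2)
    return [word for word, _ in counts.most_common(top_n)]
```

The `detect` parameter is there so that a real detector can be swapped in. For instance, assuming the langdetect package linked above is installed, its `detect` function (which returns codes such as `fr`) could be passed as `extract_keywords(text, detect=langdetect.detect)`.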

Thanks for the great product :)

@kanishka-linux
Owner

Yes, currently only English is supported. I'll try to look into supporting other languages as well.

@kanishka-linux kanishka-linux added the enhancement New feature or request label Dec 20, 2018