Does Stork support CJK languages? #294

Answered by jameslittle230
Jieiku asked this question in Q&A

I'd love for Stork to support CJK languages. Unfortunately, I think there are a few areas where it falls short today.

I discussed this previously with another user, @YikSanChan, in issue #191. My understanding is that there are two main hurdles. The first, as you mentioned, is the list of stopwords for each language, which would be easy enough to procure. The second, and likely more complicated, is that Stork currently assumes text is made up of words separated by spaces. That's not the case in Chinese (and perhaps Japanese and Korean as well?), so searching Chinese text is blocked until that assumption is unwound.
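To make the second hurdle concrete, here is a rough sketch of what goes wrong when text is split on spaces. It is not Stork's code: `whitespace_tokens` and `char_bigrams` are hypothetical helper names, and overlapping character bigrams are just one commonly used fallback for text without spaces, not necessarily what was proposed in issue #191.

```python
def whitespace_tokens(text: str) -> list[str]:
    """Split on whitespace: the 'words separated by spaces' assumption."""
    return text.split()


def char_bigrams(text: str) -> list[str]:
    """Overlapping two-character windows (an assumed fallback, not Stork's behavior)."""
    chars = [c for c in text if not c.isspace()]
    if len(chars) < 2:
        # Too short to form a pair: return the single character, or nothing.
        return ["".join(chars)] if chars else []
    return ["".join(chars[i:i + 2]) for i in range(len(chars) - 1)]


english = "search engines index words"
chinese = "搜索引擎索引词语"  # illustrative sample sentence with no spaces

print(whitespace_tokens(english))  # four separate words
print(whitespace_tokens(chinese))  # the whole sentence comes back as one token
print(char_bigrams(chinese))       # seven overlapping two-character tokens
```

With whitespace splitting, the Chinese sentence becomes a single index entry, so a query for a word inside it would only match if the index also supported substring lookups. Bigrams avoid that without a dictionary, at the cost of a larger index and some spurious matches; a real word segmenter would be more precise.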

@YikSanChan mentioned that there are alg…

This discussion was converted from issue #293 on May 05, 2022 00:50.