Skip to content
This repository has been archived by the owner on Oct 13, 2021. It is now read-only.

File search example please #120

Open
suntong opened this issue Nov 15, 2020 · 3 comments
Open

File search example please #120

suntong opened this issue Nov 15, 2020 · 3 comments

Comments

@suntong
Copy link

suntong commented Nov 15, 2020

Description

All example that I saw are string-based searches. However, can riot somehow be used as/for a file-based search tool?

Basically it'll be just like grep, but using its persistent index to speed up the searches, while supporting 中文分词 at the same time.

The application scenario is that I have a huge collection of files in Chinese, thousands of them, thus I need something to search through them quickly, with the help of the pre-built indexes, as the content of the files will not be change (or very rarely), but more and more files are added daily. I haven't found any tools that does a good job in Chinese content search yet.

Is it possible? if so, sample code appreciated.

Thanks

@suntong
Copy link
Author

suntong commented Nov 15, 2020

Let's build a Full-Text Search engine
https://artem.krylysov.com/blog/2020/07/28/lets-build-a-full-text-search-engine/

This is the kind of tools that I'm talking about. However, because it is English based, the Inverted Index it builds is not capable of handling Chinese

@gedw99
Copy link

gedw99 commented Jan 24, 2021

Maybe bleve is better for the use case

@suntong
Copy link
Author

suntong commented Jan 24, 2021

Indeed, this is what I'm currently working on

https://github.com/suntong/doc-search

and I'm almost finished (lacks the Chinese search yet).

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants