[Feature request] Store scraped data #99

Open
KaKi87 opened this issue Mar 21, 2023 · 2 comments

@KaKi87

KaKi87 commented Mar 21, 2023

Hello,

It would be nice if Quetre stored scraped data in a database so that it never needs to scrape the same page again.

Here's what happened to me today:

  1. I visited a Quetre page, which loaded successfully;
  2. I shared it with someone else;
  3. Hours later, the same page returned a 503 to the person I had shared it with.

Thanks

@zyachel
Owner

zyachel commented Mar 21, 2023

Well, there is an option to cache responses using Redis.
On my instance, I have set the cache expiry time to one hour. I could increase that time if you want.

Caching for infinite time, however, would blow up RAM usage. Furthermore, it would be akin to keeping stale answers, which is not optimal. So, I don't think it's a sound idea.

If you're worried about a particular answer not existing in the future, you can always use archive.org and save the answer forever.

As for instances being rate limited, I'm exploring ways to minimise it.

@KaKi87
Author

KaKi87 commented Mar 21, 2023

Why not cache on disk instead? If not with MySQL or Postgres, then with SQLite for example, which would easily allow something like a month of cache.

Thanks
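
To make the disk-cache suggestion concrete, here is a minimal sketch of what a SQLite-backed cache with a roughly one-month expiry could look like. It is not Quetre's code: the `DiskCache` class, the `pages` table, the `cache.db` filename, and the 30-day TTL are all assumptions chosen for illustration, and it is written in Python only because the standard library ships with `sqlite3`.

```python
import sqlite3
import time

# Assumed TTL: "something like a month", taken here as 30 days in seconds.
MONTH_SECONDS = 30 * 24 * 60 * 60


class DiskCache:
    """Hypothetical disk-backed cache for scraped pages, keyed by page path."""

    def __init__(self, path="cache.db", ttl=MONTH_SECONDS):
        self.ttl = ttl
        self.db = sqlite3.connect(path)
        self.db.execute(
            "CREATE TABLE IF NOT EXISTS pages ("
            "key TEXT PRIMARY KEY, body TEXT NOT NULL, stored_at REAL NOT NULL)"
        )
        self.db.commit()

    def get(self, key, now=None):
        """Return the cached body, or None if it is missing or expired."""
        now = time.time() if now is None else now
        row = self.db.execute(
            "SELECT body, stored_at FROM pages WHERE key = ?", (key,)
        ).fetchone()
        if row is None:
            return None
        body, stored_at = row
        if now - stored_at > self.ttl:
            # Entry is older than the TTL: drop it so stale answers are not served.
            self.db.execute("DELETE FROM pages WHERE key = ?", (key,))
            self.db.commit()
            return None
        return body

    def set(self, key, body, now=None):
        """Store or overwrite the body for a key, stamped with the current time."""
        now = time.time() if now is None else now
        self.db.execute(
            "INSERT OR REPLACE INTO pages (key, body, stored_at) VALUES (?, ?, ?)",
            (key, body, now),
        )
        self.db.commit()
```

A request handler would call `get` first and only scrape on a `None` result, then call `set` with the fresh response. Because entries live on disk rather than in RAM, a long TTL does not grow memory usage, although the staleness concern raised above still applies for as long as an entry is kept.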
