Introduce sent indexing urls cache? #38

Open
D39-Dev opened this issue Mar 18, 2024 · 3 comments

Comments

@D39-Dev

D39-Dev commented Mar 18, 2024

Hi, thanks for your tool!
For large websites (5k+ pages), would it be possible to introduce a cache of the URLs that have already been sent for indexing?
For most of us it is fine to run the tool daily, so that all of a site's pages are eventually covered within the daily limit of 200 requests.
We just need to be sure that the tool does not resend pages it has already sent.
Thanks!

@AntoineKM
Contributor

Yes, I'd already thought of that. The problem is that there will be two schools of thought:

  • The first will want to re-check the URLs whose indexing has already been requested, because they may have been indexed in the meantime.
  • The second will want to cache the URLs whose indexing has been requested, so they are skipped on later runs.

My solution would be to not re-check any URL until indexing has been requested for all of them.
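
The approach described above could be sketched roughly as follows. This is only an illustration, not code from the tool: `SentCache`, `hasEntry`, and `selectBatch` are hypothetical names, the cache is assumed to be a plain URL-to-timestamp object, and the daily limit is passed in by the caller (the issue mentions 200 requests per day).

```typescript
// Hypothetical sketch: none of these names exist in the tool itself.
// The cache maps each URL to the ISO timestamp at which indexing was requested.
type SentCache = { [url: string]: string };

function hasEntry(cache: SentCache, url: string): boolean {
  return Object.prototype.hasOwnProperty.call(cache, url);
}

// Pick the URLs to request on this run and return the updated cache.
function selectBatch(
  allUrls: string[],
  cache: SentCache,
  dailyLimit: number,
  now: Date = new Date()
): { batch: string[]; cache: SentCache } {
  // Keep only cache entries for URLs that are still part of the site.
  const kept: SentCache = {};
  allUrls.forEach((url) => {
    if (hasEntry(cache, url)) kept[url] = cache[url];
  });

  // URLs that have not been requested yet in the current cycle.
  let pending = allUrls.filter((url) => !hasEntry(kept, url));

  // Every URL has been requested once: start a new cycle, so that
  // re-checking only begins after the whole site has been covered.
  const startNewCycle = pending.length === 0;
  const next: SentCache = startNewCycle ? {} : kept;
  if (startNewCycle) pending = allUrls.slice();

  // Respect the daily quota and record what is being sent.
  const batch = pending.slice(0, dailyLimit);
  const stamp = now.toISOString();
  batch.forEach((url) => {
    next[url] = stamp;
  });

  return { batch, cache: next };
}
```

Between daily runs the returned cache would need to be persisted somewhere, for example as a JSON file next to the site's configuration. Where it should actually live is a design choice this sketch leaves open.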

@goenning
Owner

The use of getPublishMetadata already prevents sending the same URL twice.

@AntoineKM
Contributor

> The usage of getPublishMetadata already prevents sending the same URL twice

Yes, but the problem is that there's a rate limit here too 🥲

3 participants