scrapy cache not used for splash requests #290

Open
orzel opened this issue Aug 29, 2021 · 0 comments

Comments


orzel commented Aug 29, 2021

Hello. I'm using Splash with scrapy-splash. It's quite slow, but it works. My HTTPCACHE_DIR is filled with a lot of data.
I have set HTTPCACHE_STORAGE = 'scrapy_splash.SplashAwareFSCacheStorage', as specified in the docs.
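
For reference, the cache-related part of a settings.py matching this description might look like the sketch below. Only the HTTPCACHE_STORAGE value comes from this report; HTTPCACHE_ENABLED and the HTTPCACHE_DIR value are assumptions (the directory name shown is Scrapy's default), and the rest of the scrapy-splash configuration is left out.

```python
# settings.py (sketch; only HTTPCACHE_STORAGE is taken from this report)

# Assumption: the HTTP cache is switched on (Scrapy disables it by default).
HTTPCACHE_ENABLED = True

# Assumption: Scrapy's default cache directory name, relative to the project data dir.
HTTPCACHE_DIR = "httpcache"

# From the report: the Splash-aware filesystem storage shipped with scrapy-splash.
HTTPCACHE_STORAGE = "scrapy_splash.SplashAwareFSCacheStorage"
```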

Nevertheless, if I restart the crawl, it's still very slow, and I can see in the Splash logs (docker logs) that all requests are made again.

So basically, my cache is filled but not used. How can I debug this?
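
One possible starting point, sketched here as an assumption rather than a confirmed diagnosis: Scrapy's HttpCacheMiddleware records counters such as httpcache/hit, httpcache/miss and httpcache/store in the crawl stats, so comparing them after a second run shows whether the cache is consulted at all. The helper name summarize_httpcache and the sample numbers are hypothetical.

```python
def summarize_httpcache(stats):
    """Summarize cache counters from a Scrapy stats dict.

    `stats` is a plain dict, e.g. what crawler.stats.get_stats() returns
    or the stats dump printed at the end of a crawl.
    """
    hits = stats.get("httpcache/hit", 0)
    misses = stats.get("httpcache/miss", 0)
    stores = stats.get("httpcache/store", 0)
    lookups = hits + misses
    # Avoid division by zero when the cache middleware recorded nothing.
    ratio = hits / lookups if lookups else 0.0
    return {"hit": hits, "miss": misses, "store": stores, "hit_ratio": ratio}


# Hypothetical numbers, for illustration only (not taken from a real crawl).
example = {"httpcache/hit": 0, "httpcache/miss": 120, "httpcache/store": 120}
print(summarize_httpcache(example))
```

If a second run still shows zero hits while entries keep being stored, the lookup key probably differs between runs; if none of the counters appear, the cache middleware may not be active for these requests.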
