Make media pipeline storage more flexible #5991

Gallaecio · 2023-07-28T09:02:52Z

Looking at how people need to use subclassing or monkeypatching for some straightforward cases of media pipeline storage, such as configuring the target Google Storage Cloud project ID or configuring a new storage class for a different service, I think we need to look into making media pipeline storage configuration more flexible.

Specifically, I think we need to consider:

Expose FilesPipeline‘s STORE_SCHEMES as a setting (with a better name) that works also for images, rather than requiring subclassing to define new storage classes.
Make storage classes Scrapy components, that can define from_crawler or from_settings to configured themselves based on settings, instead of requiring users to subclass base media pipeline classes to set class-level values based on settings, which feels like monkeypatching.

The text was updated successfully, but these errors were encountered:

Gallaecio added the enhancement label Jul 28, 2023

Gallaecio mentioned this issue Jul 28, 2023

Add support for storing images in Azure scrapy-plugins/scrapy-feedexporter-azure-storage#4

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Make media pipeline storage more flexible #5991

Make media pipeline storage more flexible #5991

Gallaecio commented Jul 28, 2023

Make media pipeline storage more flexible #5991

Make media pipeline storage more flexible #5991

Comments

Gallaecio commented Jul 28, 2023