Releases: scrapy/scrapy
2.3.0
Highlights:
- Feed exports now support Google Cloud Storage as a storage backend
- The new FEED_EXPORT_BATCH_ITEM_COUNT setting allows delivering output items in batches of up to the specified number of items (see the settings sketch after this list). It also serves as a workaround for delayed file delivery, which causes Scrapy to start item delivery only after the crawl has finished when using certain storage backends (S3, FTP, and now GCS)
- The base implementation of item loaders has been moved into a separate library, itemloaders, allowing use from outside Scrapy and a separate release schedule
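A minimal settings sketch combining the two feed-export features above; the bucket name, project ID, output file name, and batch size are placeholders, not values from this release:

```python
# settings.py (sketch): batched feed delivery to a Google Cloud Storage bucket.
FEEDS = {
    # %(batch_id)d is replaced with the sequential number of each batch file.
    "gs://example-bucket/items-%(batch_id)d.json": {
        "format": "json",
    },
}
GCS_PROJECT_ID = "example-project"       # used by the gs:// storage backend
FEED_EXPORT_BATCH_ITEM_COUNT = 100       # start a new output file every 100 items
```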
2.2.1
The startproject command no longer makes unintended changes to the permissions of files in the destination folder, such as removing execution permissions.
2.2.0
Highlights:
- Python 3.5.2+ is required now
- dataclass objects and attrs objects are now valid item types
- New TextResponse.json method (see the sketch after this list)
- New bytes_received signal that allows canceling response download
- CookiesMiddleware fixes
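A short sketch of two of these features together: a dataclass used as an item type and TextResponse.json() parsing a JSON response. The URL and field names are hypothetical:

```python
from dataclasses import dataclass

import scrapy


@dataclass
class QuoteItem:
    text: str
    author: str


class QuotesApiSpider(scrapy.Spider):
    name = "quotes_api"
    # Hypothetical JSON API endpoint.
    start_urls = ["https://example.com/api/quotes"]

    def parse(self, response):
        # TextResponse.json() (new in 2.2) deserializes the JSON response body.
        for entry in response.json():
            # dataclass instances are now valid item types.
            yield QuoteItem(text=entry["text"], author=entry["author"])
```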
2.1.0
Highlights:
- New FEEDS setting to export to multiple feeds (see the settings sketch after this list)
- New Response.ip_address attribute
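A minimal sketch of the FEEDS setting exporting the same items to two feeds at once; the file names and formats are arbitrary examples:

```python
# settings.py (sketch): one crawl, two output feeds.
FEEDS = {
    "items.json": {"format": "json"},
    "items.csv": {"format": "csv"},
}
```

Inside a callback, the new attribute is available as response.ip_address.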
2.0.1
2.0.0
Highlights:
- Python 2 support has been removed
- Partial coroutine syntax support and experimental asyncio support
- New Response.follow_all method
- FTP support for media pipelines
- New Response.certificate attribute
- IPv6 support through DNS_RESOLVER
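A small spider sketch showing Response.follow_all and Response.certificate; the URL and CSS selectors are illustrative. Under the new partial coroutine support, a callback like this may also be declared with async def:

```python
import scrapy


class ListingSpider(scrapy.Spider):
    name = "listing"
    start_urls = ["https://example.com/listing"]  # placeholder URL

    def parse(self, response):
        # Response.certificate (new in 2.0) holds the server's TLS certificate
        # object, or None for plain-HTTP responses.
        self.logger.debug("TLS certificate: %r", response.certificate)
        for title in response.css("h2.title::text").getall():
            yield {"title": title}
        # follow_all() builds one follow-up request per link matched by the
        # selector, instead of calling response.follow() in a loop.
        yield from response.follow_all(css="a.next", callback=self.parse)
```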
1.7.4
1.7.3
1.7.2
1.7.0
Highlights:
- Improvements for crawls targeting multiple domains
- A cleaner way to pass arguments to callbacks
- A new class for JSON requests
- Improvements for rule-based spiders
- New features for feed exports
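A spider sketch touching two of the 1.7 highlights above: cb_kwargs for passing arguments to callbacks and the new JsonRequest class. URLs and field names are made up:

```python
import json

import scrapy
from scrapy.http import JsonRequest


class ExampleSpider(scrapy.Spider):
    name = "example"

    def start_requests(self):
        # cb_kwargs (new in 1.7) passes extra keyword arguments to the callback,
        # replacing the older pattern of stashing them in request.meta.
        yield scrapy.Request(
            "https://example.com/category/books",
            callback=self.parse_category,
            cb_kwargs={"category": "books"},
        )
        # JsonRequest serializes `data` as the JSON request body and sets the
        # appropriate Content-Type header.
        yield JsonRequest(
            "https://example.com/api/search",
            data={"query": "scrapy"},
            callback=self.parse_api,
        )

    def parse_category(self, response, category):
        for title in response.css("h2::text").getall():
            yield {"category": category, "title": title}

    def parse_api(self, response):
        yield {"results": json.loads(response.text)}
```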